ByteDance Releases Lance: 3B Unified Multimodal Model Matching 7B Benchmarks

One Model, All Modalities

ByteDance Research has released Lance, a lightweight unified multimodal model with 3 billion active parameters. Unlike siloed models that specialize in a single task, Lance handles image generation, video generation, image editing, video editing, image understanding, and video QA — all within a single architecture. It is available under the Apache 2.0 license.

Core Capabilities

Lance supports text-to-image (768×768), text-to-video (up to 121 frames at 480p), instruction-based image editing, frame-aware video editing, and visual question answering for both images and video. The model was fine-tuned from Qwen2.5-VL-3B-Instruct using a staged multi-task training recipe on 128 A100 GPUs.

Benchmark Performance

Despite its compact size, Lance delivers impressive results: DPG score of 84.67 (competitive with 7B models), GenEval 0.90, GEdit 7.30 (best-in-class among unified models), and VBench 85.11 (highest among tested models for video generation). These results challenge the assumption that unified models must sacrifice quality for versatility.

Availability

Model weights and inference scripts are available on GitHub (bytedance/Lance) and Hugging Face (bytedance-research/Lance). A minimum of 40GB VRAM is required. The release has generated significant interest in r/LocalLLaMA, where it received over 600 upvotes as a compelling option for locally-run multimodal tasks.

AI Reddit Mar 14, 2026 2 min read

r/singularity highlights Meituan's 8-step open-source image editor LongCat-Image-Edit-Turbo

r/singularity pointed to Meituan's LongCat-Image-Edit-Turbo, a distilled open-source image editor that claims high-quality results in just 8 NFEs. The release pairs an Apache 2.0 Hugging Face model with a public arXiv report and community scrutiny over benchmark framing.

#meituan #image-editing #open-source

AI X/Twitter Apr 25, 2026 2 min read

DeepSeek-V4 opens 1M context with 1.6T/49B and 284B/13B split

Why it matters: open models rarely arrive with both giant context claims and deployable model splits. DeepSeek put hard numbers on the release with a 1M-context design, a 1.6T/49B Pro model, and a 284B/13B Flash variant.

#deepseek #open-weights #llm

AI Hacker News Jun 14, 2026 1 min read

“Open source AI must win” resonates as model access becomes infrastructure risk

The short manifesto spread because it frames closed AI access as an operational dependency, not just a licensing preference.

#open-source #local-ai #ai-governance