Linear Scaling Video VLMs for Long Video Understanding Paper • 2605.31598 • Published 5 days ago • 10
FlowLong: Inference-time Long Video Generation via Manifold-constrained Tweedie Matching Paper • 2605.20910 • Published 14 days ago • 29
Echo-Forcing: A Scene Memory Framework for Interactive Long Video Generation Paper • 2605.16003 • Published 19 days ago • 5
Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video Paper • 2605.15182 • Published 20 days ago • 39
RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO Paper • 2605.15190 • Published 20 days ago • 13
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 20 days ago • 85
Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation Paper • 2605.15141 • Published 20 days ago • 93
TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking Paper • 2605.12587 • Published 22 days ago • 37
MuSS: A Large-Scale Dataset and Cinematic Narrative Benchmark for Multi-Shot Subject-to-Video Generation Paper • 2604.23789 • Published 25 days ago • 6
Continuous-Time Distribution Matching for Few-Step Diffusion Distillation Paper • 2605.06376 • Published 27 days ago • 26
Lightning Unified Video Editing via In-Context Sparse Attention Paper • 2605.04569 • Published 28 days ago • 18
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation Paper • 2605.03849 • Published 29 days ago • 126
Stream-T1: Test-Time Scaling for Streaming Video Generation Paper • 2605.04461 • Published 28 days ago • 105
Diffusion Templates: A Unified Plugin Framework for Controllable Diffusion Paper • 2604.24351 • Published Apr 27 • 11
VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects Paper • 2604.16272 • Published Apr 17 • 3
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published Apr 15 • 164
PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference Paper • 2603.25730 • Published Mar 26 • 53