V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties Paper • 2512.11799 • Published 4 days ago • 29
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder Paper • 2512.11749 • Published 4 days ago • 34
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published 5 days ago • 24
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory Paper • 2512.04519 • Published 13 days ago • 3
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos Paper • 2512.10881 • Published 5 days ago • 26
DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers Paper • 2503.14487 • Published Mar 18 • 28
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards Paper • 2512.00473 • Published 18 days ago • 24
PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling Paper • 2512.04784 • Published 15 days ago • 24
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper • 2512.05150 • Published 14 days ago • 71
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation Paper • 2512.04678 • Published 13 days ago • 39
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 13 days ago • 166
Light-X: Generative 4D Video Rendering with Camera and Illumination Control Paper • 2512.05115 • Published 12 days ago • 10
RELIC: Interactive Video World Model with Long-Horizon Memory Paper • 2512.04040 • Published 13 days ago • 23
OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published 14 days ago • 30
UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers Paper • 2512.04504 • Published 13 days ago • 15
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression Paper • 2512.05081 • Published 12 days ago • 30
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published 11 days ago • 36