Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models Paper • 2604.26951 • Published 4 days ago • 42
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 4 days ago • 89
Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding Paper • 2604.26779 • Published 4 days ago • 6
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora Paper • 2604.24819 • Published 6 days ago • 82
Meta-CoT: Enhancing Granularity and Generalization in Image Editing Paper • 2604.24625 • Published 6 days ago • 25
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 6 days ago • 66
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company Paper • 2604.22446 • Published 9 days ago • 116
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published 9 days ago • 63
ELT: Elastic Looped Transformers for Visual Generation Paper • 2604.09168 • Published 23 days ago • 20
ProEval: Proactive Failure Discovery and Efficient Performance Estimation for Generative AI Evaluation Paper • 2604.23099 • Published 8 days ago • 3
Stabilizing Efficient Reasoning with Step-Level Advantage Selection Paper • 2604.24003 • Published 6 days ago • 7
Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis Paper • 2604.24198 • Published 6 days ago • 19
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 6 days ago • 114
MedVR: Annotation-Free Medical Visual Reasoning via Agentic Reinforcement Learning Paper • 2604.08203 • Published 23 days ago • 2
Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets Paper • 2604.22294 • Published 9 days ago • 16
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 9 days ago • 219
AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 19 days ago • 162
WorldMark: A Unified Benchmark Suite for Interactive Video World Models Paper • 2604.21686 • Published 10 days ago • 36
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling Paper • 2604.19734 • Published 12 days ago • 29