Optimization-Guided Diffusion for Interactive Scene Generation Paper • 2512.07661 • Published Dec 8, 2025 • 4
HPLT 3.0: Very Large-Scale Multilingual Resources for LLM and MT. Mono- and Bi-lingual Data, Multilingual Evaluation, and Pre-Trained Models Paper • 2511.01066 • Published Nov 2, 2025 • 2
FlowInOne:Unifying Multimodal Generation as Image-in, Image-out Flow Matching Paper • 2604.06757 • Published 4 days ago • 8
CodeX Collection The best available Pre-curated coding datasets on Platform. • 2 items • Updated Mar 2 • 3
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use Paper • 2501.02506 • Published Jan 5, 2025 • 11
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web Paper • 2604.08516 • Published 3 days ago • 33
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 6 items • Updated 1 day ago • 24
SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds Paper • 2604.08544 • Published 3 days ago • 11
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping Paper • 2604.08364 • Published 3 days ago • 87
FIT: A Large-Scale Dataset for Fit-Aware Virtual Try-On Paper • 2604.08526 • Published 3 days ago • 17
On the Step Length Confounding in LLM Reasoning Data Selection Paper • 2604.06834 • Published 4 days ago • 3
On the Continuity of Rotation Representations in Neural Networks Paper • 1812.07035 • Published Dec 17, 2018 • 1
SentiAvatar: Towards Expressive and Interactive Digital Humans Paper • 2604.02908 • Published 9 days ago • 1
RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis Paper • 2404.16754 • Published Apr 25, 2024 • 1
Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization Paper • 2604.07343 • Published 4 days ago • 8
FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration Paper • 2603.29557 • Published 12 days ago • 17