Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism Paper • 2604.09544 • Published 4 days ago • 2
MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines Paper • 2603.06679 • Published 15 days ago • 5
Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos Paper • 2603.25645 • Published 18 days ago • 4
ULTRA: Unified Multimodal Control for Autonomous Humanoid Whole-Body Loco-Manipulation Paper • 2603.03279 • Published Mar 3 • 1
HandX: Scaling Bimanual Motion and Interaction Generation Paper • 2603.28766 • Published 14 days ago • 12
HandX: Scaling Bimanual Motion and Interaction Generation Paper • 2603.28766 • Published 14 days ago • 12
AVO: Agentic Variation Operators for Autonomous Evolutionary Search Paper • 2603.24517 • Published 19 days ago • 10
BioVITA: Biological Dataset, Model, and Benchmark for Visual-Textual-Acoustic Alignment Paper • 2603.23883 • Published 20 days ago • 6
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions Paper • 2603.23495 • Published 20 days ago • 3
AnimalCLAP: Taxonomy-Aware Language-Audio Pretraining for Species Recognition and Trait Inference Paper • 2603.22053 • Published 21 days ago • 3
V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising Paper • 2603.16792 • Published 27 days ago • 3
Motion2Motion: Cross-topology Motion Transfer with Sparse Correspondence Paper • 2508.13139 • Published Aug 18, 2025 • 4
MeshMosaic: Scaling Artist Mesh Generation via Local-to-Global Assembly Paper • 2509.19995 • Published Sep 24, 2025 • 2
SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction Paper • 2510.07723 • Published Oct 9, 2025 • 5
DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image Paper • 2406.17988 • Published Jun 26, 2024
Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation Paper • 2506.06440 • Published Jun 6, 2025
Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image Paper • 2511.01767 • Published Nov 3, 2025
PartSAM: A Scalable Promptable Part Segmentation Model Trained on Native 3D Data Paper • 2509.21965 • Published Sep 26, 2025
TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels Paper • 2512.08358 • Published Dec 9, 2025 • 6