Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Yuhao Dong PRO
THUdyh
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence upvoted a paper 15 days ago
VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding authored a paper 23 days ago
From Pixels to Words -- Towards Native One-Vision Models at Scale