arxiv:2507.07984
Chenming Zhu
ChaimZhu
AI & ML interests
Multimodal Large Language Models, 3D Perception and Understanding, Embodied AI
Recent Activity
upvoted
a
paper
10 days ago
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
upvoted
a
paper
about 1 month ago
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning