Chenming Zhu

ChaimZhu

https://zcmax.github.io/

AI & ML interests

Multimodal Large Language Models, 3D Perception and Understanding, Embodied AI

Recent Activity

upvoted a paper 10 days ago

MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

upvoted a paper 16 days ago

Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation

upvoted a paper about 1 month ago

G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Papers 2

arxiv:2507.07984

arxiv:2409.18125

Models 1

ChaimZhu/LLaVA-3D-7B

Updated Oct 18, 2024 • 195 • 4

Datasets 2

ChaimZhu/LLaVA-3D-Data

Viewer • Updated Jul 11 • 859k • 73

ChaimZhu/LLaVA-3D-Demo-Data

Viewer • Updated Oct 18, 2024 • 1.02k • 1.64k • 3