StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling
- mengwei0427/StreamVLN_Video_qwen_1_5_r2r_rxr_envdrop_scalevln
  Robotics • 8B • Updated • 302 • 2
- mengwei0427/StreamVLN_Video_qwen_1_5_r2r_rxr_envdrop_scalevln_v1_3
  Text Generation • 8B • Updated • 108
- mengwei0427/StreamVLN_Video_qwen_1_5_r2r_rxr_envdrop_scalevln_real_world
  8B • Updated • 41
- chchnii/StreamVLN-ScanQA-SQA3D-Data
  Viewer • Updated • 53.1k • 33 • 1