Sparrow is a data augmentation method that enriches the instruction diversity of video data. You can find related data and weights here.
Shukang Yin
xjtupanda
AI & ML interests
Computer Vision, Multimodal learning
Recent Activity
upvoted a paper about 22 hours ago
UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification upvoted a paper about 1 month ago
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video UnderstandingOrganizations
None yet