5 18 20

Zuhao Yang

mwxely

https://mwxely.github.io/

AI & ML interests

Large Multimodal Models

Recent Activity

upvoted a paper about 5 hours ago

mHC: Manifold-Constrained Hyper-Connections

upvoted a paper 3 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

upvoted a paper 4 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

View all activity

Organizations

upvoted a paper about 5 hours ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 2 days ago • 118

upvoted a paper 3 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 24 days ago • 128

upvoted a paper 4 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published 25 days ago • 115

upvoted 2 papers 11 days ago

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

Paper • 2512.17532 • Published 14 days ago • 64

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 11 days ago • 61

upvoted 3 papers 17 days ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 227

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 210

New activity in mwxely/TransitBench 17 days ago

[bot] Conversion to Parquet

#1 opened 6 months ago by

parquet-converter

When do you release the code?

#2 opened 4 months ago by

zhangzb

authored a paper 17 days ago

A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models

Paper • 2511.15098 • Published Nov 19, 2025

updated 3 datasets 23 days ago

liked a dataset 23 days ago

longvideotool/VideoSIAH-Eval

Viewer • Updated 23 days ago • 1.28k • 150 • 2

published a dataset 23 days ago

longvideotool/VideoSIAH-Eval

Viewer • Updated 23 days ago • 1.28k • 150 • 2

commented a paper 25 days ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25, 2025 • 182 •

liked a dataset 28 days ago

longvideotool/LongVT-Demo

Viewer • Updated Nov 26, 2025 • 5 • 227 • 1

New activity in longvideotool/LongVT-Source 28 days ago

[bot] Conversion to Parquet

#2 opened 28 days ago by

parquet-converter

Missing “wemath (WeMath data)” zip file in LongVT-Source dataset training data

#1 opened 29 days ago by

Seele77

Zuhao Yang

AI & ML interests

Recent Activity

Organizations

mwxely's activity

[bot] Conversion to Parquet

When do you release the code?

[bot] Conversion to Parquet

Missing “wemath (WeMath data)” zip file in LongVT-Source dataset training data