9 32 57

Jiaming Han

csuhan

https://csuhan.com

csuhan

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper 12 days ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

upvoted a paper 17 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

upvoted a paper about 1 month ago

OneThinker: All-in-one Reasoning Model for Image and Video

View all activity

Organizations

None yet

upvoted a paper 12 days ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Paper • 2512.17909 • Published 15 days ago • 36

upvoted a paper 17 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published 19 days ago • 63

upvoted 3 papers about 1 month ago

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published Dec 2, 2025 • 32

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 220

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published Nov 25, 2025 • 27

liked a dataset about 2 months ago

jasonzhango/SPAR-7M-RGBD

Updated Jun 15, 2025 • 440 • 7

upvoted a paper 3 months ago

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Paper • 2510.08555 • Published Oct 9, 2025 • 63

liked a dataset 3 months ago

WINDop/OpenGPT-4o-Image

Updated Nov 2, 2025 • 968 • 18

updated a dataset 3 months ago

csuhan/demo_prompts_2

Viewer • Updated Sep 25, 2025 • 52 • 76

published a dataset 3 months ago

csuhan/demo_prompts_2

Viewer • Updated Sep 25, 2025 • 52 • 76

upvoted a collection 3 months ago

Qwen3-Omni

Collection

6 items • Updated 3 days ago • 177

updated a model 3 months ago

csuhan/TA-Tok-c16-256px

Updated Sep 22, 2025

published a model 3 months ago

csuhan/TA-Tok-c16-256px

Updated Sep 22, 2025

liked a dataset 3 months ago

sled-umich/3D-GRAND

Updated Oct 10, 2025 • 612 • 11

updated a collection 4 months ago

Tar

Collection

[NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations • 11 items • Updated Sep 20, 2025 • 1

upvoted 2 papers 4 months ago

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Paper • 2509.09680 • Published Sep 11, 2025 • 43

Discrete Noise Inversion for Next-scale Autoregressive Text-based Image Editing

Paper • 2509.01984 • Published Sep 2, 2025 • 6

updated a dataset 4 months ago

csuhan/ImageNet1K-T2I-QwenVL-QwenImage

Viewer • Updated Sep 1, 2025 • 1.1M • 110

published a dataset 4 months ago

csuhan/ImageNet1K-T2I-QwenVL-QwenImage

Viewer • Updated Sep 1, 2025 • 1.1M • 110

updated a model 4 months ago

csuhan/Tar-Lumina2

Updated Sep 1, 2025

Jiaming Han

AI & ML interests

Recent Activity

Organizations

csuhan's activity