HanSaem Kim's picture

214 14

HanSaem Kim

kensaem

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

upvoted a paper 2 days ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

upvoted a paper 2 days ago

PersonaLive! Expressive Portrait Image Animation for Live Streaming

View all activity

Organizations

None yet

upvoted 3 papers 2 days ago

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

Paper • 2512.11799 • Published 4 days ago • 29

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published 4 days ago • 34

PersonaLive! Expressive Portrait Image Animation for Live Streaming

Paper • 2512.11253 • Published 5 days ago • 24

upvoted 3 papers 5 days ago

VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory

Paper • 2512.04519 • Published 13 days ago • 3

Stronger Normalization-Free Transformers

Paper • 2512.10938 • Published 5 days ago • 17

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Paper • 2512.10881 • Published 5 days ago • 26

upvoted 2 papers 7 days ago

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Paper • 2503.14487 • Published Mar 18 • 28

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards

Paper • 2512.00473 • Published 18 days ago • 24

upvoted 5 papers 8 days ago

LongCat-Image Technical Report

Paper • 2512.07584 • Published 9 days ago • 17

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Paper • 2512.04784 • Published 15 days ago • 24

TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

Paper • 2512.05150 • Published 14 days ago • 71

Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation

Paper • 2512.04678 • Published 13 days ago • 39

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 13 days ago • 166

upvoted 7 papers 9 days ago

Light-X: Generative 4D Video Rendering with Camera and Illumination Control

Paper • 2512.05115 • Published 12 days ago • 10

RELIC: Interactive Video World Model with Long-Horizon Memory

Paper • 2512.04040 • Published 13 days ago • 23

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published 14 days ago • 30

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 20 days ago • 126

UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers

Paper • 2512.04504 • Published 13 days ago • 15

Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression

Paper • 2512.05081 • Published 12 days ago • 30

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published 11 days ago • 36