Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis Paper • 2411.19509 • Published Nov 29, 2024 • 3
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion Paper • 2512.04926 • Published 12 days ago • 41
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper • 2512.05150 • Published 14 days ago • 71
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 13 days ago • 166
SARLO-80: Worldwide Slant SAR Language Optic Dataset at 80 cm Resolution Article • 16 days ago • 3
Evaluating In Silico Creativity: An Expert Review of AI Chess Compositions Paper • 2510.23772 • Published Oct 27 • 1
Prompt-to-Prompt Image Editing with Cross Attention Control Paper • 2208.01626 • Published Aug 2, 2022 • 3
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale Paper • 2407.05282 • Published Jul 7, 2024 • 16
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech Paper • 2106.06103 • Published Jun 11, 2021 • 4
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17 • 89
Video-As-Prompt: Unified Semantic Control for Video Generation Paper • 2510.20888 • Published Oct 23 • 45