Klear: Unified Multi-Task Audio-Video Joint Generation Paper • 2601.04151 • Published 2 days ago • 12
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 11 days ago • 64
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published 7 days ago • 48
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published 8 days ago • 106
Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion Paper • 2512.23709 • Published 11 days ago • 48
Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published 11 days ago • 5
VA-π: Variational Policy Alignment for Pixel-Aware Autoregressive Generation Paper • 2512.19680 • Published 18 days ago • 10
Spatia: Video Generation with Updatable Spatial Memory Paper • 2512.15716 • Published 23 days ago • 30
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published 23 days ago • 93
Region-Constraint In-Context Generation for Instructional Video Editing Paper • 2512.17650 • Published 21 days ago • 50
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published 18 days ago • 62
Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation Paper • 2512.17040 • Published 22 days ago • 27
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published 29 days ago • 34
RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing Paper • 2512.16864 • Published 22 days ago • 10
FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction Paper • 2512.16900 • Published 22 days ago • 10