SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer Paper • 2605.30409 • Published 23 days ago • 40
Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation Paper • 2605.15141 • Published May 14 • 94
SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning Paper • 2606.10804 • Published 11 days ago • 43
LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing Paper • 2606.06042 • Published 16 days ago • 24
GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration Paper • 2605.31039 • Published 22 days ago • 44
Bootstrap Your Generator: Unpaired Visual Editing with Flow Matching Paper • 2606.03911 • Published 18 days ago • 22
LVSA: Training-Free Sparse Attention for Long Video Diffusion Paper • 2605.31057 • Published 22 days ago • 14
Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos Paper • 2605.18233 • Published May 18 • 92
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published May 1 • 85
ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control Paper • 2604.20816 • Published Apr 22 • 15
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published Apr 24 • 63
DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution Paper • 2507.01012 • Published Jul 1, 2025 • 2
JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion Paper • 2601.22143 • Published Jan 29 • 12
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published Apr 9 • 115
Focal Guidance: Unlocking Controllability from Semantic-Weak Layers in Video Diffusion Models Paper • 2601.07287 • Published Jan 12 • 6
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing Paper • 2603.19228 • Published Mar 19 • 68
view article Article NEO-unify: Building Native Multimodal Unified Models End to End sensenova • Mar 5 • 165