SP$^3$: Spherical Priors for Plug-and-Play Restoration Paper • 2606.16396 • Published 14 days ago • 15
VideoMDM: Towards 3D Human Motion Generation From 2D Supervision Paper • 2606.13364 • Published 18 days ago • 20
Text-to-Image Models Need Less from Text Encoders Than You Think Paper • 2606.03715 • Published 27 days ago • 11
MulTaBench: Benchmarking Multimodal Tabular Learning with Text and Image Paper • 2605.10616 • Published May 11 • 142
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation Paper • 2605.13724 • Published May 13 • 105
Versatile Editing of Video Content, Actions, and Dynamics without Training Paper • 2603.17989 • Published Mar 18 • 18
Spanning the Visual Analogy Space with a Weight Basis of LoRAs Paper • 2602.15727 • Published Feb 17 • 13
MineTheGap: Automatic Mining of Biases in Text-to-Image Models Paper • 2512.13427 • Published Dec 15, 2025 • 2
CRISP: Persistent Concept Unlearning via Sparse Autoencoders Paper • 2508.13650 • Published Aug 19, 2025 • 16
Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices Paper • 2405.12211 • Published May 20, 2024 • 2
From Posterior Sampling to Meaningful Diversity in Image Restoration Paper • 2310.16047 • Published Oct 24, 2023 • 2