6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models Paper • 2603.18742 • Published 26 days ago • 10
HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration Paper • 2603.07815 • Published Mar 8 • 10
SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing Paper • 2603.08982 • Published Mar 9 • 15
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning Paper • 2602.13515 • Published Feb 13 • 44
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper • 2602.12675 • Published Feb 13 • 58