25 30 6

Jintao Zhang

jt-zhang

https://jt-zhang.github.io/

jt-zhang

AI & ML interests

Efficient ML

Recent Activity

authored a paper 19 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

upvoted a paper 19 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

submitted a paper 19 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

View all activity

Organizations

authored a paper 19 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Paper • 2603.18742 • Published 26 days ago • 10

upvoted a paper 19 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Paper • 2603.18742 • Published 26 days ago • 10

submitted a paper to Daily Papers 19 days ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Paper • 2603.18742 • Published 26 days ago • 10

authored a paper 29 days ago

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Paper • 2603.07815 • Published Mar 8 • 10

upvoted a paper 29 days ago

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Paper • 2603.07815 • Published Mar 8 • 10

submitted a paper to Daily Papers 29 days ago

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Paper • 2603.07815 • Published Mar 8 • 10

authored 2 papers about 1 month ago

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

Paper • 2603.08982 • Published Mar 9 • 15

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82

upvoted 2 papers about 1 month ago

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

Paper • 2603.08982 • Published Mar 9 • 15

authored a paper about 1 month ago

SageBwd: A Trainable Low-bit Attention

Paper • 2603.02170 • Published Mar 2 • 19

upvoted a paper about 1 month ago

SageBwd: A Trainable Low-bit Attention

Paper • 2603.02170 • Published Mar 2 • 19

submitted a paper to Daily Papers about 1 month ago

SageBwd: A Trainable Low-bit Attention

Paper • 2603.02170 • Published Mar 2 • 19

authored a paper about 2 months ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published Feb 13 • 44

updated a collection about 2 months ago

efficient ml

Collection

11 items • Updated Feb 20 • 2

upvoted a paper about 2 months ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published Feb 13 • 44

submitted a paper to Daily Papers about 2 months ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published Feb 13 • 44

authored a paper about 2 months ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published Feb 13 • 58

upvoted a paper about 2 months ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published Feb 13 • 58

updated a collection about 2 months ago

efficient ml

Collection

11 items • Updated Feb 20 • 2

Jintao Zhang

AI & ML interests

Recent Activity

Organizations

jt-zhang's activity