Papers for Blackroot/SimpleDiffusion-TensorProductAttentionRope
- arXiv:2501.06425
- Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion (arXiv:2410.19324)
- Expanded Gating Ranges Improve Activation Functions (arXiv:2405.20768)
- A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale (arXiv:2309.06497)
- Flow Matching for Generative Modeling (arXiv:2210.02747)




