Lost in Backpropagation: The LM Head is a Gradient Bottleneck Paper • 2603.10145 • Published 28 days ago • 12
The Synthetic Data Playbook: Generating Trillions of the Finest Tokens • Explore synthetic data experiments on a virtual bookshelf • 217
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 634 items • Updated about 6 hours ago • 93