view article Article AI evals are becoming the new compute bottleneck evaleval • 12 days ago • 26
view article Article Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents ibm-research • 27 days ago • 28
view article Article The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ huggingface • Feb 3 • 53
view article Article Featherless AI on Hugging Face Inference Providers 🔥 +4 wxgeorge, pohnean-recursal, picocreator, celinah, Wauplin, sbrandeis • Jun 12, 2025 • 49
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Paper • 2506.03143 • Published Jun 3, 2025 • 54
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 354 items • Updated Mar 2 • 25
view article Article The NLP Course is becoming the LLM Course +8 burtenshaw, reach-vb, lewtun, fdaudens, pcuenq, tomaarsen, coyotte508, mishig, sergiopaniego, julien-c • Apr 3, 2025 • 106
view article Article Open R1: How to use OlympicCoder locally for coding +3 burtenshaw, reach-vb, lewtun, edbeeching, yagilb • Mar 20, 2025 • 63
view article Article LeRobot goes to driving school: World’s largest open-source self-driving dataset sandhawalia, cadene • Mar 11, 2025 • 107