TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 38
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 7 days ago • 38
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5 • 59
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 16 items • Updated 9 days ago • 137
Article Building for an Open Future - our new partnership with Google Cloud Nov 13 • 47
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built upon gpt-oss • 2 items • Updated Oct 29 • 58
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published Oct 6 • 126
Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare +1 Apr 19, 2024 • 189
Article Introducing AI Sheets: a tool to work with datasets using open AI models! +4 Aug 8 • 106
Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 Aug 5 • 508
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 • 171
Article AI Total Cost of Ownership Calculator: Evaluate the cost of in-house AI deployment vs AI APIs Sep 20, 2023 • 5
Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3 • 301
Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! +1 Jun 6 • 55
sarvam-m Collection Collection of all variations of the sarvam-m model • 3 items • Updated May 24 • 19