The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated Nov 16, 2025 • 113
Article Smol2Operator: Post-Training GUI Agents for Computer Use +3 A-Mahla, merve, sergiopaniego, reach-vb, lewtun • Sep 23, 2025 • 138
Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs davidberenstein1957 • May 7, 2025 • 42
Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. tiiuae • May 15, 2025 • 36
Building Math Agents with Multi-Turn Iterative Preference Learning Paper • 2409.02392 • Published Sep 4, 2024 • 16
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30, 2024 • 42
Comparing DPO with IPO and KTO Collection A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO. • 56 items • Updated Jan 8, 2025 • 32
UDOP Collection UDOP is a general multimodal model for document AI • 4 items • Updated May 1, 2025 • 25