MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 7 items • Updated 15 days ago • 55
view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 624
view article Article CUGA on Hugging Face: Democratizing Configurable AI Agents ibm-research • Dec 15, 2025 • 67
Nemotron RAG Collection Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 10 items • Updated 8 days ago • 92
view article Article Introducing ColQwen-Omni: Retrieve in every modality manu • Jul 17, 2025 • 76
view article Article How to Choose the Best Open Source LLM for Your Project in 2025 dvilasuero • Sep 9, 2025 • 77
view article Article 🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders adaamko • Aug 31, 2025 • 16
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders thomwolf, matthieu-lapeyre • Jul 9, 2025 • 799
view article Article Gemma 3n fully available in the open-source ecosystem! +6 ariG23498, pcuenq, sergiopaniego, reach-vb, FL33TW00D-HF, Xenova, Steveeeeeeen, kashif • Jun 26, 2025 • 121
view article Article *Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings manu • Jun 2, 2025 • 28
view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 611
view article Article Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs Omartificial-Intelligence-Space • May 1, 2025 • 7
Hallucination detection Collection Trained ModernBERT (base and large) for detection hallucinations in LLM responses. The models are trained as token classifications. • 4 items • Updated May 18, 2025 • 19
view article Article Open-source DeepResearch – Freeing our search agents +3 m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k
view article Article Fine-tune ModernBERT for text classification using synthetic data davidberenstein1957 • Dec 30, 2024 • 39
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 +1 eliebak, lvwerra, lewtun • Jan 28, 2025 • 889
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM davidberenstein1957 • Jan 3, 2025 • 38
view article Article Train 400x faster Static Embedding Models with Sentence Transformers tomaarsen • Jan 15, 2025 • 230