Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 1 day ago • 22
V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising Paper • 2603.16792 • Published 4 days ago • 3
Mamba-3: Improved Sequence Modeling using State Space Principles Paper • 2603.15569 • Published 5 days ago • 5
Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 5 days ago • 57
ECoLAD: Deployment-Oriented Evaluation for Automotive Time-Series Anomaly Detection Paper • 2603.10926 • Published 10 days ago • 1
Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection Paper • 2603.12916 • Published 8 days ago • 3
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published 9 days ago • 62
Running on Zero MCP Featured 79 FLUX.2 Klein 9B KV 🎨 79 Generate or edit images from text and optional photos
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 1 day ago • 230
view changelog Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 11 days ago • 178