DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data Paper • 2604.19859 • Published 6 days ago • 47
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 65
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 5 days ago • 229
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation Paper • 2604.18168 • Published 7 days ago • 96
GRM2 Collection Powerfull Reasoning-focused models for general reasoning and agentic tasks. • 2 items • Updated 3 days ago • 4
GRM-2.5 Collection Reasoning models for complex reasoning, challenging tasks, and all kinds of chat and everyday use. • 2 items • Updated 3 days ago • 3
Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision Paper • 2512.15489 • Published Dec 17, 2025 • 13
Nemotron Supervised Fine-Tuning Collection SFT datasets covering math, code, chat, safety, agentic, VLM, multilingual, and specialized domains. • 38 items • Updated 3 days ago • 3
view article Article DeepSeek-V4: a million-token context that agents can actually use 3 days ago • 29
ClawNet: Human-Symbiotic Agent Network for Cross-User Autonomous Cooperation Paper • 2604.19211 • Published 6 days ago • 10
TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents Paper • 2602.07274 • Published Feb 6 • 210
A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression Paper • 2604.19572 • Published 6 days ago • 18