Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining Paper • 2603.11103 • Published 7 days ago • 8
AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning Paper • 2510.06261 • Published Oct 5, 2025 • 6
Reasoned Safety Alignment: Ensuring Jailbreak Defense via Answer-Then-Check Paper • 2509.11629 • Published Sep 15, 2025 • 1
Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection Paper • 2406.00806 • Published Jun 2, 2024
VideoWorld 2: Learning Transferable Knowledge from Real-world Videos Paper • 2602.10102 • Published Feb 10 • 14
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better Paper • 2602.05393 • Published Feb 5 • 8
Protein Autoregressive Modeling via Multiscale Structure Generation Paper • 2602.04883 • Published Feb 4 • 3
SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning Paper • 2602.02472 • Published Feb 2 • 46
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation Paper • 2601.21420 • Published Jan 29 • 42