Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Paper • 2601.02151 • Published 3 days ago • 64
I-Con: A Unifying Framework for Representation Learning Paper • 2504.16929 • Published Apr 23, 2025 • 30
Pretraining Frame Preservation in Autoregressive Video Memory Compression Paper • 2512.23851 • Published 10 days ago • 22
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process Paper • 2512.23988 • Published 10 days ago • 15
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 21 days ago • 83
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics Paper • 2512.12602 • Published 26 days ago • 42
Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published 22 days ago • 14
Scaling Latent Reasoning via Looped Language Models Paper • 2510.25741 • Published Oct 29, 2025 • 221
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning Paper • 2510.25992 • Published Oct 29, 2025 • 45
Memory Retrieval and Consolidation in Large Language Models through Function Tokens Paper • 2510.08203 • Published Oct 9, 2025 • 9
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26, 2025 • 184
The Three Regimes of Offline-to-Online Reinforcement Learning Paper • 2510.01460 • Published Oct 1, 2025 • 1
RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems Paper • 2510.02263 • Published Oct 2, 2025 • 8
Generalized Parallel Scaling with Interdependent Generations Paper • 2510.01143 • Published Oct 1, 2025 • 5
Mem-α: Learning Memory Construction via Reinforcement Learning Paper • 2509.25911 • Published Sep 30, 2025 • 14
Scalable Reinforcement Post-Training Beyond Static Human Prompts: Evolving Alignment via Asymmetric Self-Play Paper • 2411.00062 • Published Oct 31, 2024 • 1
Distilled Pretraining: A modern lens of Data, In-Context Learning and Test-Time Scaling Paper • 2509.01649 • Published Sep 1, 2025 • 2