On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 22 days ago • 232
3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code Paper • 2606.01057 • Published 23 days ago • 8
SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories Paper • 2606.01311 • Published 23 days ago • 37
SOD: Step-wise On-policy Distillation for Small Language Model Agents Paper • 2605.07725 • Published May 8 • 25
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published May 20 • 207
Zero-Shot Sim-to-Real Robot Learning: A Dexterous Manipulation Study on Reactive Catching Paper • 2605.09789 • Published May 10 • 6
Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding Paper • 2605.07637 • Published May 12 • 20
AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning Paper • 2605.00425 • Published May 8 • 23
Lightning Unified Video Editing via In-Context Sparse Attention Paper • 2605.04569 • Published May 6 • 18