Reasoning Core: A Scalable RL Environment for LLM Symbolic Reasoning Paper • 2509.18083 • Published Sep 22, 2025 • 5 • 2
Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem Paper • 2509.06809 • Published Sep 8, 2025 • 3 • 2
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods Paper • 2407.21630 • Published Jul 31, 2024 • 8 • 2
Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation Paper • 2407.13481 • Published Jul 18, 2024 • 10 • 3