Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models
Paper
• 2405.20541 • Published
• 24
None defined yet.
OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning
KARL: Knowledge Agents via Reinforcement Learning