MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment Paper ⢠2512.09636 ⢠Published Dec 10, 2025 ⢠26
MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment Paper ⢠2512.09636 ⢠Published Dec 10, 2025 ⢠26
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation Paper ⢠2510.09116 ⢠Published Oct 10, 2025 ⢠97
From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models Paper ⢠2508.13491 ⢠Published Aug 19, 2025 ⢠59
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications Paper ⢠2503.20990 ⢠Published Mar 26, 2025 ⢠19