davidshu's picture

4 1

davidshu

davids97

·

AI & ML interests

None yet

Organizations

None yet

upvoted 4 papers 4 months ago

An Empirical Study of Testing Practices in Open Source AI Agent Frameworks and Agentic Applications

Paper • 2509.19185 • Published Sep 23, 2025 • 3

TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them

Paper • 2509.21117 • Published Sep 25, 2025 • 30

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24, 2025 • 42

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 80