Intelligent-Internet/swebench-pro-claude-sonnet-4.5-ii-agent-trajectories Viewer • Updated Nov 7 • 726 • 17
Intelligent-Internet/OpenAI-HealthBench-II-Medical-8B-1706-GPT-4.1 Viewer • Updated Jun 17 • 5k • 79 • 2