arxiv:2501.08328
Richard Zhuang PRO
RZ412
AI & ML interests
LLM Routing, LLM + Games, Post-Training, Agents
Recent Activity
updated a dataset 20 minutes ago
DCAgent2/terminal_bench_2_sft__Kimi_2_5_swesmith_oracle_maxeps_32k__Qwen3_8B_20260330_014452 published a dataset 21 minutes ago
DCAgent2/terminal_bench_2_sft__Kimi_2_5_swesmith_oracle_maxeps_32k__Qwen3_8B_20260330_014452 updated a dataset about 2 hours ago
DCAgent2/swebench_verified_random_100_folders_nemotron_1000_opt1k__Qwen3_8B_20260330_014815