-
ReportBench: Evaluating Deep Research Agents via Academic Survey Tasks
Paper • 2508.15804 • Published • 15 -
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Paper • 2510.02209 • Published • 53 -
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
Paper • 2511.16043 • Published • 108
Tobias Völzing
wumingshi
·
AI & ML interests
None yet
Recent Activity
updated
a collection
12 days ago
Fundamental
updated
a collection
22 days ago
Fundamental
updated
a collection
about 1 month ago
Fine-Tuning
Organizations
None yet