Weizhi Zhang

WZDavid
  • weizhi-zhang-3175441a7

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 3 months ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 55

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26 • 134
upvoted a paper 6 months ago

Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

Paper • 2507.09477 • Published Jul 13 • 86
upvoted a paper 7 months ago

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Paper • 2505.16421 • Published May 22 • 19
upvoted a paper 9 months ago

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

Paper • 2503.21460 • Published Mar 27 • 83