MixEval

community
https://mixeval.github.io/
NiJinjie
Psycoy

AI & ML interests

LLM & LMM evaluation

Recent Activity

yuexiang96 authored a paper 8 days ago
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
yuexiang96 authored a paper 8 days ago
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
yuexiang96 authored a paper 8 days ago
Simulating Environments with Reasoning Models for Agent Training

Yifan Song, Xiang Yue, Jinjie Ni, Bo Li, Deepanway, David Junhao ZHANG, Fuzhao Xue

MixEval's models

None public yet