OpenEvals
community
Followers: 126
AI & ML interests: LLM evaluation
Recent Activity
alozowski authored a paper about 19 hours ago: YourBench: Easy Custom Evaluation Sets for Everyone
SaylorTwift updated a Space 6 days ago: OpenEvals/evaluation-guidebook
SaylorTwift updated a Space 6 days ago: OpenEvals/README
Team members: 10
OpenEvals's datasets: 3
OpenEvals/MuSR: updated 8 days ago, 756 rows, 39 downloads
OpenEvals/SimpleQA: updated 8 days ago, 4.33k rows, 323 downloads, 3 likes
OpenEvals/aime_24: updated 8 days ago, 30 rows, 62 downloads