Spaces

·

The AI App Directory

New Space Get PRO Learn more

MTEB Leaderboard

Embedding Leaderboard

UGI Leaderboard

Uncensored General Intelligence Leaderboard

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

Arena Leaderboard

View the LMArena model leaderboard

Heatmap Leaderboard

Organization Activity Heatmap

MLX Benchmark V2 Leaderboard

Evaluating LLMs on Apple MLX framework

Open Japanese LLM Leaderboard

Explore LLM benchmark leaderboard and submit models

Open ASR Leaderboard

Explore and compare speech‑recognition model benchmarks

Image Arena Leaderboard

Image Generation and Image Editing Arena & Leaderboard

TabArena

TabArena

Qimma Leaderboard

Qimma leaderboard

Pii Masking Benchmark Leaderboard

PII Masking Benchmark Leaderboard

GAIA Leaderboard

Submit your model answers to GAIA benchmark and view leaderboard

LLM Hallucination Leaderboard

View and filter LLM hallucination leaderboard

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

Open FinLLM Leaderboard

Explore and compare LLM performance on financial benchmarks

Big Code Models Leaderboard

Explore and submit code model evaluations on a leaderboard

Open Medical-LLM Leaderboard

Explore and submit models for benchmarking

Berkeley Function Calling Leaderboard

View the Berkeley Function-Calling Leaderboard

La Leaderboard

Evaluate open LLMs in the languages of LATAM and Spain.

LLM Performance Leaderboard

View the latest LLM performance leaderboard online

Low-bit Quantized Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

BigCodeBench Leaderboard

Explore code-generation model leaderboards and task details

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots