Open Multilingual Llm Leaderboard
Search for model performance across languages and benchmarks
Search for model performance across languages and benchmarks
Request evaluation for a new model
Evaluate Persian LLMs on various tasks
Predict customer churn based on input details
View and submit LLM evaluations
Track, rank and evaluate open LLMs in the italian language!
Test your AI models with Giskard
Track, rank and evaluate open LLMs' CoT quality
Track, rank and evaluate open LLMs in Portuguese
View leaderboard of LLMs on EQ-Bench
Benchmark machine learning interatomic potential at scale
Measure BERT model performance using WebGPU and WASM
Leaderboard of information retrieval models in French
Merge AI models using a YAML configuration file
Display and filter leaderboard data for language models
Explore and submit LLM benchmarks
Ranking for Open-sourced LLMs in different domains
Track, rank and evaluate open LLMs and chatbots
Quantize models to OpenVINO format
View and submit LLM benchmark evaluations
View and submit LLM evaluations
Convert Hugging Face model repos to safetensors files
Display evaluation results for large language models
Submit and track LLM evaluations