Spaces

·

The AI App Directory

New Space Get PRO Learn more

MTEB Leaderboard

Embedding Leaderboard

Open SLM Leaderboard

Open Small Language Model Leaderboard

UGI Leaderboard

Uncensored General Intelligence Leaderboard

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

Open ASR Leaderboard

Compare speech-to-text models using benchmark scores

Tiny-LM Leaderboard

Every tiny LM, same eval harness, transparent benchmarks

Arena Leaderboard

View the LMArena leaderboard in full‑screen

AISA-ArabicFC Leaderboard

Live auto-evaluator + leaderboard · ArabicNLP 2026

Low-bit LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

DeepResearch Bench

Explore Deep Research Agent benchmark rankings

LeaderboardV3

Explore embedding model rankings across 100+ benchmarks

VBench Leaderboard

Submit video model evaluation results to a public benchmark

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

Image Arena Leaderboard

Image Generation and Image Editing Arena & Leaderboard

Open Universal Arabic Asr Leaderboard

A benchmark for open-source multi-dialect Arabic ASR models

Open Telco Leaderboard

Benchmarking LLMs on telecommunications tasks

MLX Benchmark V2 Leaderboard

Evaluating LLMs on Apple MLX framework

VANTAGE Bench Leaderboard

Explore VANTAGE‑Bench model rankings with interactive filters

Coding Agent Leaderboard

Compare coding agent models + harnesses

GAIA Leaderboard

Submit and view GAIA model evaluation leaderboard

LLM-Perf Leaderboard

Compare LLM hardware performance and find the best model

Big Code Models Leaderboard

Explore and compare code model performance on a leaderboard

Open Chinese LLM Leaderboard

Explore LLM benchmark scores and submit your model for evaluation

Open Medical-LLM Leaderboard

Explore and submit models for benchmarking