Spaces

·

The AI App Directory

New Space Get PRO Learn more

MTEB Leaderboard

Embedding Leaderboard

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

UGI Leaderboard

Uncensored General Intelligence Leaderboard

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

Open ASR Leaderboard

Explore speech model benchmarks and submit evaluation requests

DeepResearch Bench

Showcase benchmark leaderboard for DeepResearch models

LLM Hallucination Leaderboard

View and filter LLM hallucination leaderboard

Open Universal Arabic Asr Leaderboard

A benchmark for open-source multi-dialect Arabic ASR models

EuroEval Leaderboard

The robust European language model benchmark.

TabArena

TabArena

LLM-Perf Leaderboard

Compare LLM performance to find the best model for your hardware

Big Code Models Leaderboard

Explore and submit evaluations for code generation models

Hallucination Evaluation Leaderboard

Redirect to leaderboard page

VBench Leaderboard

Submit and view video model benchmark scores

LLM Performance Leaderboard

View LLM performance rankings on an interactive leaderboard

Vidore Leaderboard

Compare and rank visual document retrieval models across different benchmarks

The timm Leaderboard

Compare and visualize PyTorch image model performance metrics

Agent Leaderboard

Ranking of LLMs for agentic tasks

Leaderboard: Physical Reasoning from Video

Submit model evaluations and view leaderboard results

MTEB Leaderboard

Embedding Leaderboard

VBench-IBench-Leaderboard

Which Video and Image Generation Model is better?

MedVidBench Leaderboard

MedVidBench Benchmark Leaderboard - 8 medical video tasks

LMArena Leaderboard

View the LMArena model performance leaderboard

GAIA Leaderboard

Submit model results and view GAIA benchmark leaderboard