Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
davidberenstein1957 's Collections
guardrails
Smol but mighty
Useful Spaces
LLM evals and benchmark datasets
Dataset Viber annotators
Cool and fun Spaces
Model Leaderboards
Useful models
Useful datasets

Model Leaderboards

updated Jan 22, 2025
Upvote
1

  • Running on CPU Upgrade
    7.45k

    MTEB Leaderboard

    πŸ“Š
    7.45k

    Embedding Leaderboard


  • Running
    Agents
    431

    Reward Bench Leaderboard

    πŸ“
    431

    Explore and compare model scores on RewardBench benchmarks


  • Running on CPU Upgrade
    14k

    Open LLM Leaderboard

    πŸ†
    14k

    Track, rank and evaluate open LLMs and chatbots


  • Running
    4.91k

    Arena Leaderboard

    πŸ†
    4.91k

    View the LMArena leaderboard in full‑screen


  • Running
    Agents
    1.51k

    Big Code Models Leaderboard

    πŸ“ˆ
    1.51k

    Explore and compare code model performance on a leaderboard


  • Running
    Agents
    232

    AI2 WildBench Leaderboard (V2)

    🦁
    232

    Display LLM performance leaderboards with customizable views


  • Running on CPU Upgrade
    Agents
    1.02k

    Open VLM Leaderboard

    🌎
    1.02k

    VLMEvalKit Evaluation Results Collection


  • Running
    Agents
    231

    BigCodeBench Leaderboard

    πŸ₯‡
    231

    Explore code-generation model leaderboards and task details


  • Running
    Agents
    Featured
    588

    LLM-Perf Leaderboard

    πŸ†
    588

    Compare LLM hardware performance and find the best model


  • Running
    116

    MTEB Arena

    βš”
    116

    Display MTEB Arena interface

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs