OliP's Collections

Leading Leaderboards

updated Nov 6, 2024
  • Running on CPU Upgrade
    13.9k

    Open LLM Leaderboard

    🏆

    Track, rank and evaluate open LLMs and chatbots


  • Running on CPU Upgrade
    7.04k

    MTEB Leaderboard

    🥇

    Embedding Leaderboard


  • Running
    4.72k

    LMArena Leaderboard

    🏆

    View LMArena model leaderboard


  • Running
    230

    BigCodeBench Leaderboard

    🥇

    Explore code-generation model leaderboards and task details


  • Running on CPU Upgrade
    990

    Open VLM Leaderboard

    🌎

    VLMEvalKit Evaluation Results Collection


  • Running
    196

    Vidore Leaderboard

    🥇

    Compare and rank visual document retrieval models across different benchmarks


  • Running on CPU Upgrade
    Featured
    1.22k

    Open ASR Leaderboard

    🏆

    Explore and compare speech-recognition model benchmarks


  • Running
    123

    Berkeley Function Calling Leaderboard

    View the Berkeley Function-Calling Leaderboard


  • Runtime error
    21

    LiveBench

    🥇


  • Running
    32

    JudgeBench Leaderboard

    🏆

    Generate a leaderboard for evaluating language models
