Alan Blanchet

Alanox

https://alan-blanchet.fr/

AlanBlanchet

AI & ML interests

None yet

Organizations

Alanox's collections (1)

LLM Evaluation Benchmarks

This collection references the evaluation benchmarks commonly seen in traditional LLM papers.

MMLU-Pro Leaderboard 🥇 (246 likes)

More advanced and challenging multi-task evaluation

GAIA Leaderboard 🦾 (605 likes)

Submit and score your model on the GAIA benchmark
