Running Agents 37 BigCodeArena 🚀 37 Compare two AI models by sending them code and seeing their responses
Running Agents 231 BigCodeBench Leaderboard 🥇 231 Explore code-generation model leaderboards and task details