Running on CPU Upgrade 21 BigCodeBench Evaluator 🥇 21 Evaluate code samples using specified parameters
Running Agents 1.51k Big Code Models Leaderboard 📈 1.51k Explore and submit code model evaluations on a leaderboard