Running Agents 430 Reward Bench Leaderboard 📐 430 Explore RewardBench model rankings with filters and samples
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots