SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 26 days ago • 97
PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents Paper • 2406.13923 • Published Jun 20, 2024 • 23
Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots • Running on CPU Upgrade • 12.8k