Global PIQA A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries. Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 30 days ago • 13 mrlbenchmarks/global-piqa-nonparallel Viewer • Updated 29 days ago • 11.6k • 5.15k • 26
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 30 days ago • 13
Multilingual Leaderboards Leaderboards for languages other than English Running on CPU Upgrade 74 La Leaderboard 🌸 74 Evaluate open LLMs in the languages of LATAM and Spain. Running on CPU Upgrade 122 Open Chinese LLM Leaderboard 🏆 122 Explore and submit LLM benchmarks Running on CPU Upgrade 169 Open Arabic LLM Leaderboard 🏆 169 Track, rank and evaluate open Arabic LLMs and chatbots Running 40 OpenLLM French leaderboard 🇫🇷 🥇 40 Explore and submit LLM benchmarks
Running on CPU Upgrade 74 La Leaderboard 🌸 74 Evaluate open LLMs in the languages of LATAM and Spain.
Running on CPU Upgrade 169 Open Arabic LLM Leaderboard 🏆 169 Track, rank and evaluate open Arabic LLMs and chatbots
Global PIQA A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries. Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 30 days ago • 13 mrlbenchmarks/global-piqa-nonparallel Viewer • Updated 29 days ago • 11.6k • 5.15k • 26
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 30 days ago • 13
Multilingual Leaderboards Leaderboards for languages other than English Running on CPU Upgrade 74 La Leaderboard 🌸 74 Evaluate open LLMs in the languages of LATAM and Spain. Running on CPU Upgrade 122 Open Chinese LLM Leaderboard 🏆 122 Explore and submit LLM benchmarks Running on CPU Upgrade 169 Open Arabic LLM Leaderboard 🏆 169 Track, rank and evaluate open Arabic LLMs and chatbots Running 40 OpenLLM French leaderboard 🇫🇷 🥇 40 Explore and submit LLM benchmarks
Running on CPU Upgrade 74 La Leaderboard 🌸 74 Evaluate open LLMs in the languages of LATAM and Spain.
Running on CPU Upgrade 169 Open Arabic LLM Leaderboard 🏆 169 Track, rank and evaluate open Arabic LLMs and chatbots