MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper โข 2502.14499 โข Published 20 days ago โข 178
Running on CPU Upgrade 5.05k 5.05k MTEB Leaderboard ๐ฅ Select benchmarks and languages for text embeddings evaluation