SearchArena: In-the-wild Interactions with Search-LLMs w/ Human Preferences
  lmarena-ai/search-arena-v1-7k • Updated Apr 14 • 7k • 102 • 22
  lmarena-ai/search-arena-24k • Updated May 16 • 24.1k • 383 • 18
  Search Arena: Analyzing Search-Augmented LLMs • Paper 2506.05334 • Published Jun 5 • 17
Prompt-to-Leaderboard
  lmarena-ai/p2l-7b-grk-01112025 • 7B • Updated Feb 25 • 2 • 4
  lmarena-ai/p2l-3b-grk-01112025 • 3B • Updated Feb 25 • 1
  lmarena-ai/p2l-1.5b-grk-01112025 • 2B • Updated Feb 25
  lmarena-ai/p2l-0.5b-grk-01112025 • 0.5B • Updated Feb 25 • 4
Arena-Hard-Auto: An automatic evaluation tool for LLMs.
  Arena Hard Viewer: Browse and evaluate model judgments from benchmarks
  lmarena-ai/arena-hard-auto • Updated May 1 • 335 • 4