new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Jun 10

Submitted by

akhaliq

Mixture-of-Agents Enhances Large Language Model Capabilities

·
5 authors

Submitted by

akhaliq

CRAG -- Comprehensive RAG Benchmark

·
27 authors

Submitted by

akhaliq

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

·
9 authors

Submitted by

akhaliq

Large Language Model Confidence Estimation via Black-Box Access

·
5 authors

Submitted by

akhaliq

GenAI Arena: An Open Evaluation Platform for Generative Models

·
7 authors

Submitted by

akhaliq

Proofread: Fixes All Errors with One Tap

·
9 authors

Submitted by

akhaliq

NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

·
11 authors

Submitted by

akhaliq

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

·
9 authors

Submitted by

akhaliq

Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

·
25 authors