mathias-atla/prometheus-eval-feedback-collection-100samples-trainvalid Viewer • Updated 2 days ago • 200 • 11
mathias-atla/prometheus-eval-feedback-collection-100samples-trainvalid Viewer • Updated 2 days ago • 200 • 11
view article Article Judge Arena: Benchmarking LLMs as Evaluators By kaikaidai and 7 others • Nov 19, 2024 • 56