PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation • Paper • 2409.06820 • Published Sep 10, 2024
Saiga • Collection • LLaMA-based or Mistral-based Russian chat models • 9 items