MSTS: A Multimodal Safety Test Suite for Vision-Language Models Paper • 2501.10057 • Published 21 days ago • 8
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages Paper • 2501.08284 • Published 24 days ago • 6
Running 222 222 AI2 WildBench Leaderboard (V2) 🦁 Display and explore model leaderboards and chat history
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published Apr 18, 2024 • 11