MSTS: A Multimodal Safety Test Suite for Vision-Language Models Paper • 2501.10057 • Published 21 days ago • 8
Gemma 2: Improving Open Language Models at a Practical Size Paper • 2408.00118 • Published Jul 31, 2024 • 76
Near to Mid-term Risks and Opportunities of Open-Source Generative AI Paper • 2404.17047 • Published Apr 25, 2024 • 1
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper • 2406.10118 • Published Jun 14, 2024 • 31
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published Apr 18, 2024 • 11
Flesch or Fumble? Evaluating Readability Standard Alignment of Instruction-Tuned Language Models Paper • 2309.05454 • Published Sep 11, 2023
Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation Paper • 2402.12593 • Published Feb 19, 2024
QuALITY: Question Answering with Long Input Texts, Yes! Paper • 2112.08608 • Published Dec 16, 2021 • 2
Does Putting a Linguist in the Loop Improve NLU Data Collection? Paper • 2104.07179 • Published Apr 15, 2021 • 1
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Paper • 2206.04615 • Published Jun 9, 2022 • 5
Gemini: A Family of Highly Capable Multimodal Models Paper • 2312.11805 • Published Dec 19, 2023 • 44