BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper • 2502.07346 • Published about 1 month ago • 51
Corpus: Evaluation datasets for ES & LATAM Collection Corpus of La Leaderboard, the open LLM leaderboard for ES & LATAM • 56 items • Updated Feb 5 • 4
LlamaLens Collection This collection contains resources and a specialized family of models for analyzing news and social media content in a multilingual context. • 8 items • Updated about 12 hours ago • 5
DPLM-2: A Multimodal Diffusion Protein Language Model Paper • 2410.13782 • Published Oct 17, 2024 • 21
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains Paper • 2407.18961 • Published Jul 18, 2024 • 40
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 80
YaRN: Efficient Context Window Extension of Large Language Models Paper • 2309.00071 • Published Aug 31, 2023 • 68
Adapting Large Language Models via Reading Comprehension Paper • 2309.09530 • Published Sep 18, 2023 • 77