Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 26 days ago • 66
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 159
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9, 2024 • 55
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 111
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 194
Aya Datasets Collection The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 5 items • Updated 15 days ago • 17
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated 15 days ago • 55