view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling 1 day ago • 36
NeuTTS Air Collection NeuTTS Air is a speech foundation model that runs on CPU in real-time, with instant voice cloning. • 3 items • Updated 1 day ago • 21
NeuTTS Nano Multilingual Collection Collection NeuTTS Nano is a TTS model, 3x smaller than NeuTTS Air, that runs on CPU in real-time - now in English, Spanish, French, and German versions! • 12 items • Updated 1 day ago • 13
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition • Updated about 6 hours ago • 5.21k • 510
mmBERT: a modern multilingual encoder Collection mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9, 2025 • 51
Falcon-H1-Tiny Collection A series of extremely small, yet powerful language models redefining capabilities at small scale • 22 items • Updated 29 days ago • 35
Running 6 Nanobeir Hybrid Evaluation ⚡ 6 Overview of a selection of embedding model across dimensions
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated 23 days ago • 208