Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. ⢠7 items ⢠Updated Dec 24, 2025 ⢠55
Cosmos Collection ā ļø This collection is archived. š https://huggingface.co/collections/nvidia/nvidia-cosmos-2 ⢠31 items ⢠Updated 8 days ago ⢠299
šŖ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos ⢠12 items ⢠Updated May 5, 2025 ⢠244
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 ⢠50 items ⢠Updated Dec 11, 2025 ⢠137
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi ⢠16 items ⢠Updated Dec 24, 2025 ⢠243
Octopus v2: On-device language model for super agent Paper ⢠2404.01744 ⢠Published Apr 2, 2024 ⢠58
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models Paper ⢠2402.07033 ⢠Published Feb 10, 2024 ⢠19
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Paper ⢠2401.09417 ⢠Published Jan 17, 2024 ⢠62
ML for Tools Collection Collection of papers about ML for using tools! ⢠25 items ⢠Updated Jan 17, 2024 ⢠10
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation Paper ⢠2401.04468 ⢠Published Jan 9, 2024 ⢠49
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! ⢠30 items ⢠Updated Jun 12, 2024 ⢠250
Understanding LLMs: A Comprehensive Overview from Training to Inference Paper ⢠2401.02038 ⢠Published Jan 4, 2024 ⢠65
Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers Paper ⢠2312.03694 ⢠Published Dec 6, 2023 ⢠2