FastVLM Collection Efficient Vision Encoding for Vision Language Models • 9 items • Updated about 17 hours ago • 78
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 5 items • Updated 2 days ago • 65
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 27 days ago • 334
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 23 days ago • 226
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional at agentic intelligence • 2 items • Updated Jul 12 • 117
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional at agent, long-context, and thinking tasks • 7 items • Updated Jul 1 • 75
xLAM-2 Collection A family of Large Action Models for multi-turn conversation and tool use • 10 items • Updated Jul 28 • 21
FLUX.1 Collection A collection of our FLUX.1 models and LoRAs. • 10 items • Updated 28 days ago • 192
MiniMax-M1 Collection MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated Jul 3 • 110
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 12 items • Updated about 18 hours ago • 33
Jina Reader-LM Collection Convert HTML content to LLM-friendly Markdown/JSON content • 4 items • Updated Jul 20 • 16