Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 3 days ago • 22
Hamanasu Collection A brand new series of Models from yours truly, Designed for Intelligence, Creativity and Roleplay - R/Locallama keeps DELETING MY GODDAMN COMMENTS • 24 items • Updated 5 days ago • 8
Llama Nemotron Collection Open, Production-ready Enterprise Models • 3 items • Updated 4 days ago • 20
EXAONE-Deep Collection EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated 5 days ago • 77
DeepHermes Collection Preview models of hybrid reasoner Hermes series • 6 items • Updated 10 days ago • 26
D_AU - Thinking / Reasoning Models - Reg and MOEs. Collection QwQ,DeepSeek, EXONE, DeepHermes, and others "thinking/reasoning" AIs / LLMs in regular model type, MOE (mix of experts), and Hybrid model formats. • 38 items • Updated 1 day ago • 3
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 19 days ago • 68
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 8 items • Updated 10 days ago • 56
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated Feb 17 • 30
Anemll converted models Collection Preconverted models for https://github.com/Anemll/Anemll. ctx = context, 0.1.x = converted with Anemll v. 0.1.x. x = 1 & 2 are equal model wise • 6 items • Updated Feb 19 • 3