Char Level Models My Character level models I trained. Corianas/Microllama_Char_88k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 12 Corianas/Corianas-micro-reactor Text Generation • 85.2M • Updated Feb 17, 2025 • 10 Corianas/Microllama_Char_100k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 14 Corianas/Microllama_Char_300k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 8
Foundational_data TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 39 TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 38
TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 39
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 38
Char Level Models My Character level models I trained. Corianas/Microllama_Char_88k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 12 Corianas/Corianas-micro-reactor Text Generation • 85.2M • Updated Feb 17, 2025 • 10 Corianas/Microllama_Char_100k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 14 Corianas/Microllama_Char_300k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 8
Foundational_data TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 39 TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 38
TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 39
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 38