Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
accuracy-maker
/
Llama-3.2-1B-GRPO-gsm8k
like
0
Text Generation
Safetensors
English
llama
instruct
post-training
GRPO
conversational
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
main
Llama-3.2-1B-GRPO-gsm8k
Commit History
Upload folder using huggingface_hub
762c3a6
verified
accuracy-maker
commited on
Feb 12
initial commit
8709827
verified
accuracy-maker
commited on
Feb 12