Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ItsMaxNorm
/
Llama-3.2-3B-Instruct-Open-R1-Distill
like
1
Text Generation
Transformers
Safetensors
FreedomIntelligence/medical-o1-reasoning-SFT
llama
Generated from Trainer
open-r1
trl
sft
conversational
text-generation-inference
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
5fb7f5c
Llama-3.2-3B-Instruct-Open-R1-Distill
/
trainer_state.json
ItsMaxNorm
Model save
3eadb1c
verified
20 days ago
raw
Copy download link
history
276 kB
File too large to display, you can
check the raw version
instead.