Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
jainamit
/
qwen-2.5-3b-r1-countdown
like
0
Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
2402.03300
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
qwen-2.5-3b-r1-countdown
/
runs
Commit History
Training in progress, step 450
23b2a94
verified
jainamit
commited on
Feb 3
Training in progress, step 425
de60efb
verified
jainamit
commited on
Feb 3
Training in progress, step 400
c424b65
verified
jainamit
commited on
Feb 3
Training in progress, step 375
4b72c70
verified
jainamit
commited on
Feb 3
Training in progress, step 350
33dd323
verified
jainamit
commited on
Feb 3
Training in progress, step 325
7946b99
verified
jainamit
commited on
Feb 3
Training in progress, step 300
3483afe
verified
jainamit
commited on
Feb 3
Training in progress, step 275
e96c866
verified
jainamit
commited on
Feb 3
Training in progress, step 250
06e2498
verified
jainamit
commited on
Feb 3
Training in progress, step 225
47d7a8e
verified
jainamit
commited on
Feb 3
Training in progress, step 200
4e1cdcb
verified
jainamit
commited on
Feb 3
Training in progress, step 175
b511094
verified
jainamit
commited on
Feb 3
Training in progress, step 150
1d646aa
verified
jainamit
commited on
Feb 3
Training in progress, step 125
2feda26
verified
jainamit
commited on
Feb 3
Training in progress, step 100
78afbb4
verified
jainamit
commited on
Feb 3
Training in progress, step 50
8df088a
verified
jainamit
commited on
Feb 3
Training in progress, step 25
001fe92
verified
jainamit
commited on
Feb 3