pullpull
/
qwen-2.5-3b-r1-countdown
Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
Commit History
Training in progress, step 450
7a66bcd
verified
pullpull
committed on
Feb 2
Training in progress, step 425
b35470c
verified
pullpull
committed on
Feb 2
Training in progress, step 400
3116e03
verified
pullpull
committed on
Feb 2
Training in progress, step 375
7896c52
verified
pullpull
committed on
Feb 2
Training in progress, step 350
14ec1db
verified
pullpull
committed on
Feb 2
Training in progress, step 325
6880ed0
verified
pullpull
committed on
Feb 2
Training in progress, step 300
e448051
verified
pullpull
committed on
Feb 2
Training in progress, step 275
f0aa33a
verified
pullpull
committed on
Feb 2
Training in progress, step 250
3875d0b
verified
pullpull
committed on
Feb 2
Training in progress, step 225
dfeddfc
verified
pullpull
committed on
Feb 2
Training in progress, step 200
5081997
verified
pullpull
committed on
Feb 2
Training in progress, step 175
5a3cc69
verified
pullpull
committed on
Feb 2
Training in progress, step 150
e51b382
verified
pullpull
committed on
Feb 2
Training in progress, step 100
15b540b
verified
pullpull
committed on
Feb 2
Training in progress, step 75
a4d2d81
verified
pullpull
committed on
Feb 2
Training in progress, step 50
7ea591c
verified
pullpull
committed on
Feb 2
Training in progress, step 25
24df065
verified
pullpull
committed on
Feb 2
initial commit
8681d81
verified
pullpull
committed on
Feb 2