XueyingJia
/
pythia-1b-online-dpo-ground-truth-lead
Transformers
Safetensors
XueyingJia/online_dpo_repo_augmented
Generated from Trainer
trl
online-dpo
Inference Endpoints
arxiv:
2402.04792
main
pythia-1b-online-dpo-ground-truth-lead
/
adapter_model.safetensors
Commit History
Training in progress, step 2513 (dd22b63, verified), XueyingJia committed on Dec 10, 2024
Training in progress, step 2500 (3f4279d, verified), XueyingJia committed on Dec 10, 2024
Training in progress, step 2400 (887f5dc, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 2300 (91f92d1, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 2200 (61d4b79, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 2100 (33ba1cb, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 2000 (cbb412a, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 1900 (6cec586, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 1800 (4904327, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 1700 (08238cd, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 1500 (8877707, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 1400 (4efa98b, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 1300 (3221eba, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 1200 (805271f, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 1100 (5215470, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 1000 (b202fde, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 900 (4f1b091, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 800 (8c1d7e4, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 700 (453c482, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 600 (da5d9a1, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 500 (0196874, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 400 (b4196d6, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 300 (0c268b7, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 200 (cd4f28c, verified), XueyingJia committed on Dec 9, 2024
Training in progress, step 100 (a32c329, verified), XueyingJia committed on Dec 9, 2024