Yaxin1992/llama3-8b-dpo-1000-hq
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License: llama3
main
llama3-8b-dpo-1000-hq
Commit History
Model save
1d90b10
verified
Yaxin1992
committed on
Aug 20, 2024
Training in progress, step 1000
c938fd3
verified
Yaxin1992
committed on
Aug 20, 2024
Training in progress, step 900
5b4c17c
verified
Yaxin1992
committed on
Aug 20, 2024
Training in progress, step 600
485870f
verified
Yaxin1992
committed on
Aug 20, 2024
Training in progress, step 300
c353fc3
verified
Yaxin1992
committed on
Aug 20, 2024
Model save
e48fe20
verified
Yaxin1992
committed on
Aug 19, 2024
Training in progress, step 1000
dd82f88
verified
Yaxin1992
committed on
Aug 19, 2024
Training in progress, step 900
f9f8278
verified
Yaxin1992
committed on
Aug 19, 2024
Training in progress, step 600
ef1aa15
verified
Yaxin1992
committed on
Aug 19, 2024
Training in progress, step 300
6599938
verified
Yaxin1992
committed on
Aug 19, 2024
initial commit
4c87a83
verified
Yaxin1992
committed on
Aug 19, 2024