Yaxin1992/llama3-8b-dpo-1000-hq
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License: llama3
Commit History
Training in progress, step 900
f9f8278
verified
Yaxin1992 committed on Aug 19, 2024
Training in progress, step 600
ef1aa15
verified
Yaxin1992 committed on Aug 19, 2024
Training in progress, step 300
6599938
verified
Yaxin1992 committed on Aug 19, 2024
initial commit
4c87a83
verified
Yaxin1992 committed on Aug 19, 2024