Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
sylvain471
/
llama-3-1-8b-dpo-math-ep1
like
0
Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
trl
dpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
2305.18290
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
llama-3-1-8b-dpo-math-ep1
Commit History
Add files using upload-large-folder tool
179e9c5
verified
sylvain471
commited on
Feb 11
Model save
6dc7cde
verified
sylvain471
commited on
Feb 10
Training in progress, epoch 3
d82da56
verified
sylvain471
commited on
Feb 10
Training in progress, epoch 2
addc6d2
verified
sylvain471
commited on
Feb 10
Training in progress, epoch 1
c54e3ff
verified
sylvain471
commited on
Feb 10
initial commit
618dad9
verified
sylvain471
commited on
Feb 10