Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
hannahbillo
/
dpo-llama3-8b-sample-rules
like
0
PEFT
TensorBoard
Safetensors
llama
trl
dpo
Generated from Trainer
License:
llama3.1
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
hannahbillo
commited on
Aug 18, 2024
Commit
43f264c
·
verified
·
1 Parent(s):
2cee8f0
Create config.json
Browse files
Files changed (1)
hide
show
config.json
+0
-0
config.json
ADDED
Viewed
File without changes