Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ZHLiu627
/
zephyr-7b-dpo-full
like
0
Text Generation
Transformers
TensorBoard
Safetensors
updated
original
mistral
alignment-handbook
Generated from Trainer
trl
dpo
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
zephyr-7b-dpo-full
Commit History
RDPO-7b-beta0.01-eta0.001
1ccad75
verified
ZHLiu627
commited on
Mar 9, 2024
Model save
a65e9a6
verified
ZHLiu627
commited on
Mar 9, 2024
End of training
450b8c2
verified
ZHLiu627
commited on
Mar 5, 2024
Model save
a40768c
verified
ZHLiu627
commited on
Mar 5, 2024
End of training
ff8c91a
verified
ZHLiu627
commited on
Feb 25, 2024
Model save
336d38c
verified
ZHLiu627
commited on
Feb 25, 2024
initial commit
a59c1cb
verified
ZHLiu627
commited on
Feb 25, 2024