Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
thewordsmiths
/
Mistral-7B-v0.3_sft_LoRA_100000_dpo_LoRA
like
0
Follow
The Wordsmiths
3
Transformers
Safetensors
English
text-generation-inference
unsloth
mistral
trl
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Mistral-7B-v0.3_sft_LoRA_100000_dpo_LoRA
Commit History
Upload model trained with Unsloth
33e810f
verified
paultltc
commited on
Jun 3, 2024
Upload model trained with Unsloth
719a76e
verified
paultltc
commited on
Jun 3, 2024
Upload README.md with huggingface_hub
2fb4c34
verified
paultltc
commited on
Jun 3, 2024
initial commit
1e1ed8d
verified
paultltc
commited on
Jun 3, 2024