Hanyang-W/llama3.1-8b-instruct-dpo-full

Tags: Text Generation, Transformers, TensorBoard, Safetensors, llama, Generated from Trainer, trl, dpo, conversational, text-generation-inference
llama3.1-8b-instruct-dpo-full / runs
  • 1 contributor
History: 3 commits
Latest commit: 48a893e (verified) by Hanyang-W, "Training in progress, step 300", 28 days ago
  • Aug06_14-29-59_harmonious-basil-porcupine-5bfbcc8f57-g8fd2
    Training in progress, step 100 28 days ago
  • Aug06_14-33-19_harmonious-basil-porcupine-5bfbcc8f57-g8fd2
    Training in progress, step 100 28 days ago
  • Aug06_14-35-20_harmonious-basil-porcupine-5bfbcc8f57-g8fd2
    Training in progress, step 100 28 days ago
  • Aug06_14-37-11_harmonious-basil-porcupine-5bfbcc8f57-g8fd2
    Training in progress, step 300 28 days ago