CombinHorizon's picture
Update README.md
deb4f6c verified
|
raw
history blame
445 Bytes
metadata
license: llama3
pipeline_tag: text-generation
library_name: transformers
tags:
  - KTO
datasets:
  - princeton-nlp/llama3-ultrafeedback
language:
  - en
base_model:
  - meta-llama/Meta-Llama-3-8B-Instruct

This is a model released from the preprint: SimPO: Simple Preference Optimization with a Reference-Free Reward Please refer to our repository for more details.