metadata
license: llama3
pipeline_tag: text-generation
library_name: transformers
tags:
- KTO
datasets:
- princeton-nlp/llama3-ultrafeedback
language:
- en
base_model:
- meta-llama/Meta-Llama-3-8B-Instruct
This is a model released from the preprint: SimPO: Simple Preference Optimization with a Reference-Free Reward Please refer to our repository for more details.