Model Card for llama2_DPO_test_v1, trained with Hugging Face TRL DPOTrainer