Model Card for llama2_DPO_test_v1 used huggingface TRL _ DPOtrainer

Downloads last month
1,948
Safetensors
Model size
13.2B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for wngkdud/llama2_DPO_test_v1

Quantizations
1 model