This is DPO improved version of cloudyu/Mixtral_11Bx2_MoE_19B
DPO Trainer
metrics not test!
Chat template
Files info