emmabhl/Qwen1.5-0.5B-Chat-ORCA-cDPO

Text Generation

emmabhl committed on May 29, 2024

Commit a08f93c · verified · 1 Parent(s): 7cc48b9

Update README.md

Files changed (1)

README.md +2 -2

@@ -12,7 +12,7 @@ widget:
 # Qwen1.5-0.5B-Chat with EPFL DPO fine-tuning
-Qwen1.5-0.5B-Chat DPO fine-tuned on the microsoft/orca-math-word-problems-200k dataset that consists of ~200K grade school math word problems
+Qwen1.5-0.5B-Chat DPO fine-tuned on the Orca Math dataset that consists of ~200K grade school math word problems
 ## Model Details
@@ -29,7 +29,7 @@ answer open-ended and multiple-choice questions from Orca Math dataset
 ### Training Data
-Training data is not publicly available.
+HuggingFace dataset : microsoft/orca-math-word-problems-200k
 ### Training Procedure