jpacifico/Chocolatine-3B-Instruct-DPO-Revised


jpacifico committed on Aug 4, 2024

Commit fa3e742 · verified · 1 Parent(s): 558125f

Update README.md

Files changed (1)

README.md +12 -0

README.md CHANGED

@@ -26,6 +26,18 @@ Chocolatine is the **best-performing 3B model** on the [OpenLLM Leaderboard](htt
 ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/openllm_choco3b_revised.png?raw=false)
 ### MT-Bench-French
 Chocolatine-3B-Instruct-DPO-Revised is outperforming GPT-3.5-Turbo on [MT-Bench-French](https://huggingface.co/datasets/bofenghuang/mt-bench-french) by Bofeng Huang,

 ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/openllm_choco3b_revised.png?raw=false)
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               |27.63|
+|IFEval (0-Shot)    |56.23|
+|BBH (3-Shot)       |37.16|
+|MATH Lvl 5 (4-Shot)|14.5|
+|GPQA (0-shot)      |9.62|
+|MuSR (0-shot)      |15.1|
+|MMLU-PRO (5-shot)  |33.21|
 ### MT-Bench-French
 Chocolatine-3B-Instruct-DPO-Revised is outperforming GPT-3.5-Turbo on [MT-Bench-French](https://huggingface.co/datasets/bofenghuang/mt-bench-french) by Bofeng Huang,
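
A quick sanity check on the table added in this commit: the sketch below recomputes the `Avg.` row from the six benchmark rows. It assumes `Avg.` is the unweighted mean of those six scores, which the diff does not state. The dictionary name `scores` and the variable names are illustrative only. Because the table shows rounded values, the recomputed mean may differ from the reported figure in the second decimal place.

```python
# Scores copied from the table added in this commit (as displayed, already rounded).
scores = {
    "IFEval (0-Shot)": 56.23,
    "BBH (3-Shot)": 37.16,
    "MATH Lvl 5 (4-Shot)": 14.5,
    "GPQA (0-shot)": 9.62,
    "MuSR (0-shot)": 15.1,
    "MMLU-PRO (5-shot)": 33.21,
}
reported_avg = 27.63  # the "Avg." row of the same table

# Assumption: "Avg." is the plain (unweighted) mean of the six benchmark scores.
mean = sum(scores.values()) / len(scores)
difference = abs(mean - reported_avg)

print(f"recomputed mean: {mean:.4f}")
print(f"reported Avg.:   {reported_avg}")
print(f"difference:      {difference:.4f}")
```

A difference below 0.01 is consistent with the reported average having been computed from unrounded scores.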