Adding Evaluation Results

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1) hide show

README.md +18 -5

README.md CHANGED Viewed

@@ -1,6 +1,8 @@
 ---
 license: llama3
-base_model: Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3
 tags:
 - alignment-handbook
 - axolotl
@@ -8,6 +10,7 @@ tags:
 - dpo
 - sft
 - generated_from_trainer
 datasets:
 - princeton-nlp/llama3-ultrafeedback-armorm
 - Magpie-Align/Magpie-Pro-MT-300K-v0.1
@@ -16,9 +19,6 @@ datasets:
 model-index:
 - name: Magpie-Align/Llama-3-8B-Magpie-Align-v0.3
   results: []
-language:
-- en
-- zh
 ---
 ![Magpie](https://cdn-uploads.huggingface.co/production/uploads/653df1323479e9ebbe3eb6cc/FWWILXrAGNwWr52aghV0S.png)
@@ -366,4 +366,17 @@ ArmoRM paper:
 }
 ```
-**Questions?** Please contact [Zhangchen](https://zhangchenxu.com/) by email.

 ---
+language:
+- en
+- zh
 license: llama3
 tags:
 - alignment-handbook
 - axolotl
 - dpo
 - sft
 - generated_from_trainer
+base_model: Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3
 datasets:
 - princeton-nlp/llama3-ultrafeedback-armorm
 - Magpie-Align/Magpie-Pro-MT-300K-v0.1
 model-index:
 - name: Magpie-Align/Llama-3-8B-Magpie-Align-v0.3
   results: []
 ---
 ![Magpie](https://cdn-uploads.huggingface.co/production/uploads/653df1323479e9ebbe3eb6cc/FWWILXrAGNwWr52aghV0S.png)
 }
 ```
+**Questions?** Please contact [Zhangchen](https://zhangchenxu.com/) by email.
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Magpie-Align__Llama-3-8B-Magpie-Align-v0.3)
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               |16.89|
+|IFEval (0-Shot)    |44.97|
+|BBH (3-Shot)       |24.31|
+|MATH Lvl 5 (4-Shot)| 2.57|
+|GPQA (0-shot)      | 2.01|
+|MuSR (0-shot)      | 3.74|
+|MMLU-PRO (5-shot)  |23.71|