hawei_LinkedIn commited on
Commit
63e078d
·
1 Parent(s): 0f70593

update explanation of benchmark result table

Browse files
Files changed (1) hide show
  1. README.md +10 -12
README.md CHANGED
@@ -86,18 +86,6 @@ The table below summarizes evaluation results across mathematical tasks and orig
86
  | **Control LLM*** | 36.0 | 61.7 | **89.7**| 62.5 | 82.5 | 30.8 | **71.6**| 45.4 | **57.6** | **60.0** |
87
 
88
  ---
89
-
90
- ### Catastrophic Forgetting on OpenMath
91
- The following plot illustrates and compares catastrophic forgetting mitigation during training
92
-
93
- ![Catastrophic Forgetting](plots/ControlLLM_CF_Plot_Math.png)
94
-
95
- ### Alignment Result
96
- The plot below highlights the alignment result of the model trained with Control LLM.
97
-
98
- ![Alignment](plots/alignment_best.png)
99
-
100
-
101
  ### Explanation:
102
  - **MH**: MathHard
103
  - **M**: Math
@@ -109,3 +97,13 @@ The plot below highlights the alignment result of the model trained with Control
109
  - **MLUP**: MMLU Pro
110
  - **O-Avg**: Original Capability - Average across ARC, GPQA, MMLU, and MLUP
111
  - **Overall**: Combined average across all tasks
 
 
 
 
 
 
 
 
 
 
 
86
  | **Control LLM*** | 36.0 | 61.7 | **89.7**| 62.5 | 82.5 | 30.8 | **71.6**| 45.4 | **57.6** | **60.0** |
87
 
88
  ---
 
 
 
 
 
 
 
 
 
 
 
 
89
  ### Explanation:
90
  - **MH**: MathHard
91
  - **M**: Math
 
97
  - **MLUP**: MMLU Pro
98
  - **O-Avg**: Original Capability - Average across ARC, GPQA, MMLU, and MLUP
99
  - **Overall**: Combined average across all tasks
100
+
101
+ ### Catastrophic Forgetting on OpenMath
102
+ The following plot illustrates and compares catastrophic forgetting mitigation during training
103
+
104
+ ![Catastrophic Forgetting](plots/ControlLLM_CF_Plot_Math.png)
105
+
106
+ ### Alignment Result
107
+ The plot below highlights the alignment result of the model trained with Control LLM.
108
+
109
+ ![Alignment](plots/alignment_best.png)