RaymondAISG commited on
Commit
b73ee8d
·
verified ·
1 Parent(s): e41fd97

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -46,14 +46,14 @@ We also evaluated the model on English capabilities using tasks from the Open LL
46
 
47
  **English**
48
 
49
- | Model | ARC | BBH | HellaSwag | MMLU | GSM8k | Average |
50
- | ---------------------------------------- |:-----:|:-----:|:---------:|:-----:|:-----:|:-------:|
51
- | Qwen/Qwen2-7B | 61.86 | 53.10 | 80.63 | 70.45 | 78.09 | 68.83 |
52
- | google/gemma-2-9b | 68.00 | 53.53 | 82.73 | 70.26 | 63.53 | 67.61 |
53
- | aisingapore/llama3-8b-cpt-sealionv2-base | 58.87 | 47.70 | 81.14 | 63.11 | 50.49 | 60.26 |
54
- | meta-llama/Meta-Llama-3-8B | 57.85 | 46.09 | 81.89 | 65.10 | 45.34 | 59.25 |
55
- | mistralai/Mistral-7B-v0.3 | 59.56 | 44.89 | 82.97 | 62.36 | 33.36 | 56.63 |
56
- | Sail/Sailor-7B | 50.34 | 35.65 | 76.11 | 52.80 | 33.81 | 49.74 |
57
 
58
 
59
  ## Training Details
 
46
 
47
  **English**
48
 
49
+ | Model | ARC | BBH | HellaSwag | MMLU | GSM8k | Average |
50
+ | ----------------------------------------- |:-----:|:-----:|:---------:|:-----:|:-----:|:-------:|
51
+ | Qwen/Qwen2-7B | 61.86 | 53.10 | 80.63 | 70.45 | 78.09 | 68.83 |
52
+ | google/gemma-2-9b | 68.00 | 53.53 | 82.73 | 70.26 | 63.53 | 67.61 |
53
+ | aisingapore/llama3-8b-cpt-sea-lionv2-base | 58.87 | 47.70 | 81.14 | 63.11 | 50.49 | 60.26 |
54
+ | meta-llama/Meta-Llama-3-8B | 57.85 | 46.09 | 81.89 | 65.10 | 45.34 | 59.25 |
55
+ | mistralai/Mistral-7B-v0.3 | 59.56 | 44.89 | 82.97 | 62.36 | 33.36 | 56.63 |
56
+ | Sail/Sailor-7B | 50.34 | 35.65 | 76.11 | 52.80 | 33.81 | 49.74 |
57
 
58
 
59
  ## Training Details