Update README.md
Browse files
README.md
CHANGED
@@ -96,15 +96,15 @@ Seed-Coder-8B-Instruct has been evaluated on a wide range of coding tasks, inclu
|
|
96 |
|
97 |
| Model | HumanEval | MBPP | MHPP | BigCodeBench (Full) | BigCodeBench (Hard) | LiveCodeBench (2410 – 2502) |
|
98 |
|:-----------------------------:|:---------:|:----:|:----:|:-------------------:|:-------------------:|:-------------------------:|
|
99 |
-
| CodeLlama-7B-Instruct | 40.9 | 54.0 | 6.7 |
|
100 |
-
| DeepSeek-Coder-6.7B-Instruct | 74.4 | 74.9 | 20.0 |
|
101 |
-
| CodeQwen1.5-7B-Chat | 83.5 | 77.7 | 17.6 |
|
102 |
-
| Yi-Coder-9B-Chat | 82.3 | 82.0 | 26.7 |
|
103 |
-
| Llama-3.1-8B-Instruct | 68.3 | 70.1 | 17.1 |
|
104 |
-
| OpenCoder-8B-Instruct | 83.5 | 79.1 | 30.5 |
|
105 |
-
| Qwen2.5-Coder-7B-Instruct | 88.4 |
|
106 |
| Qwen3-8B | 84.8 | 77.0 | 32.8 | 51.7 | 23.0 | 23.5 |
|
107 |
-
| Seed-Coder-8B-Instruct | 84.8 | 85.2 | 36.2 | 53.3 |
|
108 |
|
109 |
|
110 |
For detailed benchmark performance, please refer to our [📑 Technical Report](https://github.com/ByteDance-Seed/Seed-Coder/blob/master/Seed-Coder.pdf).
|
|
|
96 |
|
97 |
| Model | HumanEval | MBPP | MHPP | BigCodeBench (Full) | BigCodeBench (Hard) | LiveCodeBench (2410 – 2502) |
|
98 |
|:-----------------------------:|:---------:|:----:|:----:|:-------------------:|:-------------------:|:-------------------------:|
|
99 |
+
| CodeLlama-7B-Instruct | 40.9 | 54.0 | 6.7 | 25.7 | 4.1 | 3.6 |
|
100 |
+
| DeepSeek-Coder-6.7B-Instruct | 74.4 | 74.9 | 20.0 | 43.8 | 15.5 | 9.6 |
|
101 |
+
| CodeQwen1.5-7B-Chat | 83.5 | 77.7 | 17.6 | 43.6 | 15.5 | 3.0 |
|
102 |
+
| Yi-Coder-9B-Chat | 82.3 | 82.0 | 26.7 | 49.0 | 17.6 | 17.5 |
|
103 |
+
| Llama-3.1-8B-Instruct | 68.3 | 70.1 | 17.1 | 40.5 | 13.5 | 11.5 |
|
104 |
+
| OpenCoder-8B-Instruct | 83.5 | 79.1 | 30.5 | 50.9 | 18.9 | 17.1 |
|
105 |
+
| Qwen2.5-Coder-7B-Instruct | **88.4** | 83.5 | 26.7 | 48.8 | 20.3 | 17.3 |
|
106 |
| Qwen3-8B | 84.8 | 77.0 | 32.8 | 51.7 | 23.0 | 23.5 |
|
107 |
+
| Seed-Coder-8B-Instruct | 84.8 | **85.2** | **36.2** | **53.3** | **26.4** | **24.7** |
|
108 |
|
109 |
|
110 |
For detailed benchmark performance, please refer to our [📑 Technical Report](https://github.com/ByteDance-Seed/Seed-Coder/blob/master/Seed-Coder.pdf).
|