Update README.md
README.md
@@ -51,7 +51,8 @@ We've noticed some errors in the previous code and updated it.
 
 
 <details>
-<summary>Depricated Score</summary>
+<summary>Depricated Score</summary>
+
 | Model | Chemistry | Math | Physics | Physics Word Puzzles | Puzzles | Average |
 |---------------------------------------|-----------|-------|---------|----------------------|---------|---------|
 | o1-2024-12-17 | 42.9 | 74.5 | 77.8 | 70.0 | 30.8 | 59.2 |
@@ -66,7 +67,8 @@ We've noticed some errors in the previous code and updated it.
 | Exaone-3.5-32B-Instruct | 28.6 | 27.3 | 22.2 | 40.0 | 0.0 | 23.6 |
 | gpt-4o-mini-2024-07-18 | 7.1 | 29.1 | 22.2 | 50.0 | 0.0 | 21.7 |
 | UNIVA-Bllossom_DeepSeek-llama3.1-Bllossom-8B | 14.3 | 10.9 | 33.3 | 0.0 | 0.0 | 11.7 |
-
+
+</details>
 
 
 