Update README.md
Browse files
README.md
CHANGED
@@ -80,7 +80,7 @@ See the Falcon 180B model card for an example of this.
|
|
80 |
|
81 |
## Performance
|
82 |
|
83 |
-
| Model | Average | 2 LC | BBH | DROP | GSM8k | IFEval | MATH | MMLU | Safety | PopQA | TruthQA |
|
84 |
|-------|---------|------|-----|------|-------|--------|------|------|--------|-------|---------|
|
85 |
| **Closed API models** | | | | | | | | | | | |
|
86 |
| GPT-3.5 Turbo 0125 | 59.6 | 38.7 | 66.6 | 70.2 | 74.3 | 66.9 | 41.2 | 70.2 | 69.1 | 45.0 | 62.9 |
|
|
|
80 |
|
81 |
## Performance
|
82 |
|
83 |
+
| Model | Average | AlpacaEval 2 LC | BBH | DROP | GSM8k | IFEval | MATH | MMLU | Safety | PopQA | TruthQA |
|
84 |
|-------|---------|------|-----|------|-------|--------|------|------|--------|-------|---------|
|
85 |
| **Closed API models** | | | | | | | | | | | |
|
86 |
| GPT-3.5 Turbo 0125 | 59.6 | 38.7 | 66.6 | 70.2 | 74.3 | 66.9 | 41.2 | 70.2 | 69.1 | 45.0 | 62.9 |
|