Update README.md
Browse files
README.md
CHANGED
@@ -43,7 +43,7 @@ to get the pruned model.
|
|
43 |
# Benchmark Results
|
44 |
|
45 |
**Benchmark Evaluation**:
|
46 |
-
The model follows
|
47 |
reasoning datasets that doesn't require foreign code to load:
|
48 |
|
49 |
| Model | BoolQ | HellaSwag | ARC-e | ARC-c | OBQA | Average Accuracy |
|
|
|
43 |
# Benchmark Results
|
44 |
|
45 |
**Benchmark Evaluation**:
|
46 |
+
The model follows the original paper's evaluation and perform zero-shot task classification on 5 common sense
|
47 |
reasoning datasets that doesn't require foreign code to load:
|
48 |
|
49 |
| Model | BoolQ | HellaSwag | ARC-e | ARC-c | OBQA | Average Accuracy |
|