Add evaluation results on the autoevaluate--zero-shot-classification-sample config and test split of autoevaluate/zero-shot-classification-sample
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the autoevaluate--zero-shot-classification-sample config and test split of the [autoevaluate/zero-shot-classification-sample](https://huggingface.co/datasets/autoevaluate/zero-shot-classification-sample) dataset by
@lewtun
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-a02353d8-c94a-4476-bd14-15028ee3f918-5452).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=autoevaluate/zero-shot-classification-sample).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=autoevaluate/zero-shot-classification-sample).
|
@@ -1,9 +1,31 @@
|
|
| 1 |
---
|
| 2 |
language: en
|
| 3 |
-
inference: false
|
| 4 |
tags:
|
| 5 |
- text-generation
|
| 6 |
- opt
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
---
|
| 8 |
|
| 9 |
Hello. I am a model, to be evaluated.
|
|
|
|
| 1 |
---
|
| 2 |
language: en
|
|
|
|
| 3 |
tags:
|
| 4 |
- text-generation
|
| 5 |
- opt
|
| 6 |
+
inference: false
|
| 7 |
+
model-index:
|
| 8 |
+
- name: autoevaluate/zero-shot-classification
|
| 9 |
+
results:
|
| 10 |
+
- task:
|
| 11 |
+
type: zero-shot-classification
|
| 12 |
+
name: Zero-Shot Text Classification
|
| 13 |
+
dataset:
|
| 14 |
+
name: autoevaluate/zero-shot-classification-sample
|
| 15 |
+
type: autoevaluate/zero-shot-classification-sample
|
| 16 |
+
config: autoevaluate--zero-shot-classification-sample
|
| 17 |
+
split: test
|
| 18 |
+
metrics:
|
| 19 |
+
- type: accuracy
|
| 20 |
+
value: 0.6666666666666666
|
| 21 |
+
name: Accuracy
|
| 22 |
+
verified: true
|
| 23 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiM2Q4YjIxMjdlYTJlNzE1NTE2MTgzM2RmMzU2Y2EzMDI0YWI0MWFkMzdhNGU4MzM5M2FmZGE2YWEzY2E5ZTEzNyIsInZlcnNpb24iOjF9.jBlcu153kaqz6iRGtOxG8Q9ynayhjZnIRwDp30-3gShW1UfQqXBsfv5dK-jxh03NT_uEhizWBPUt5Whc_s6XBw
|
| 24 |
+
- type: loss
|
| 25 |
+
value: 0.5084398140509924
|
| 26 |
+
name: Loss
|
| 27 |
+
verified: true
|
| 28 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMThjZjBkMWEwYTM0OTIzZjRjZmVmYzYyZjk2ZjYzZTRjOGIzM2U0MTE0MjljNGFlODk1ZDYwMjM3NTBmZWZmOCIsInZlcnNpb24iOjF9.BqM9rOmxo4xWUgyYr1g7c2gNZ5Q2N_F-QzLMVk2-OgE3zQNLhO_4vWEHRz-uPZFG_JW0i2JvheZV2zTXLcnQBA
|
| 29 |
---
|
| 30 |
|
| 31 |
Hello. I am a model, to be evaluated.
|