Aidan Mannion
commited on
Commit
·
75f0bed
1
Parent(s):
baad455
Update README.md
Browse files
README.md
CHANGED
@@ -90,7 +90,8 @@ This model was evaluated on the following datasets:
|
|
90 |
#### Metrics
|
91 |
|
92 |
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
93 |
-
|
|
|
94 |
|
95 |
### Results
|
96 |
|
|
|
90 |
#### Metrics
|
91 |
|
92 |
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
93 |
+
We provide the macro-averaged F1 scores here; given that all of the downstream token classification tasks in these experiments show significant class imbalance, the weighted-average scores tend to be uniformly higher than their macro-averaged counterparts.
|
94 |
+
In the interest of more fairly representing the less prevalent classes and highlighting the difficulty of capturing the long-tailed nature of the distributions in these datasets, we stick to the macro average.
|
95 |
|
96 |
### Results
|
97 |
|