Aidan Mannion committed on
Commit
75f0bed
·
1 Parent(s): baad455

Update README.md

Files changed (1)
  1. README.md +2 -1
README.md CHANGED
@@ -90,7 +90,8 @@ This model was evaluated on the following datasets:
  #### Metrics

  <!-- These are the evaluation metrics being used, ideally with a description of why. -->
- [More Information Needed]
+ We provide the macro-averaged F1 scores here; given that all of the downstream token classification tasks in these experiments show significant class imbalance, the weighted-average scores tend to be uniformly higher than their macro-averaged counterparts.
+ In the interest of more fairly representing the less prevalent classes and highlighting the difficulty of capturing the long-tailed nature of the distributions in these datasets, we stick to the macro average.

  ### Results
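
The distinction the added README text draws between macro-averaged and weighted-average F1 can be illustrated with a small, dependency-free sketch. Everything in it is an assumption made for illustration: the helper names (`per_class_f1`, `macro_f1`, `weighted_f1`) and the toy tag sequences are hypothetical and do not come from the model card or its evaluation code.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return {label: F1} for every label seen in y_true or y_pred."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    scores = {}
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the denominator is 0.
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1: every class counts equally."""
    scores = per_class_f1(y_true, y_pred)
    return sum(scores.values()) / len(scores)


def weighted_f1(y_true, y_pred):
    """Mean of per-class F1 weighted by each class's support in y_true."""
    scores = per_class_f1(y_true, y_pred)
    support = Counter(y_true)
    return sum(scores[label] * support[label] for label in scores) / len(y_true)


# Hypothetical, imbalanced toy data: 8 "O" tokens and 2 "ENT" tokens.
y_true = ["O"] * 8 + ["ENT"] * 2
# The hypothetical model misses one of the two rare "ENT" tokens.
y_pred = ["O"] * 8 + ["O", "ENT"]

macro = macro_f1(y_true, y_pred)
weighted = weighted_f1(y_true, y_pred)
print(f"macro F1:    {macro:.3f}")
print(f"weighted F1: {weighted:.3f}")
```

In this toy case the single error falls on the rare class, so the weighted average, dominated by the frequent "O" class, comes out higher than the macro average. This mirrors the pattern the commit describes, though it is only one constructed example and not evidence about the actual datasets.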