bigscience
/

T0pp

@@ -13,7 +13,7 @@ You can use the models to perform inference on tasks by specifying your query in
 # How to use
-We make available the models presented in our [paper](TODO) along with the ablation models. We recommend using the [T0pp_11B](https://huggingface.co/bigscience/T0pp_11B) checkpoint as it leads (on average) to the best performances on a variety of NLP tasks.
 |Model|Number of parameters|
 |-|-|
@@ -74,11 +74,11 @@ For reproducibility, we release the data we used for training (and evaluation) i
 # Evaluation data
-We systematically evaluate our models on a suite of held-out tasks:
 |Task category|Datasets|
 |-|-|
-|Natural language inference|ANLI, CB, HANS, RTE|
 |Coreference resolution|WSC, Winogrande|
 |Word sense disambiguation|WiC|
 |Sentence completion|COPA, HellaSwag, Story Cloze|

 # How to use
+We make available the models presented in our [paper](TODO) along with the ablation models. We recommend using the [T0pp_11B](https://huggingface.co/bigscience/T0pp_11B) (pronounce "T zero plus plus") checkpoint as it leads (on average) to the best performances on a variety of NLP tasks.
 |Model|Number of parameters|
 |-|-|
 # Evaluation data
+We evaluate our models on a suite of held-out tasks:
 |Task category|Datasets|
 |-|-|
+|Natural language inference|ANLI, CB, RTE|
 |Coreference resolution|WSC, Winogrande|
 |Word sense disambiguation|WiC|
 |Sentence completion|COPA, HellaSwag, Story Cloze|