GerbilLab
/

GerbilBlender-A-15m

Text Generation

Inference Endpoints

Model card Files Files and versions Community

crumb commited on Mar 30, 2023

Commit

9e0e9ba

·

1 Parent(s): 7261716

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -1,8 +1,8 @@
 ---
 license: apache-2.0
 ---
-| Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss ↓ | hellaswag acc ↑ | lambada ppl ↓ | lambada acc ↑ | winogrande acc ↑
-| --- | --- | --- | --- | --- |  --- |  --- |   --- |   --- |   --- |   --- |
 | [GerbilLab/GerbilBlender-A-15m](https://hf.co/GerbilLab/GerbilBlender-A-15m) | 15m | A-Class | 20 | 280M | 131k | 4.9642 |
 "Blender" models, inspired by UL2 pretraining, are trained equally in fill-in-the-middle, causal modelling, and masked language modelling tasks. Special tokens for these models include:

 ---
 license: apache-2.0
 ---
+| Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss ↓ |
+| --- | --- | --- | --- | --- |  --- |  --- |
 | [GerbilLab/GerbilBlender-A-15m](https://hf.co/GerbilLab/GerbilBlender-A-15m) | 15m | A-Class | 20 | 280M | 131k | 4.9642 |
 "Blender" models, inspired by UL2 pretraining, are trained equally in fill-in-the-middle, causal modelling, and masked language modelling tasks. Special tokens for these models include: