izumi-lab
/

electra-small-japanese-discriminator

Inference Endpoints

Model card Files Files and versions Community

izumilab commited on Oct 8, 2021

Commit

058ca22

·

1 Parent(s): 8df1234

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -40,7 +40,7 @@ The vocabulary size is 32768.
 ## Training
-The models are trained with the same configuration as ELECTRA small in the [original ELECTRA paper](https://arxiv.org/abs/2003.10555); 128 tokens per instance, 128 instances per batch, and 1M training steps.
 The size of the generator is the same of the discriminator.

 ## Training
+The models are trained with the same configuration as ELECTRA small in the [original ELECTRA paper](https://arxiv.org/abs/2003.10555) except size; 128 tokens per instance, 128 instances per batch, and 1M training steps.
 The size of the generator is the same of the discriminator.