a-mannion/umls-kgi-bert-es

Aidan Mannion committed on Nov 14, 2023

Commit b64e6bb · 1 parent: 9e589a4

Update README.md

Files changed (1)

README.md +1 -1


@@ -56,7 +56,7 @@ Experiments on general-domain data suggest that, given it's specialised training
 #### Training Hyperparameters
 - sequence length: 256
-- learning rate $7.5\times10^{-5}$
+- learning rate 7.5e-5
 - linear learning rate schedule with 10,770 warmup steps
 - effective batch size 1500 (15 sequences per batch x 100 gradient accumulation steps)
 - MLM masking probability 0.15
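
The hyperparameters in this hunk can be sanity-checked with a short sketch. It is illustrative only and is not taken from the model's training code. It assumes that "linear schedule" means a linear ramp from zero to the peak rate over the warmup steps followed by a linear decay to zero, and `TOTAL_STEPS` is a hypothetical value because the README hunk does not state the total number of training steps.

```python
# Values stated in the README hunk.
PEAK_LR = 7.5e-5            # learning rate
WARMUP_STEPS = 10_770       # warmup steps
SEQS_PER_BATCH = 15         # sequences per batch
GRAD_ACCUM_STEPS = 100      # gradient accumulation steps

# Hypothetical: the total step count is not given in the source.
TOTAL_STEPS = 100_000


def effective_batch_size(seqs_per_batch=SEQS_PER_BATCH, accum_steps=GRAD_ACCUM_STEPS):
    """Number of sequences contributing to each optimizer update."""
    return seqs_per_batch * accum_steps


def linear_lr(step, peak_lr=PEAK_LR, warmup_steps=WARMUP_STEPS, total_steps=TOTAL_STEPS):
    """Linear warmup to peak_lr, then linear decay to zero (assumed shape)."""
    if step < warmup_steps:
        # Ramp up proportionally to the fraction of warmup completed.
        return peak_lr * step / warmup_steps
    # Decay proportionally to the fraction of post-warmup steps remaining.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / (total_steps - warmup_steps)


if __name__ == "__main__":
    print("effective batch size:", effective_batch_size())
    for step in (0, WARMUP_STEPS // 2, WARMUP_STEPS, TOTAL_STEPS):
        print(f"step {step:>7}: lr = {linear_lr(step):.3e}")
```

With the values above, the product 15 x 100 reproduces the stated effective batch size of 1500, and the learning rate reaches 7.5e-5 at step 10,770.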