ctoraman committed
Commit 79222db · 1 Parent(s): a796138

readme updated

Files changed (1):
  1. README.md +6 -1
README.md CHANGED
@@ -18,8 +18,13 @@ Model architecture is similar to bert-medium (8 layers, 8 heads, and 512 hidden
The details can be found at this paper:
https://arxiv.org/...

- The following code segment can be used for initializing the tokenizer, example max length (514) can be changed:
+ The following code can be used for model loading and tokenization; the example max length (514) can be changed:
```
+ model = AutoModel.from_pretrained([model_path])
+ # for sequence classification:
+ # model = AutoModelForSequenceClassification.from_pretrained([model_path], num_labels=[num_classes])
+
+ tokenizer = ByT5Tokenizer.from_pretrained("google/byt5-small")
tokenizer = PreTrainedTokenizerFast(tokenizer_file=[file_path])
tokenizer.mask_token = "[MASK]"
tokenizer.cls_token = "[CLS]"
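
For reference, here is a minimal, self-contained sketch of the updated snippet, assuming the Hugging Face transformers library. The concrete paths, the num_classes value, and the model_max_length assignment are illustrative placeholders, not part of the commit. Note also that in the committed snippet the ByT5Tokenizer assignment is immediately overwritten by the PreTrainedTokenizerFast line, so the sketch treats it as a commented-out character-level alternative.

```
from transformers import (
    AutoModel,
    AutoModelForSequenceClassification,
    ByT5Tokenizer,
    PreTrainedTokenizerFast,
)

model_path = "path/to/model"               # hypothetical; the README uses [model_path]
tokenizer_file = "path/to/tokenizer.json"  # hypothetical; the README uses [file_path]
num_classes = 2                            # hypothetical; the README uses [num_classes]

# Load the pretrained encoder, or the sequence-classification variant instead:
model = AutoModel.from_pretrained(model_path)
# model = AutoModelForSequenceClassification.from_pretrained(model_path, num_labels=num_classes)

# Character-level alternative shown in the README:
# tokenizer = ByT5Tokenizer.from_pretrained("google/byt5-small")
tokenizer = PreTrainedTokenizerFast(tokenizer_file=tokenizer_file)
tokenizer.mask_token = "[MASK]"
tokenizer.cls_token = "[CLS]"
tokenizer.model_max_length = 514  # the "example max length (514)"; where it is set is an assumption

# Tokenize an example and run a forward pass.
inputs = tokenizer("example text", return_tensors="pt", truncation=True)
outputs = model(**inputs)
```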