uer/roberta-mini-wwm-chinese-cluecorpussmall

uer committed on Sep 4, 2023

Commit cb63ac1 · 1 Parent(s): cd32332

Update README.md

Files changed (1)

README.md +9 -2

@@ -11,9 +11,9 @@ widget:
 ## Model description
-This is the set of 6 Chinese Whole Word Masking RoBERTa models pre-trained by [UER-py](https://arxiv.org/abs/1909.05658).
-[Turc et al.](https://arxiv.org/abs/1908.08962) have shown that the standard BERT recipe is effective on a wide range of model sizes. Following their paper, we released the 6 Chinese Whole Word Masking RoBERTa models. In order to facilitate users to reproduce the results, we used the publicly available corpus and word segmentation tool, and provided all training details.
+This is the set of 6 Chinese Whole Word Masking RoBERTa models pre-trained by [UER-py](https://github.com/dbiir/UER-py/), which is introduced in [this paper](https://arxiv.org/abs/1909.05658). Besides, the models could also be pre-trained by [TencentPretrain](https://github.com/Tencent/TencentPretrain) introduced in [this paper](https://arxiv.org/abs/2212.06385), which inherits UER-py to support models with parameters above one billion, and extends it to a multimodal pre-training framework.
+[Turc et al.](https://arxiv.org/abs/1908.08962) have shown that the standard BERT recipe is effective on a wide range of model sizes. Following their paper, we released the 6 Chinese Whole Word Masking RoBERTa models. In order to facilitate users in reproducing the results, we used a publicly available corpus and word segmentation tool, and provided all training details.
 You can download the 6 Chinese RoBERTa miniatures either from the [UER-py Github page](https://github.com/dbiir/UER-py/), or via HuggingFace from the links below:
@@ -175,6 +175,13 @@ python3 scripts/convert_bert_from_uer_to_huggingface.py --input_model_path model
   pages={241},
   year={2019}
 }
+@article{zhao2023tencentpretrain,
+  title={TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities},
+  author={Zhao, Zhe and Li, Yudong and Hou, Cheng and Zhao, Jing and others},
+  journal={ACL 2023},
+  pages={217},
+  year={2023}
 ```
 [2_128]:https://huggingface.co/uer/roberta-tiny-wwm-chinese-cluecorpussmall