suolyer committed on
Commit
3c9e67b
·
1 Parent(s): 5c8155a

Create README.md

Files changed (1)
  1. README.md +40 -0
README.md ADDED
@@ -0,0 +1,40 @@
1
+ ---
2
+ language:
3
+ - zh
4
+ license: apache-2.0
5
+ widget:
6
+ - text: "生活的真谛是[MASK]。"
7
+ ---
8
+ # Zhouwenwang-110M model (Chinese), one model of [Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM)
9
+ Zhouwenwang-110M uses a new unified structure and was jointly developed by IDEA-CCNL and Zhuiyi Technology. During pre-training, the model treats the LM (language modeling) and MLM (masked language modeling) tasks in a unified way and adds rotary position embeddings, so that it can both generate and understand text. Zhouwenwang-110M is the largest model for LM and MLM tasks in the Chinese field. It will continue to be optimized in terms of model scale, knowledge integration, and assistance from supervised tasks.
10
+
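The rotary position embedding mentioned above can be illustrated with a small sketch. This is not taken from the Fengshenbang-LM code: the function names `rotary_embed` and `dot`, the interleaved pairing of dimensions, and the base of 10000 are assumptions chosen for illustration. The sketch rotates each pair of vector components by an angle proportional to the token position, which makes the dot product between a query and a key depend only on their relative offset.

```python
import math


def rotary_embed(vec, position, base=10000.0):
    """Rotate consecutive pairs of `vec` by position-dependent angles.

    Hypothetical helper: pairs (0, 1), (2, 3), ... are rotated by
    position * base ** (-i / dim), where i is the even index of the pair.
    """
    dim = len(vec)
    if dim % 2 != 0:
        raise ValueError("vector length must be even")
    out = []
    for i in range(0, dim, 2):
        theta = position * base ** (-i / dim)
        cos_t, sin_t = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        # Standard 2D rotation of the pair (x, y) by angle theta.
        out.extend([x * cos_t - y * sin_t, x * sin_t + y * cos_t])
    return out


def dot(a, b):
    """Plain dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


# Illustrative query and key vectors (made-up values).
q = [0.5, -1.0, 2.0, 0.25]
k = [1.5, 0.3, -0.7, 1.0]

# Both position pairs have the same offset (3), so the scores should match.
score_a = dot(rotary_embed(q, 5), rotary_embed(k, 2))
score_b = dot(rotary_embed(q, 13), rotary_embed(k, 10))
```

Because only the offset between positions affects the score, the same mechanism can serve both left-to-right generation and bidirectional masked prediction without a separate absolute position table.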
11
+ ## Usage
12
+ The Zhouwenwang-110M architecture is not included in [Transformers](https://github.com/huggingface/transformers). Run the following command to get the model code from [Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM):
13
+
14
+ ```shell
15
+ git clone https://github.com/IDEA-CCNL/Fengshenbang-LM.git
16
+ ```
17
+
18
+ ### Load Model
19
+ ```python
20
+ from model.roformer.modeling_roformer import RoFormerModel  # requires the cloned Fengshenbang-LM code on the Python path
21
+ from model.roformer.configuration_roformer import RoFormerConfig
22
+ from transformers import BertTokenizer
23
+
24
+ tokenizer = BertTokenizer.from_pretrained("IDEA-CCNL/Zhouwenwang-110M")
25
+ config = RoFormerConfig.from_pretrained("IDEA-CCNL/Zhouwenwang-110M")
26
+ model = RoFormerModel.from_pretrained("IDEA-CCNL/Zhouwenwang-110M", config=config)
27
+
28
+
29
+ ```
30
+ ## Scores on downstream Chinese tasks (without any data augmentation)
31
+ | Model | afqmc | tnews | iflytek | ocnli | cmnli | wsc | csl |
32
+ | :--------: | :-----: | :----: | :-----: | :----: | :----: | :----: | :----: |
33
+ | roberta-wwm-ext-large | 0.7514 | 0.5872 | 0.6152 | 0.777 | 0.814 | 0.8914 | 0.86 |
34
+ | Zhouwenwang-1.3B | 0.7463 | 0.6036 | 0.6288 | 0.7654 | 0.7741 | 0.8849 | 0.8777 |
35
+
36
+ ## Citation
37
+ If you find this resource useful, please cite the following website in your paper:
38
+ ```
39
+ https://github.com/IDEA-CCNL/Fengshenbang-LM
40
+ ```