Update README.md
README.md CHANGED
@@ -27,6 +27,38 @@ model = RoFormerModel.from_pretrained("IDEA-CCNL/Zhouwenwang-110M")
```
### Generate task

You can use Zhouwenwang-110M to continue writing text from a prompt:

```python
from model.roformer.modeling_roformer import RoFormerModel
from transformers import AutoTokenizer
import torch
import numpy as np

sentence = '清华大学位于'
max_length = 32
model_pretrained_weight_path = '/home/'  # path to the pretrained model weights

tokenizer = AutoTokenizer.from_pretrained(model_pretrained_weight_path)
model = RoFormerModel.from_pretrained(model_pretrained_weight_path)

for i in range(max_length):
    # Prepend [CLS] and encode the sentence generated so far.
    encode = torch.tensor(
        [[tokenizer.cls_token_id]+tokenizer.encode(sentence, add_special_tokens=False)]).long()
    logits = model(encode)[0]
    # Project the hidden states onto the vocabulary through the tied word embeddings.
    logits = torch.nn.functional.linear(
        logits, model.embeddings.word_embeddings.weight)
    logits = torch.nn.functional.softmax(
        logits, dim=-1).cpu().detach().numpy()[0]
    # Sample the next token from the distribution at the last position and append it.
    sentence = sentence + \
        tokenizer.decode(int(np.random.choice(logits.shape[1], p=logits[-1])))
    if sentence[-1] == '。':
        break
print(sentence)
```
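The loop above samples the next token from the full softmax distribution at the last position, so the continuation differs from run to run. A minimal sketch of a deterministic alternative (not part of the original example; it reuses the `model` and `tokenizer` objects loaded above and swaps sampling for greedy argmax decoding):

```python
# Sketch only: greedy (argmax) continuation, assuming `model` and `tokenizer`
# have already been loaded as in the example above.
import torch

sentence = '清华大学位于'
for _ in range(32):
    input_ids = torch.tensor(
        [[tokenizer.cls_token_id] + tokenizer.encode(sentence, add_special_tokens=False)]).long()
    hidden = model(input_ids)[0]
    # Same tied-embedding projection onto the vocabulary as in the original example.
    logits = torch.nn.functional.linear(
        hidden, model.embeddings.word_embeddings.weight)
    # Greedy: always take the highest-scoring token at the last position.
    next_id = int(torch.argmax(logits[0, -1]).item())
    sentence += tokenizer.decode(next_id)
    if sentence[-1] == '。':
        break
print(sentence)
```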
## Scores on downstream Chinese tasks (without any data augmentation)

| Model | afqmc | tnews | iflytek | ocnli | cmnli | wsc | csl |
| :--------: | :-----: | :----: | :-----: | :----: | :----: | :----: | :----: |