Update README.md
README.md CHANGED

@@ -16,7 +16,7 @@ This checkpoint (CodeGen-NL 350M) was pre-trained on [the Pile](https://github.c
 ## Training procedure
 
 CodeGen was trained using cross-entropy loss to maximize the likelihood of sequential inputs.
-The family of models are trained using 
+The family of models are trained using multiple TPU-v4-512 by Google, leveraging data and model parallelism.
 See Section 2.3 of the [paper](https://arxiv.org/abs/2203.13474) for more details.
 
 ## Evaluation results
@@ -35,8 +35,8 @@ This model can be easily loaded using the `AutoModelForCausalLM` functionality:
 
 ```python
 from transformers import AutoTokenizer, AutoModelForCausalLM
-tokenizer = AutoTokenizer.from_pretrained(
-model = AutoModelForCausalLM.from_pretrained(
+tokenizer = AutoTokenizer.from_pretrained("Salesforce/codegen-350M-nl")
+model = AutoModelForCausalLM.from_pretrained("Salesforce/codegen-350M-nl")
 
 text = "def hello_world():"
 input_ids = tokenizer(text, return_tensors="pt").input_ids
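The second hunk ends at `input_ids`; the rest of the usage example lies outside the changed lines. A minimal continuation, assuming the usual `generate`/`decode` pattern (the `max_length` value is an arbitrary choice, not taken from the diff):

```python
# Continues the snippet above; the generate/decode lines are assumed,
# not shown in the diff hunk.
generated_ids = model.generate(input_ids, max_length=128)
print(tokenizer.decode(generated_ids[0], skip_special_tokens=True))
```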
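For context on the updated "Training procedure" line: the cross-entropy objective described there is the standard causal language-modeling loss. The sketch below is illustrative only (it is not part of the README or this diff; the checkpoint name is taken from the snippet above) and shows how that loss can be inspected with `transformers` by passing `labels` to the model.

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

# Illustrative sketch -- not part of the README diff.
tokenizer = AutoTokenizer.from_pretrained("Salesforce/codegen-350M-nl")
model = AutoModelForCausalLM.from_pretrained("Salesforce/codegen-350M-nl")

text = "def hello_world():"
input_ids = tokenizer(text, return_tensors="pt").input_ids

# Supplying labels=input_ids makes the model return the shifted token-level
# cross-entropy loss, i.e. the negative log-likelihood of the sequence --
# the same objective the "Training procedure" section describes.
with torch.no_grad():
    loss = model(input_ids, labels=input_ids).loss
print(float(loss))
```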