Update README.md
README.md CHANGED

@@ -6,6 +6,10 @@ datasets:
 - togethercomputer/RedPajama-Data-1T
 ---
 
+# Tokenizer Fixed!! 🎉
+Thanks to https://huggingface.co/mistralai/Mistral-7B-v0.1/discussions/26/files
+
+
 # OpenLLaMA: An Open Reproduction of LLaMA
 
 **TL;DR**: we are releasing our public preview of OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA. We are releasing a series of 3B, 7B and 13B models trained on different data mixtures. Our model weights can serve as the drop in replacement of LLaMA in existing implementations.

@@ -24,14 +28,7 @@ Preview checkpoints can be directly loaded from Hugging Face Hub. **Please note
 import torch
 from transformers import LlamaTokenizer, LlamaForCausalLM
 
-
-model_path = 'openlm-research/open_llama_3b_v2'
-# model_path = 'openlm-research/open_llama_7b_v2'
-
-## v1 models
-# model_path = 'openlm-research/open_llama_3b'
-# model_path = 'openlm-research/open_llama_7b'
-# model_path = 'openlm-research/open_llama_13b'
+model_path = 'typeof/open_llama_3b_v2'
 
 tokenizer = LlamaTokenizer.from_pretrained(model_path)
 model = LlamaForCausalLM.from_pretrained(
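
For context, a minimal end-to-end sketch of how the snippet above is typically completed after this change: loading the updated checkpoint and running a short generation. The `torch_dtype`/`device_map` arguments and the example prompt are illustrative assumptions, not part of the diff itself.

```python
import torch
from transformers import LlamaTokenizer, LlamaForCausalLM

# Checkpoint path taken from the updated README; the remaining settings are
# assumptions for illustration only.
model_path = 'typeof/open_llama_3b_v2'

tokenizer = LlamaTokenizer.from_pretrained(model_path)
model = LlamaForCausalLM.from_pretrained(
    model_path, torch_dtype=torch.float16, device_map='auto'
)

# Example prompt (hypothetical), encoded and moved to the model's device.
prompt = 'Q: What is the largest animal?\nA:'
input_ids = tokenizer(prompt, return_tensors='pt').input_ids.to(model.device)

# Generate a short continuation and decode it back to text.
generation_output = model.generate(input_ids=input_ids, max_new_tokens=32)
print(tokenizer.decode(generation_output[0]))
```

Any of the other checkpoints referenced in the README can be substituted for `model_path` in the same way.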
|