atsuki-yamaguchi committed on
Commit bfebe06 · verified · 1 Parent(s): 710b979

Create README.md

  1. README.md +46 -0
README.md ADDED
@@ -0,0 +1,46 @@
+ ---
+ license: llama3
+ language:
+ - my
+ base_model: meta-llama/Meta-Llama-3-8B
+ library_name: transformers
+ ---
+
+ # LLaMA3-8B for Burmese: 5K target vocabulary size + Random target vocabulary initialization + T&B 2LS/MTP/512 training
+
+ This model is LLaMA3-8B adapted for Burmese using 30K target-language sentences sampled from CC-100.
+
+ ## Model Details
+
+ * **Vocabulary**: This model has an additional 5K target-language tokens in its vocabulary.
+ * **Target vocabulary initialization**: The embedding and LM head weights for the new target tokens were randomly initialized.
+ * **Training**: This model was further pre-trained on 30K target-language sentences sampled from CC-100. Training used the T&B 2LS, MTP, and 512 strategies introduced in the paper linked under Model Sources.
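
As a rough illustration of the vocabulary expansion described in the list above, the toy sketch below appends randomly initialized rows to an embedding table while leaving the source rows untouched. It is not the authors' implementation: the helper name `expand_embeddings`, the normal distribution, the `std=0.02` value, and the tiny matrix sizes are all assumptions chosen for illustration. With the `transformers` library, the comparable step would typically go through `tokenizer.add_tokens` and `model.resize_token_embeddings`, though the card does not say which calls were used.

```python
import random


def expand_embeddings(embeddings, num_new_tokens, std=0.02, seed=0):
    """Append randomly initialized rows for new target-language tokens.

    `embeddings` is a list of rows, one per source token. The normal
    distribution and std=0.02 are assumptions for illustration only.
    """
    rng = random.Random(seed)
    hidden_size = len(embeddings[0])
    new_rows = [
        [rng.gauss(0.0, std) for _ in range(hidden_size)]
        for _ in range(num_new_tokens)
    ]
    # Source rows are kept unchanged; only the appended rows are random.
    return embeddings + new_rows


# Toy sizes standing in for the real model (hypothetical values).
source = [[0.1] * 8 for _ in range(10)]
expanded = expand_embeddings(source, num_new_tokens=5)
print(len(expanded), len(expanded[0]))  # prints "15 8"
```

The same idea would apply to the LM head, which has one row per vocabulary entry as well.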
+
+ ### Model Description
+
+ - **Language:** Burmese
+ - **License:** Llama 3 Community License Agreement
+ - **Finetuned from model:** meta-llama/Meta-Llama-3-8B
+
+ ### Model Sources
+
+ - **Repository:** https://github.com/gucci-j/lowres-cve
+ - **Paper:** https://arxiv.org/abs/2406.11477
+
+ ## How to Get Started with the Model
+ Use the code below to get started with the model.
+ ```python
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+
+ model = AutoModelForCausalLM.from_pretrained(
+     "atsuki-yamaguchi/Llama-3-8B-my-30K-5000-rand"
+ )
+ tokenizer = AutoTokenizer.from_pretrained(
+     "atsuki-yamaguchi/Llama-3-8B-my-30K-5000-rand"
+ )
+ ```
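
Once the model and tokenizer load, text generation might look like the sketch below. The helper name `generate_text`, the greedy decoding setting, and the `max_new_tokens=32` default are assumptions for illustration and are not taken from the model card. The `transformers` import sits inside the function so that defining the helper does not trigger a download of the 8B checkpoint.

```python
def generate_text(prompt, max_new_tokens=32):
    """Load the model and tokenizer, then greedily continue `prompt`."""
    # Imported lazily so that defining this helper stays cheap.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "atsuki-yamaguchi/Llama-3-8B-my-30K-5000-rand"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Tokenize the prompt into PyTorch tensors.
    inputs = tokenizer(prompt, return_tensors="pt")
    # Greedy decoding (do_sample=False) is an assumed default for this sketch.
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text` with a Burmese prompt returns the prompt followed by the model's continuation. Loading an 8B-parameter model needs substantial memory, so a GPU and a reduced-precision dtype are usually advisable.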
+
+ ## Citation
+ ```
+ TBA
+ ```