Update README.md
README.md CHANGED
@@ -7,15 +7,15 @@ This repo contains a low-rank adapter for LLaMA-7b fit on the Cleaned Alpaca dat

This version of the weights was trained with the following hyperparameters:

- Cleaned dataset: Snapshot
+ Cleaned dataset: Snapshot April 2, 2023
Epochs: 3
- Validation set size:
+ Validation set size: 1500
Batch size: 128
- Micro batch size:
+ Micro batch size: 8
Cutoff length: 512
Learning rate: 3e-4
- Lora r:
+ Lora r: 16
- Lora target modules: q_proj, v_proj
+ Lora target modules: q_proj, k_proj, v_proj, o_proj

That is:

@@ -25,7 +25,7 @@ python finetune.py \
--num_epochs=3 \
--cutoff_len=512 \
--output_dir='./lora-alpaca' \
- --lora_target_modules='[q_proj,v_proj]' \
+ --lora_target_modules='[q_proj,k_proj, v_proj, o_proj]' \
- --lora_r=
+ --lora_r=16 \
- --val_set_size
+ --val_set_size 1500 \
- --micro_batch_size=
+ --micro_batch_size=8
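For reference, below is a minimal sketch of how the updated hyperparameters could be expressed with the PEFT library's LoraConfig. It is an illustration only, not the repo's finetune.py; the lora_alpha and lora_dropout values are assumptions, since the README does not state them.

from peft import LoraConfig

# Batch size 128 with micro batch size 8 implies 16 gradient accumulation
# steps, assuming the usual batch_size // micro_batch_size relationship.
batch_size = 128
micro_batch_size = 8
gradient_accumulation_steps = batch_size // micro_batch_size  # 16

# LoRA settings matching the updated README values above.
lora_config = LoraConfig(
    r=16,                                                      # Lora r: 16
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],   # Lora target modules
    lora_alpha=16,         # assumed value, not given in the README
    lora_dropout=0.05,     # assumed value, not given in the README
    bias="none",
    task_type="CAUSAL_LM",
)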