krmk90 commited on
Commit
2283515
·
verified ·
1 Parent(s): 3547e0e

Model save

Browse files
README.md CHANGED
@@ -36,15 +36,15 @@ More information needed
36
 
37
  The following hyperparameters were used during training:
38
  - learning_rate: 0.0002
39
- - train_batch_size: 2
40
  - eval_batch_size: 8
41
  - seed: 42
42
  - gradient_accumulation_steps: 8
43
- - total_train_batch_size: 16
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: constant
46
  - lr_scheduler_warmup_ratio: 0.03
47
- - num_epochs: 5
48
 
49
  ### Training results
50
 
 
36
 
37
  The following hyperparameters were used during training:
38
  - learning_rate: 0.0002
39
+ - train_batch_size: 3
40
  - eval_batch_size: 8
41
  - seed: 42
42
  - gradient_accumulation_steps: 8
43
+ - total_train_batch_size: 24
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: constant
46
  - lr_scheduler_warmup_ratio: 0.03
47
+ - num_epochs: 3
48
 
49
  ### Training results
50
 
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8565bd13ecff59b3f9d5b7cefa56c9c0eb854466b1fe653aa057d6486f181b5e
3
  size 10107280
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:226aaa08387a8481e12f97ccf7abcc2d2beec6b4163b3a8651935a1e9b36255d
3
  size 10107280
runs/Feb27_21-35-25_ip-10-246-124-97/events.out.tfevents.1740692134.ip-10-246-124-97.6177.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:26eb798632952c4b5d11f3e478a016db5cba3215271cadbc4de454367023a796
3
- size 7768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:698fe31c9dbfad12c1f77dfcf9726d17b88ed257e7ea0e6526350349b655e173
3
+ size 8944