End of training
Browse files
README.md
CHANGED
@@ -41,7 +41,7 @@ The following hyperparameters were used during training:
|
|
41 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
42 |
- lr_scheduler_type: cosine
|
43 |
- lr_scheduler_warmup_steps: 1000
|
44 |
-
- num_epochs:
|
45 |
- mixed_precision_training: Native AMP
|
46 |
|
47 |
### Training results
|
@@ -52,5 +52,5 @@ The following hyperparameters were used during training:
|
|
52 |
|
53 |
- Transformers 4.35.2
|
54 |
- Pytorch 2.1.0+cu121
|
55 |
-
- Datasets 2.
|
56 |
- Tokenizers 0.15.1
|
|
|
41 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
42 |
- lr_scheduler_type: cosine
|
43 |
- lr_scheduler_warmup_steps: 1000
|
44 |
+
- num_epochs: 90
|
45 |
- mixed_precision_training: Native AMP
|
46 |
|
47 |
### Training results
|
|
|
52 |
|
53 |
- Transformers 4.35.2
|
54 |
- Pytorch 2.1.0+cu121
|
55 |
+
- Datasets 2.17.0
|
56 |
- Tokenizers 0.15.1
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 497774208
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:29508c727614b0138a7bddb5e9acab8e8bfa47cba9a8e773d1934c795c215ed4
|
3 |
size 497774208
|
runs/Feb15_11-50-01_6ddd855286a4/events.out.tfevents.1707997802.6ddd855286a4.579.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c64f2fee7dad5a53dbb0d6d6fbd39ac12db59a811d60e79a5ac77ff92f95f75c
|
3 |
+
size 4862
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4536
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:27b4b77abc4f03520cf51087b5a1565dcf181b707cc154019035eeb02d0dd434
|
3 |
size 4536
|