End of training


README.md

@@ -6,8 +6,6 @@ tags:
 model-index:
 - name: qa-bert-base-multilingual-uncased
   results: []
-datasets:
-- SajjadAyoubi/persian_qa
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # qa-bert-base-multilingual-uncased
-This model is a fine-tuned version of [google-bert/bert-base-multilingual-uncased](https://huggingface.co/google-bert/bert-base-multilingual-uncased) on https://huggingface.co/datasets/SajjadAyoubi/persian_qa dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.7871
 ## Model description
@@ -36,21 +34,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.2437        | 1.0   | 1139 | 1.7038          |
-| 1.4946        | 2.0   | 2278 | 1.6015          |
-| 0.9703        | 3.0   | 3417 | 1.7871          |
 ### Framework versions
@@ -58,4 +58,4 @@ The following hyperparameters were used during training:
 - Transformers 4.42.4
 - Pytorch 2.3.1+cu121
 - Datasets 2.21.0
-- Tokenizers 0.19.1

 model-index:
 - name: qa-bert-base-multilingual-uncased
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # qa-bert-base-multilingual-uncased
+This model is a fine-tuned version of [google-bert/bert-base-multilingual-uncased](https://huggingface.co/google-bert/bert-base-multilingual-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.7136
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 3.0763        | 1.0   | 570  | 1.8933          |
+| 2.0611        | 2.0   | 1140 | 1.6730          |
+| 1.7286        | 3.0   | 1710 | 1.6859          |
+| 1.5198        | 4.0   | 2280 | 1.6814          |
+| 1.3609        | 5.0   | 2850 | 1.7136          |
 ### Framework versions
 - Transformers 4.42.4
 - Pytorch 2.3.1+cu121
 - Datasets 2.21.0
+- Tokenizers 0.19.1
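
The two "Training results" tables in this diff can be compared with a few lines of Python. This is a minimal sketch and not part of the commit: the names `previous_run`, `updated_run`, and `best_epoch` are hypothetical, and the numbers are copied from the removed and added table rows above.

```python
# Validation loss per epoch, copied from the removed (-) and added (+) table rows.
previous_run = {1: 1.7038, 2: 1.6015, 3: 1.7871}  # learning_rate 3e-05, batch size 8
updated_run = {1: 1.8933, 2: 1.6730, 3: 1.6859, 4: 1.6814, 5: 1.7136}  # learning_rate 1e-05, batch size 16


def best_epoch(val_loss):
    """Return (epoch, loss) for the epoch with the lowest validation loss."""
    epoch = min(val_loss, key=val_loss.get)
    return epoch, val_loss[epoch]


for name, run in (("previous", previous_run), ("updated", updated_run)):
    epoch, loss = best_epoch(run)
    last = max(run)  # the final epoch is the one reported in the card's "Loss" line
    print(f"{name}: best epoch {epoch} ({loss:.4f}), final epoch {last} ({run[last]:.4f})")
```

In both runs the lowest validation loss falls at epoch 2, while the "Loss" line in the card reports the final epoch (1.7871 before, 1.7136 after).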

model.safetensors

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7cce55dc6dad6897153a214d72a5bca2dae99722061837f19eba75f349ad28d7
 size 667092808

 version https://git-lfs.github.com/spec/v1
+oid sha256:aa1998768ce12902f424504469ac7964c6b4bbb5446a5004a9cb5f64c4d3e8b1
 size 667092808
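
The `model.safetensors` entry above is a Git LFS pointer rather than the weights themselves, so the diff shows only the `oid` changing. As a rough illustration, the sketch below parses the two pointers and compares them; `parse_lfs_pointer` is a hypothetical helper written for this example, not part of any library.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer contents copied from the removed (-) and added (+) sides of the diff.
old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:7cce55dc6dad6897153a214d72a5bca2dae99722061837f19eba75f349ad28d7
size 667092808
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:aa1998768ce12902f424504469ac7964c6b4bbb5446a5004a9cb5f64c4d3e8b1
size 667092808
""")

print("same size:", old["size"] == new["size"])
print("same content hash:", old["digest"] == new["digest"])
```

The byte size is identical and the hash differs, which is what one would expect when the same architecture is retrained with different weights.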

runs/Aug16_16-10-16_50bebba3fbc0/events.out.tfevents.1723824662.50bebba3fbc0.1317.0

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4fc6d93687da38922ca5aad8a567bf2acbf1ed8be53fda64b2b2c32820d462f5
-size 7469

 version https://git-lfs.github.com/spec/v1
+oid sha256:ba332d724563f1c93adaa92c24e04be09c3ff02828c78d2b4cce447eb852f40e
+size 7823