End of training

Files changed (5)

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/deberta-v3-small](https://huggingface.co/microsoft/deberta-v3-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3573
 ## Model description
@@ -40,17 +40,22 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 16   | 0.4115          |
-| No log        | 2.0   | 32   | 0.3740          |
-| No log        | 3.0   | 48   | 0.3706          |
-| No log        | 4.0   | 64   | 0.3614          |
-| No log        | 5.0   | 80   | 0.3573          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/deberta-v3-small](https://huggingface.co/microsoft/deberta-v3-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1043
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 16   | 0.3287          |
+| No log        | 2.0   | 32   | 0.2921          |
+| No log        | 3.0   | 48   | 0.2538          |
+| No log        | 4.0   | 64   | 0.2106          |
+| No log        | 5.0   | 80   | 0.1782          |
+| No log        | 6.0   | 96   | 0.1501          |
+| No log        | 7.0   | 112  | 0.1291          |
+| No log        | 8.0   | 128  | 0.1176          |
+| No log        | 9.0   | 144  | 0.1081          |
+| No log        | 10.0  | 160  | 0.1043          |
 ### Framework versions
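
Note on the README change: the updated table can be sanity-checked with a few lines of Python. The sketch below is illustrative only. It copies the rows from the 10-epoch table above and assumes the `Step` column counts optimizer steps cumulatively, so steps per epoch is step divided by epoch. The names `rows` and `summarize` are hypothetical and not part of the training code.

```python
# (epoch, step, validation loss) rows copied from the updated results table.
rows = [
    (1, 16, 0.3287),
    (2, 32, 0.2921),
    (3, 48, 0.2538),
    (4, 64, 0.2106),
    (5, 80, 0.1782),
    (6, 96, 0.1501),
    (7, 112, 0.1291),
    (8, 128, 0.1176),
    (9, 144, 0.1081),
    (10, 160, 0.1043),
]


def summarize(rows):
    """Return steps per epoch, per-epoch loss changes, and relative reduction."""
    first_epoch, first_step, _ = rows[0]
    # Assumption: the Step column is cumulative, so step / epoch is constant.
    steps_per_epoch = first_step // first_epoch
    losses = [loss for _, _, loss in rows]
    # Change in validation loss from each epoch to the next (negative = improving).
    deltas = [round(b - a, 4) for a, b in zip(losses, losses[1:])]
    # Fraction of the epoch-1 validation loss removed by the final epoch.
    relative_reduction = (losses[0] - losses[-1]) / losses[0]
    return steps_per_epoch, deltas, relative_reduction


steps_per_epoch, deltas, relative_reduction = summarize(rows)
print(f"steps per epoch: {steps_per_epoch}")
print(f"per-epoch change in validation loss: {deltas}")
print(f"relative reduction, epoch 1 to 10: {relative_reduction:.1%}")
```

Validation loss falls at every epoch in this run, with the per-epoch improvement shrinking toward the end.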

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3a8a937312de0a0fb95794a9c676a02d9bedb881745da7e30a01788d236f457b
 size 567617008

 version https://git-lfs.github.com/spec/v1
+oid sha256:ec278e8a81278f51b5013ba81dc326318e115cad96692886fb3c4d1802ab4f12
 size 567617008
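
Note on the pointer change: `model.safetensors` is tracked with Git LFS, so the diff shows only the pointer file, not the weights. A minimal sketch of how the two pointers above could be compared follows. `parse_lfs_pointer` is a hypothetical helper written for this note, and it assumes the simple three-line `key value` layout shown in the diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file of 'key value' lines into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form '<hash algorithm>:<hex digest>'.
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the old and new sides of the diff.
old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:3a8a937312de0a0fb95794a9c676a02d9bedb881745da7e30a01788d236f457b
size 567617008
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:ec278e8a81278f51b5013ba81dc326318e115cad96692886fb3c4d1802ab4f12
size 567617008
""")

same_size = old["size"] == new["size"]
same_digest = old["digest"] == new["digest"]
print(f"size unchanged: {same_size}, digest unchanged: {same_digest}")
```

The size is identical while the digest differs, which is what one would expect when the weights are retrained without changing the architecture.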

runs/Jun03_11-18-11_0417-111206-hkgnini8-10-45-16-10/events.out.tfevents.1717413493.0417-111206-hkgnini8-10-45-16-10.6681.34 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:2fc1f3a5da6a04fc4c282e793485dfc374d03dfa34c98ea0ccf5fb6c70b52962
+size 8425

runs/Jun03_11-18-11_0417-111206-hkgnini8-10-45-16-10/events.out.tfevents.1717413766.0417-111206-hkgnini8-10-45-16-10.6681.35 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:6ebc6c28813e6f69a9841287167da2bd29c5c329b1af74bfb53ad6739235accc
+size 311

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d9013aaf3a4e426e667cf639add19662076c75d51c8bfc6b59e3eba0974cfed8
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:c8dc4c226709f28af64c89dbe75789afb0babbfbce6c3f7378a3212ceaa4d176
 size 5112