andricValdez committed on
Commit b3b7bf3 · verified · 1 Parent(s): 640f66a

End of training

README.md ADDED
@@ -0,0 +1,66 @@
+ ---
+ license: mit
+ base_model: microsoft/deberta-v3-base
+ tags:
+ - generated_from_trainer
+ metrics:
+ - accuracy
+ - f1
+ model-index:
+ - name: deberta-v3-base-finetuned-autext23
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # deberta-v3-base-finetuned-autext23
+
+ This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on an unspecified dataset (the Trainer recorded the dataset name as `None`).
+ It achieves the following results on the evaluation set after the final epoch:
+ - Loss: 0.5603
+ - Accuracy: 0.8926
+ - F1: 0.8917
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 2e-05
+ - train_batch_size: 64
+ - eval_batch_size: 64
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 5
+
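To make the `lr_scheduler_type: linear` entry concrete, the sketch below computes the learning rate that a linear-decay schedule would yield at a given optimizer step. It assumes zero warmup steps (the card lists none) and takes the total of 1855 steps from the last row of the training results table. `linear_lr` is a hypothetical helper written for illustration; it is not a Transformers function and may not match the exact schedule used in this run.

```python
def linear_lr(step: int, base_lr: float = 2e-5, total_steps: int = 1855,
              warmup_steps: int = 0) -> float:
    """Learning rate under linear warmup followed by linear decay to zero.

    Assumption: warmup_steps defaults to 0 because the model card does not
    report any warmup; base_lr and total_steps come from the card.
    """
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Steps at the end of each epoch, as listed in the training results table.
    for step in (0, 371, 742, 1113, 1484, 1855):
        print(f"step {step:4d}: lr = {linear_lr(step):.3e}")
```

Under these assumptions the rate falls by one fifth of its starting value per epoch, since each epoch covers 371 of the 1855 steps.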
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     |
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
+ | No log        | 1.0   | 371  | 0.3067          | 0.8755   | 0.8741 |
+ | 0.2052        | 2.0   | 742  | 0.3418          | 0.8832   | 0.8821 |
+ | 0.2052        | 3.0   | 1113 | 0.3133          | 0.9067   | 0.9063 |
+ | 0.0506        | 4.0   | 1484 | 0.4449          | 0.9006   | 0.9000 |
+ | 0.0506        | 5.0   | 1855 | 0.5603          | 0.8926   | 0.8917 |
+
+
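In the table above, accuracy and F1 peak at epoch 3 while validation loss climbs through epochs 4 and 5, yet the headline results in the card are those of epoch 5. The sketch below shows one way to pick the strongest checkpoint from such a log. The rows are copied from the table; `best_by_f1` and `best_by_loss` are hypothetical helper names, and nothing here is claimed to be part of the original training script.

```python
# (epoch, step, validation_loss, accuracy, f1), copied from the results table.
RESULTS = [
    (1.0, 371, 0.3067, 0.8755, 0.8741),
    (2.0, 742, 0.3418, 0.8832, 0.8821),
    (3.0, 1113, 0.3133, 0.9067, 0.9063),
    (4.0, 1484, 0.4449, 0.9006, 0.9000),
    (5.0, 1855, 0.5603, 0.8926, 0.8917),
]


def best_by_f1(rows):
    """Return the row with the highest F1 score."""
    return max(rows, key=lambda row: row[4])


def best_by_loss(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[2])


if __name__ == "__main__":
    epoch, step, loss, acc, f1 = best_by_f1(RESULTS)
    print(f"best F1 {f1:.4f} at epoch {epoch:g} (step {step})")
    epoch, step, loss, acc, f1 = best_by_loss(RESULTS)
    print(f"lowest validation loss {loss:.4f} at epoch {epoch:g} (step {step})")
```

With the Transformers `Trainer`, a similar effect is usually obtained by setting `load_best_model_at_end=True` together with `metric_for_best_model="f1"` in `TrainingArguments`. The card does not say whether this run used those options.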
+ ### Framework versions
+
+ - Transformers 4.40.1
+ - PyTorch 2.3.0+cu121
+ - Datasets 2.19.0
+ - Tokenizers 0.19.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f90aa659c14a82644068afd120c93007f6a2b1f5e3be8761c4ea2f684b886ee6
+ oid sha256:b44059111294668a52ef8397ff1e700ba58d3e82f3636e1e2f3c8086ea8a8446
  size 737719272
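The `model.safetensors` entry in this commit is a Git LFS pointer rather than the weights themselves: three `key value` lines giving the spec version, the object's SHA-256 digest, and its size in bytes. As a minimal sketch, the pointer can be parsed as shown below. `parse_lfs_pointer` is a hypothetical helper written for illustration and is not part of Git LFS or any Hugging Face library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by one space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# New pointer content from this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:b44059111294668a52ef8397ff1e700ba58d3e82f3636e1e2f3c8086ea8a8446
size 737719272
"""

if __name__ == "__main__":
    info = parse_lfs_pointer(POINTER)
    print(info["hash_algo"], info["digest"][:12], info["size"])
```

Here the digest changes while the size stays at 737719272 bytes, which is consistent with the weights being updated without any change to tensor shapes or dtypes.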
runs/Dec07_11-46-14_helena-Precision-7920-Tower/events.out.tfevents.1733593575.helena-Precision-7920-Tower CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:16b12ce03659955addda4f8a9ab25fe2cbd79c1eaf22de709b4e29f14a220443
- size 6980
+ oid sha256:15d007d26a49e43fe71effe907b0dbec2c709f5ec709c3b44417e36732360c4c
+ size 7703
runs/Dec07_11-46-14_helena-Precision-7920-Tower/events.out.tfevents.1733594488.helena-Precision-7920-Tower ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:20a8a7ab2013c7d10ff8266309963b23a1e2911fa3769ac3eae320a92d5969e0
+ size 40