JCAI2000 commited on
Commit
e782ffd
·
1 Parent(s): c0e03f4

End of training

Browse files
Files changed (4) hide show
  1. README.md +150 -0
  2. config.json +78 -0
  3. pytorch_model.bin +3 -0
  4. training_args.bin +3 -0
README.md ADDED
@@ -0,0 +1,150 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ base_model: nvidia/mit-b5
4
+ tags:
5
+ - vision
6
+ - image-segmentation
7
+ - generated_from_trainer
8
+ model-index:
9
+ - name: segformerb5-largeImages
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # segformerb5-largeImages
17
+
18
+ This model is a fine-tuned version of [nvidia/mit-b5](https://huggingface.co/nvidia/mit-b5) on the JCAI2000/LargerImagesLabelled dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 0.1156
21
+ - Mean Iou: 0.7785
22
+ - Mean Accuracy: 0.8298
23
+ - Overall Accuracy: 0.9767
24
+ - Accuracy Background: 0.9925
25
+ - Accuracy Branch: 0.6671
26
+ - Iou Background: 0.9759
27
+ - Iou Branch: 0.5812
28
+
29
+ ## Model description
30
+
31
+ More information needed
32
+
33
+ ## Intended uses & limitations
34
+
35
+ More information needed
36
+
37
+ ## Training and evaluation data
38
+
39
+ More information needed
40
+
41
+ ## Training procedure
42
+
43
+ ### Training hyperparameters
44
+
45
+ The following hyperparameters were used during training:
46
+ - learning_rate: 6e-05
47
+ - train_batch_size: 2
48
+ - eval_batch_size: 2
49
+ - seed: 42
50
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
+ - lr_scheduler_type: linear
52
+ - num_epochs: 100
53
+
54
+ ### Training results
55
+
56
+ | Training Loss | Epoch | Step | Validation Loss | Mean Iou | Mean Accuracy | Overall Accuracy | Accuracy Background | Accuracy Branch | Iou Background | Iou Branch |
57
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|:-------------:|:----------------:|:-------------------:|:---------------:|:--------------:|:----------:|
58
+ | 0.2671 | 1.18 | 20 | 0.2779 | 0.4834 | 0.5075 | 0.9509 | 0.9985 | 0.0165 | 0.9508 | 0.0160 |
59
+ | 0.122 | 2.35 | 40 | 0.1772 | 0.6522 | 0.6931 | 0.9632 | 0.9922 | 0.3940 | 0.9625 | 0.3419 |
60
+ | 0.0671 | 3.53 | 60 | 0.1086 | 0.7392 | 0.8603 | 0.9658 | 0.9772 | 0.7435 | 0.9646 | 0.5138 |
61
+ | 0.0979 | 4.71 | 80 | 0.0860 | 0.7552 | 0.8493 | 0.9705 | 0.9836 | 0.7150 | 0.9695 | 0.5409 |
62
+ | 0.0749 | 5.88 | 100 | 0.0727 | 0.7601 | 0.8116 | 0.9746 | 0.9921 | 0.6311 | 0.9738 | 0.5465 |
63
+ | 0.032 | 7.06 | 120 | 0.0721 | 0.7535 | 0.8016 | 0.9741 | 0.9927 | 0.6106 | 0.9733 | 0.5338 |
64
+ | 0.0337 | 8.24 | 140 | 0.0719 | 0.7745 | 0.8530 | 0.9743 | 0.9873 | 0.7187 | 0.9733 | 0.5757 |
65
+ | 0.0398 | 9.41 | 160 | 0.0704 | 0.7732 | 0.8302 | 0.9756 | 0.9913 | 0.6690 | 0.9748 | 0.5715 |
66
+ | 0.0374 | 10.59 | 180 | 0.0724 | 0.7583 | 0.7995 | 0.9752 | 0.9941 | 0.6050 | 0.9744 | 0.5422 |
67
+ | 0.0334 | 11.76 | 200 | 0.0724 | 0.7721 | 0.8231 | 0.9760 | 0.9924 | 0.6537 | 0.9752 | 0.5690 |
68
+ | 0.025 | 12.94 | 220 | 0.0731 | 0.7725 | 0.8192 | 0.9763 | 0.9932 | 0.6452 | 0.9755 | 0.5694 |
69
+ | 0.0336 | 14.12 | 240 | 0.0699 | 0.7793 | 0.8334 | 0.9765 | 0.9919 | 0.6748 | 0.9757 | 0.5828 |
70
+ | 0.0321 | 15.29 | 260 | 0.0697 | 0.7825 | 0.8395 | 0.9767 | 0.9915 | 0.6875 | 0.9759 | 0.5891 |
71
+ | 0.0216 | 16.47 | 280 | 0.0752 | 0.7701 | 0.8176 | 0.9760 | 0.9930 | 0.6421 | 0.9752 | 0.5650 |
72
+ | 0.0178 | 17.65 | 300 | 0.0743 | 0.7753 | 0.8296 | 0.9761 | 0.9918 | 0.6674 | 0.9753 | 0.5752 |
73
+ | 0.0206 | 18.82 | 320 | 0.0717 | 0.7881 | 0.8488 | 0.9771 | 0.9909 | 0.7066 | 0.9763 | 0.5999 |
74
+ | 0.0162 | 20.0 | 340 | 0.0786 | 0.7694 | 0.8141 | 0.9761 | 0.9935 | 0.6347 | 0.9754 | 0.5634 |
75
+ | 0.0306 | 21.18 | 360 | 0.0785 | 0.7785 | 0.8275 | 0.9768 | 0.9929 | 0.6622 | 0.9760 | 0.5809 |
76
+ | 0.0179 | 22.35 | 380 | 0.0769 | 0.7816 | 0.8414 | 0.9764 | 0.9909 | 0.6919 | 0.9756 | 0.5876 |
77
+ | 0.0152 | 23.53 | 400 | 0.0776 | 0.7842 | 0.8461 | 0.9766 | 0.9906 | 0.7016 | 0.9758 | 0.5926 |
78
+ | 0.0245 | 24.71 | 420 | 0.0820 | 0.7725 | 0.8164 | 0.9765 | 0.9937 | 0.6390 | 0.9758 | 0.5692 |
79
+ | 0.0248 | 25.88 | 440 | 0.0829 | 0.7772 | 0.8268 | 0.9766 | 0.9928 | 0.6608 | 0.9759 | 0.5786 |
80
+ | 0.0176 | 27.06 | 460 | 0.0818 | 0.7761 | 0.8271 | 0.9764 | 0.9925 | 0.6617 | 0.9756 | 0.5767 |
81
+ | 0.0135 | 28.24 | 480 | 0.0816 | 0.7805 | 0.8384 | 0.9764 | 0.9913 | 0.6854 | 0.9756 | 0.5855 |
82
+ | 0.0343 | 29.41 | 500 | 0.0852 | 0.7777 | 0.8310 | 0.9764 | 0.9921 | 0.6699 | 0.9756 | 0.5798 |
83
+ | 0.0147 | 30.59 | 520 | 0.0851 | 0.7792 | 0.8367 | 0.9763 | 0.9913 | 0.6820 | 0.9755 | 0.5829 |
84
+ | 0.0119 | 31.76 | 540 | 0.0880 | 0.7800 | 0.8337 | 0.9767 | 0.9920 | 0.6754 | 0.9759 | 0.5842 |
85
+ | 0.0143 | 32.94 | 560 | 0.0899 | 0.7749 | 0.8241 | 0.9764 | 0.9928 | 0.6555 | 0.9756 | 0.5743 |
86
+ | 0.0122 | 34.12 | 580 | 0.0886 | 0.7810 | 0.8374 | 0.9766 | 0.9916 | 0.6832 | 0.9758 | 0.5863 |
87
+ | 0.0135 | 35.29 | 600 | 0.0908 | 0.7727 | 0.8206 | 0.9762 | 0.9930 | 0.6482 | 0.9755 | 0.5699 |
88
+ | 0.0203 | 36.47 | 620 | 0.0913 | 0.7758 | 0.8267 | 0.9764 | 0.9925 | 0.6608 | 0.9756 | 0.5759 |
89
+ | 0.0109 | 37.65 | 640 | 0.0898 | 0.7803 | 0.8337 | 0.9767 | 0.9921 | 0.6753 | 0.9759 | 0.5847 |
90
+ | 0.0141 | 38.82 | 660 | 0.0936 | 0.7774 | 0.8280 | 0.9766 | 0.9926 | 0.6634 | 0.9758 | 0.5790 |
91
+ | 0.0087 | 40.0 | 680 | 0.0903 | 0.7830 | 0.8493 | 0.9762 | 0.9898 | 0.7088 | 0.9753 | 0.5908 |
92
+ | 0.0099 | 41.18 | 700 | 0.0930 | 0.7779 | 0.8284 | 0.9766 | 0.9926 | 0.6641 | 0.9759 | 0.5799 |
93
+ | 0.0149 | 42.35 | 720 | 0.0908 | 0.7799 | 0.8320 | 0.9767 | 0.9923 | 0.6717 | 0.9760 | 0.5838 |
94
+ | 0.0168 | 43.53 | 740 | 0.0897 | 0.7864 | 0.8496 | 0.9768 | 0.9904 | 0.7087 | 0.9759 | 0.5969 |
95
+ | 0.0281 | 44.71 | 760 | 0.0954 | 0.7760 | 0.8259 | 0.9765 | 0.9927 | 0.6591 | 0.9757 | 0.5762 |
96
+ | 0.0102 | 45.88 | 780 | 0.0942 | 0.7819 | 0.8382 | 0.9767 | 0.9916 | 0.6849 | 0.9759 | 0.5879 |
97
+ | 0.0087 | 47.06 | 800 | 0.0948 | 0.7843 | 0.8422 | 0.9769 | 0.9913 | 0.6931 | 0.9761 | 0.5926 |
98
+ | 0.0166 | 48.24 | 820 | 0.0981 | 0.7777 | 0.8280 | 0.9766 | 0.9926 | 0.6634 | 0.9759 | 0.5796 |
99
+ | 0.0236 | 49.41 | 840 | 0.0972 | 0.7770 | 0.8274 | 0.9765 | 0.9926 | 0.6622 | 0.9758 | 0.5782 |
100
+ | 0.0168 | 50.59 | 860 | 0.0994 | 0.7751 | 0.8218 | 0.9766 | 0.9932 | 0.6505 | 0.9758 | 0.5743 |
101
+ | 0.017 | 51.76 | 880 | 0.0991 | 0.7779 | 0.8281 | 0.9767 | 0.9926 | 0.6635 | 0.9759 | 0.5799 |
102
+ | 0.0111 | 52.94 | 900 | 0.0994 | 0.7778 | 0.8266 | 0.9767 | 0.9929 | 0.6603 | 0.9760 | 0.5797 |
103
+ | 0.0202 | 54.12 | 920 | 0.0985 | 0.7845 | 0.8380 | 0.9772 | 0.9921 | 0.6839 | 0.9764 | 0.5926 |
104
+ | 0.0142 | 55.29 | 940 | 0.1025 | 0.7762 | 0.8240 | 0.9767 | 0.9931 | 0.6548 | 0.9759 | 0.5766 |
105
+ | 0.01 | 56.47 | 960 | 0.0997 | 0.7808 | 0.8346 | 0.9767 | 0.9920 | 0.6771 | 0.9759 | 0.5857 |
106
+ | 0.0127 | 57.65 | 980 | 0.1028 | 0.7797 | 0.8317 | 0.9767 | 0.9923 | 0.6712 | 0.9759 | 0.5835 |
107
+ | 0.0069 | 58.82 | 1000 | 0.1011 | 0.7834 | 0.8400 | 0.9768 | 0.9915 | 0.6885 | 0.9760 | 0.5907 |
108
+ | 0.0109 | 60.0 | 1020 | 0.1059 | 0.7775 | 0.8282 | 0.9766 | 0.9925 | 0.6638 | 0.9758 | 0.5792 |
109
+ | 0.0087 | 61.18 | 1040 | 0.1037 | 0.7793 | 0.8308 | 0.9767 | 0.9924 | 0.6692 | 0.9759 | 0.5826 |
110
+ | 0.0125 | 62.35 | 1060 | 0.1056 | 0.7784 | 0.8279 | 0.9768 | 0.9928 | 0.6630 | 0.9760 | 0.5808 |
111
+ | 0.0084 | 63.53 | 1080 | 0.1066 | 0.7803 | 0.8330 | 0.9768 | 0.9922 | 0.6737 | 0.9760 | 0.5847 |
112
+ | 0.0183 | 64.71 | 1100 | 0.1056 | 0.7806 | 0.8340 | 0.9767 | 0.9921 | 0.6759 | 0.9760 | 0.5853 |
113
+ | 0.0106 | 65.88 | 1120 | 0.1076 | 0.7768 | 0.8257 | 0.9766 | 0.9929 | 0.6586 | 0.9759 | 0.5778 |
114
+ | 0.0072 | 67.06 | 1140 | 0.1103 | 0.7771 | 0.8278 | 0.9765 | 0.9925 | 0.6630 | 0.9758 | 0.5784 |
115
+ | 0.0112 | 68.24 | 1160 | 0.1070 | 0.7799 | 0.8315 | 0.9768 | 0.9924 | 0.6705 | 0.9760 | 0.5838 |
116
+ | 0.0149 | 69.41 | 1180 | 0.1089 | 0.7778 | 0.8284 | 0.9766 | 0.9926 | 0.6642 | 0.9758 | 0.5797 |
117
+ | 0.0147 | 70.59 | 1200 | 0.1087 | 0.7805 | 0.8325 | 0.9768 | 0.9924 | 0.6727 | 0.9760 | 0.5850 |
118
+ | 0.013 | 71.76 | 1220 | 0.1081 | 0.7803 | 0.8331 | 0.9767 | 0.9922 | 0.6741 | 0.9760 | 0.5846 |
119
+ | 0.013 | 72.94 | 1240 | 0.1097 | 0.7789 | 0.8304 | 0.9767 | 0.9924 | 0.6683 | 0.9759 | 0.5818 |
120
+ | 0.0115 | 74.12 | 1260 | 0.1104 | 0.7773 | 0.8269 | 0.9766 | 0.9927 | 0.6610 | 0.9759 | 0.5787 |
121
+ | 0.0102 | 75.29 | 1280 | 0.1097 | 0.7795 | 0.8323 | 0.9767 | 0.9922 | 0.6725 | 0.9759 | 0.5831 |
122
+ | 0.0133 | 76.47 | 1300 | 0.1101 | 0.7808 | 0.8355 | 0.9767 | 0.9919 | 0.6791 | 0.9759 | 0.5857 |
123
+ | 0.013 | 77.65 | 1320 | 0.1111 | 0.7814 | 0.8358 | 0.9768 | 0.9919 | 0.6797 | 0.9760 | 0.5867 |
124
+ | 0.0068 | 78.82 | 1340 | 0.1107 | 0.7814 | 0.8362 | 0.9767 | 0.9919 | 0.6805 | 0.9759 | 0.5869 |
125
+ | 0.0036 | 80.0 | 1360 | 0.1136 | 0.7789 | 0.8313 | 0.9766 | 0.9923 | 0.6703 | 0.9758 | 0.5820 |
126
+ | 0.0163 | 81.18 | 1380 | 0.1123 | 0.7809 | 0.8347 | 0.9767 | 0.9920 | 0.6773 | 0.9760 | 0.5858 |
127
+ | 0.0065 | 82.35 | 1400 | 0.1117 | 0.7811 | 0.8356 | 0.9767 | 0.9919 | 0.6794 | 0.9759 | 0.5862 |
128
+ | 0.018 | 83.53 | 1420 | 0.1121 | 0.7811 | 0.8360 | 0.9767 | 0.9918 | 0.6802 | 0.9759 | 0.5864 |
129
+ | 0.0122 | 84.71 | 1440 | 0.1123 | 0.7803 | 0.8346 | 0.9766 | 0.9919 | 0.6772 | 0.9759 | 0.5847 |
130
+ | 0.0085 | 85.88 | 1460 | 0.1139 | 0.7783 | 0.8300 | 0.9766 | 0.9924 | 0.6676 | 0.9758 | 0.5808 |
131
+ | 0.0074 | 87.06 | 1480 | 0.1130 | 0.7820 | 0.8364 | 0.9768 | 0.9919 | 0.6808 | 0.9760 | 0.5879 |
132
+ | 0.0124 | 88.24 | 1500 | 0.1141 | 0.7801 | 0.8332 | 0.9767 | 0.9921 | 0.6743 | 0.9759 | 0.5843 |
133
+ | 0.0114 | 89.41 | 1520 | 0.1152 | 0.7783 | 0.8301 | 0.9766 | 0.9924 | 0.6678 | 0.9758 | 0.5808 |
134
+ | 0.0113 | 90.59 | 1540 | 0.1153 | 0.7784 | 0.8302 | 0.9766 | 0.9924 | 0.6680 | 0.9758 | 0.5811 |
135
+ | 0.0076 | 91.76 | 1560 | 0.1153 | 0.7778 | 0.8286 | 0.9766 | 0.9925 | 0.6647 | 0.9758 | 0.5797 |
136
+ | 0.0128 | 92.94 | 1580 | 0.1149 | 0.7785 | 0.8308 | 0.9766 | 0.9923 | 0.6694 | 0.9758 | 0.5813 |
137
+ | 0.0046 | 94.12 | 1600 | 0.1154 | 0.7781 | 0.8298 | 0.9766 | 0.9923 | 0.6673 | 0.9758 | 0.5803 |
138
+ | 0.0091 | 95.29 | 1620 | 0.1143 | 0.7792 | 0.8318 | 0.9766 | 0.9922 | 0.6713 | 0.9759 | 0.5826 |
139
+ | 0.0121 | 96.47 | 1640 | 0.1153 | 0.7784 | 0.8302 | 0.9766 | 0.9924 | 0.6681 | 0.9758 | 0.5810 |
140
+ | 0.0082 | 97.65 | 1660 | 0.1151 | 0.7787 | 0.8308 | 0.9766 | 0.9923 | 0.6694 | 0.9758 | 0.5815 |
141
+ | 0.0094 | 98.82 | 1680 | 0.1155 | 0.7784 | 0.8295 | 0.9766 | 0.9925 | 0.6664 | 0.9759 | 0.5808 |
142
+ | 0.0067 | 100.0 | 1700 | 0.1156 | 0.7785 | 0.8298 | 0.9767 | 0.9925 | 0.6671 | 0.9759 | 0.5812 |
143
+
144
+
145
+ ### Framework versions
146
+
147
+ - Transformers 4.33.0
148
+ - Pytorch 2.0.1+cu117
149
+ - Datasets 2.14.4
150
+ - Tokenizers 0.13.3
config.json ADDED
@@ -0,0 +1,78 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "nvidia/mit-b5",
3
+ "architectures": [
4
+ "SegformerForSemanticSegmentation"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.0,
7
+ "classifier_dropout_prob": 0.1,
8
+ "decoder_hidden_size": 768,
9
+ "depths": [
10
+ 3,
11
+ 6,
12
+ 40,
13
+ 3
14
+ ],
15
+ "downsampling_rates": [
16
+ 1,
17
+ 4,
18
+ 8,
19
+ 16
20
+ ],
21
+ "drop_path_rate": 0.1,
22
+ "hidden_act": "gelu",
23
+ "hidden_dropout_prob": 0.0,
24
+ "hidden_sizes": [
25
+ 64,
26
+ 128,
27
+ 320,
28
+ 512
29
+ ],
30
+ "id2label": {
31
+ "0": "background",
32
+ "1": "branch"
33
+ },
34
+ "image_size": 224,
35
+ "initializer_range": 0.02,
36
+ "label2id": {
37
+ "background": 0,
38
+ "branch": 1
39
+ },
40
+ "layer_norm_eps": 1e-06,
41
+ "mlp_ratios": [
42
+ 4,
43
+ 4,
44
+ 4,
45
+ 4
46
+ ],
47
+ "model_type": "segformer",
48
+ "num_attention_heads": [
49
+ 1,
50
+ 2,
51
+ 5,
52
+ 8
53
+ ],
54
+ "num_channels": 3,
55
+ "num_encoder_blocks": 4,
56
+ "patch_sizes": [
57
+ 7,
58
+ 3,
59
+ 3,
60
+ 3
61
+ ],
62
+ "reshape_last_stage": true,
63
+ "semantic_loss_ignore_index": 255,
64
+ "sr_ratios": [
65
+ 8,
66
+ 4,
67
+ 2,
68
+ 1
69
+ ],
70
+ "strides": [
71
+ 4,
72
+ 2,
73
+ 2,
74
+ 2
75
+ ],
76
+ "torch_dtype": "float32",
77
+ "transformers_version": "4.33.0"
78
+ }
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:edf5f20f983f0ce71bbc46d6ed5afffc121c79c955c10337c258b7b9dfd09f9a
3
+ size 338793417
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:92bc305358b1137a02aeaea55516fb0b840c0c6a97cd478000377beb165c54c4
3
+ size 4027