Kleo commited on
Commit
904d6ae
·
verified ·
1 Parent(s): c53504a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -13
README.md CHANGED
@@ -48,13 +48,6 @@ This is the model card of a 🤗 transformers model that has been pushed on the
48
 
49
  <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
50
 
51
- ### Direct Use
52
-
53
- <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
54
-
55
- [More Information Needed]
56
-
57
-
58
 
59
  ## Bias, Risks, and Limitations
60
 
@@ -125,11 +118,35 @@ for idx, result in enumerate(results, start=1):
125
 
126
  <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
127
 
128
- [ArgKP_2021_GR]](https://huggingface.co/datasets/Kleo/ArgKP_2021_GR)
129
 
130
  ### Training Procedure
131
-
132
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
133
 
134
  #### Preprocessing
135
  Social media text removal
@@ -143,7 +160,9 @@ Social media text removal
143
  #### Speeds, Sizes, Times [optional]
144
 
145
  <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
146
-
 
 
147
 
148
  ## Evaluation
149
 
@@ -155,7 +174,7 @@ Social media text removal
155
 
156
  <!-- This should link to a Dataset Card if possible. -->
157
 
158
- [More Information Needed]
159
 
160
 
161
  #### Metrics
@@ -164,9 +183,14 @@ Social media text removal
164
 
165
  mean Average Precision (mAP)
166
 
 
 
167
  ### Results
168
 
169
- [More Information Needed]
 
 
 
170
 
171
  #### Summary
172
 
 
48
 
49
  <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
50
 
 
 
 
 
 
 
 
51
 
52
  ## Bias, Risks, and Limitations
53
 
 
118
 
119
  <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
120
 
121
+ Machine translated train set of [ArgKP_2021_GR](https://huggingface.co/datasets/Kleo/ArgKP_2021_GR)
122
 
123
  ### Training Procedure
124
+ The following hyperparameters were used during training:
125
+ learning_rate: 1e-4
126
+ train_batch_size: 16
127
+ eval_batch_size: 16
128
+ seed: 42
129
+ num_devices: 1
130
+ gradient_accumulation_steps: 2
131
+ optimizer: paged Adam optimizer
132
+ lr_scheduler_type: linear
133
+ Weight Decay: 0.01
134
+ M. G. Norm: 0.3
135
+ max_seq_length: 512
136
+ num_epochs: 1
137
+ ##################################################################
138
+ LoRa Hyperparameters
139
+ LoRA r :8
140
+ LoRA alpha: 8
141
+ LoRA dropout: 0.0
142
+ LoRA bias: ‘none'
143
+ target_modules: q_proj, v_proj
144
+ task_type: "SEQ_CLS"
145
+ Loss: Binary Cross Entropy
146
+ trainable parameters: 3,416,064 (~5% of the original model)
147
+
148
+
149
+ #### Training hyperparameters
150
 
151
  #### Preprocessing
152
  Social media text removal
 
160
  #### Speeds, Sizes, Times [optional]
161
 
162
  <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
163
+ Num checkpoints: 5
164
+ Checkpoint size: 36,5 MB
165
+ Training duration per checkpoint: 4.15 hours
166
 
167
  ## Evaluation
168
 
 
174
 
175
  <!-- This should link to a Dataset Card if possible. -->
176
 
177
+ Human translated test set of [ArgKP_2021_GR](https://huggingface.co/datasets/Kleo/ArgKP_2021_GR)
178
 
179
 
180
  #### Metrics
 
183
 
184
  mean Average Precision (mAP)
185
 
186
+ μήπως να χρησιμοποιήσω το test set του hf δατασετ μου poy einai mono labelled?
187
+
188
  ### Results
189
 
190
+ |mAP strict| mAP relaxed | avgmAP |
191
+ |----------|-------------|--------|
192
+ |83.86 |94.27 |89.06 |
193
+
194
 
195
  #### Summary
196