Update README.md
Browse files
README.md
CHANGED
@@ -48,13 +48,6 @@ This is the model card of a 🤗 transformers model that has been pushed on the
|
|
48 |
|
49 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
50 |
|
51 |
-
### Direct Use
|
52 |
-
|
53 |
-
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
|
54 |
-
|
55 |
-
[More Information Needed]
|
56 |
-
|
57 |
-
|
58 |
|
59 |
## Bias, Risks, and Limitations
|
60 |
|
@@ -125,11 +118,35 @@ for idx, result in enumerate(results, start=1):
|
|
125 |
|
126 |
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
127 |
|
128 |
-
[ArgKP_2021_GR]
|
129 |
|
130 |
### Training Procedure
|
131 |
-
|
132 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
133 |
|
134 |
#### Preprocessing
|
135 |
Social media text removal
|
@@ -143,7 +160,9 @@ Social media text removal
|
|
143 |
#### Speeds, Sizes, Times [optional]
|
144 |
|
145 |
<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
|
146 |
-
|
|
|
|
|
147 |
|
148 |
## Evaluation
|
149 |
|
@@ -155,7 +174,7 @@ Social media text removal
|
|
155 |
|
156 |
<!-- This should link to a Dataset Card if possible. -->
|
157 |
|
158 |
-
|
159 |
|
160 |
|
161 |
#### Metrics
|
@@ -164,9 +183,14 @@ Social media text removal
|
|
164 |
|
165 |
mean Average Precision (mAP)
|
166 |
|
|
|
|
|
167 |
### Results
|
168 |
|
169 |
-
|
|
|
|
|
|
|
170 |
|
171 |
#### Summary
|
172 |
|
|
|
48 |
|
49 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
50 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
51 |
|
52 |
## Bias, Risks, and Limitations
|
53 |
|
|
|
118 |
|
119 |
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
120 |
|
121 |
+
Machine translated train set of [ArgKP_2021_GR](https://huggingface.co/datasets/Kleo/ArgKP_2021_GR)
|
122 |
|
123 |
### Training Procedure
|
124 |
+
The following hyperparameters were used during training:
|
125 |
+
learning_rate: 1e-4
|
126 |
+
train_batch_size: 16
|
127 |
+
eval_batch_size: 16
|
128 |
+
seed: 42
|
129 |
+
num_devices: 1
|
130 |
+
gradient_accumulation_steps: 2
|
131 |
+
optimizer: paged Adam optimizer
|
132 |
+
lr_scheduler_type: linear
|
133 |
+
Weight Decay: 0.01
|
134 |
+
M. G. Norm: 0.3
|
135 |
+
max_seq_length: 512
|
136 |
+
num_epochs: 1
|
137 |
+
##################################################################
|
138 |
+
LoRa Hyperparameters
|
139 |
+
LoRA r :8
|
140 |
+
LoRA alpha: 8
|
141 |
+
LoRA dropout: 0.0
|
142 |
+
LoRA bias: ‘none'
|
143 |
+
target_modules: q_proj, v_proj
|
144 |
+
task_type: "SEQ_CLS"
|
145 |
+
Loss: Binary Cross Entropy
|
146 |
+
trainable parameters: 3,416,064 (~5% of the original model)
|
147 |
+
|
148 |
+
|
149 |
+
#### Training hyperparameters
|
150 |
|
151 |
#### Preprocessing
|
152 |
Social media text removal
|
|
|
160 |
#### Speeds, Sizes, Times [optional]
|
161 |
|
162 |
<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
|
163 |
+
Num checkpoints: 5
|
164 |
+
Checkpoint size: 36,5 MB
|
165 |
+
Training duration per checkpoint: 4.15 hours
|
166 |
|
167 |
## Evaluation
|
168 |
|
|
|
174 |
|
175 |
<!-- This should link to a Dataset Card if possible. -->
|
176 |
|
177 |
+
Human translated test set of [ArgKP_2021_GR](https://huggingface.co/datasets/Kleo/ArgKP_2021_GR)
|
178 |
|
179 |
|
180 |
#### Metrics
|
|
|
183 |
|
184 |
mean Average Precision (mAP)
|
185 |
|
186 |
+
μήπως να χρησιμοποιήσω το test set του hf δατασετ μου poy einai mono labelled?
|
187 |
+
|
188 |
### Results
|
189 |
|
190 |
+
|mAP strict| mAP relaxed | avgmAP |
|
191 |
+
|----------|-------------|--------|
|
192 |
+
|83.86 |94.27 |89.06 |
|
193 |
+
|
194 |
|
195 |
#### Summary
|
196 |
|