Kleo/meltemi_arg2kp_matcher

Text Classification

PEFT

Safetensors

Greek

argumentation

Kleo committed on Jan 27

Commit 904d6ae (verified) · 1 Parent(s): c53504a

Update README.md

Files changed (1)

README.md +37 -13


@@ -48,13 +48,6 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
-### Direct Use
-<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
-[More Information Needed]
 ## Bias, Risks, and Limitations
@@ -125,11 +118,35 @@ for idx, result in enumerate(results, start=1):
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
-[ArgKP_2021_GR]](https://huggingface.co/datasets/Kleo/ArgKP_2021_GR)
 ### Training Procedure
-<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
 #### Preprocessing
 Social media text removal
@@ -143,7 +160,9 @@ Social media text removal
 #### Speeds, Sizes, Times [optional]
 <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
 ## Evaluation
@@ -155,7 +174,7 @@ Social media text removal
 <!-- This should link to a Dataset Card if possible. -->
-[More Information Needed]
 #### Metrics
@@ -164,9 +183,14 @@ Social media text removal
 mean Average Precision (mAP)
 ### Results
-[More Information Needed]
 #### Summary

 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 ## Bias, Risks, and Limitations
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+Machine translated train set of [ArgKP_2021_GR](https://huggingface.co/datasets/Kleo/ArgKP_2021_GR)
 ### Training Procedure
+The following hyperparameters were used during training:
+learning_rate:  1e-4
+train_batch_size: 16
+eval_batch_size: 16
+seed: 42
+num_devices: 1
+gradient_accumulation_steps: 2
+optimizer: paged Adam optimizer
+lr_scheduler_type: linear
+Weight Decay: 0.01
+max_grad_norm: 0.3
+max_seq_length: 512
+num_epochs: 1
+##################################################################
+LoRA Hyperparameters
+LoRA r: 8
+LoRA alpha: 8
+LoRA dropout: 0.0
+LoRA bias: 'none'
+target_modules: q_proj, v_proj
+task_type: "SEQ_CLS"
+Loss: Binary Cross Entropy
+trainable parameters: 3,416,064 (~0.05% of the original model)
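As a rough sketch, the settings listed above could be written as keyword dictionaries for `peft.LoraConfig` and `transformers.TrainingArguments`. The mapping is an assumption, since the README does not show the training script: the "paged Adam optimizer" is guessed to be `paged_adamw_32bit`, and the gradient-norm entry (0.3) is read as `max_grad_norm`. The parameter-count check further assumes a Mistral-7B-style base (32 layers, hidden size 4096, 1024-dimensional value projection) and a bias-free 2-label classification head, none of which is stated in the README.

```python
# Sketch only: values come from the README, key names are assumed to map
# onto peft.LoraConfig / transformers.TrainingArguments keyword arguments.

lora_kwargs = {
    "r": 8,
    "lora_alpha": 8,
    "lora_dropout": 0.0,
    "bias": "none",
    "target_modules": ["q_proj", "v_proj"],
    "task_type": "SEQ_CLS",
}

training_kwargs = {
    "learning_rate": 1e-4,
    "per_device_train_batch_size": 16,
    "per_device_eval_batch_size": 16,
    "seed": 42,
    "gradient_accumulation_steps": 2,
    "optim": "paged_adamw_32bit",  # assumption: the exact paged variant is not stated
    "lr_scheduler_type": "linear",
    "weight_decay": 0.01,
    "max_grad_norm": 0.3,
    "num_train_epochs": 1,
}

NUM_DEVICES = 1
MAX_SEQ_LENGTH = 512  # applied at tokenization time, not a TrainingArguments field


def effective_batch_size(kwargs, num_devices):
    """Examples contributing to each optimizer step."""
    return (
        kwargs["per_device_train_batch_size"]
        * kwargs["gradient_accumulation_steps"]
        * num_devices
    )


def lora_trainable_params(r, num_layers, hidden, q_out, v_out, num_labels):
    """Count LoRA weights on q_proj and v_proj plus a bias-free classifier head."""
    # Each adapted linear layer adds A (r x in_features) and B (out_features x r).
    per_layer = r * (hidden + q_out) + r * (hidden + v_out)
    head = hidden * num_labels
    return num_layers * per_layer + head


batch = effective_batch_size(training_kwargs, NUM_DEVICES)
# Architecture numbers below are assumptions (Mistral-7B-style base model).
params = lora_trainable_params(
    r=lora_kwargs["r"], num_layers=32, hidden=4096,
    q_out=4096, v_out=1024, num_labels=2,
)
print(batch, params)
```

Under these assumed dimensions the count agrees with the 3,416,064 trainable parameters reported above, and the effective batch size is 32. The dictionaries could then be unpacked as `LoraConfig(**lora_kwargs)` and `TrainingArguments(output_dir="out", **training_kwargs)`, where `"out"` is a placeholder path.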
+#### Training hyperparameters
 #### Preprocessing
 Social media text removal
 #### Speeds, Sizes, Times [optional]
 <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+Num checkpoints: 5
+Checkpoint size: 36.5 MB
+Training duration per checkpoint: 4.15 hours
 ## Evaluation
 <!-- This should link to a Dataset Card if possible. -->
+Human translated test set of [ArgKP_2021_GR](https://huggingface.co/datasets/Kleo/ArgKP_2021_GR)
 #### Metrics
 mean Average Precision (mAP)
+Should I perhaps use the test set of my HF dataset, which is only labelled?
 ### Results
+|mAP strict| mAP relaxed | avgmAP |
+|----------|-------------|--------|
+|83.86     |94.27        |89.06   |
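The README does not define the three columns, so the following is only a simplified sketch of how such scores could be computed. It assumes the convention of the ArgKP-2021 shared task, where undecided argument/key-point pairs count as non-matches under the strict setting and as matches under the relaxed setting, and it assumes that avgmAP is the arithmetic mean of the two. The toy data and function names are hypothetical.

```python
def average_precision(scores, labels):
    """AP for one ranked list: mean of precision@k over the ranks of the positives."""
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits, precisions = 0, []
    for k, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            precisions.append(hits / k)
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(groups, undecided_as):
    """groups: list of (scores, labels); labels are 1, 0, or None for undecided pairs."""
    aps = []
    for scores, labels in groups:
        resolved = [undecided_as if label is None else label for label in labels]
        aps.append(average_precision(scores, resolved))
    return sum(aps) / len(aps)


# Hypothetical toy data: two groups of three scored pairs each.
groups = [
    ([0.9, 0.8, 0.3], [1, None, 0]),
    ([0.7, 0.6, 0.2], [0, 1, None]),
]
strict = mean_average_precision(groups, undecided_as=0)   # undecided -> no match
relaxed = mean_average_precision(groups, undecided_as=1)  # undecided -> match

# Averaging the two reported scores, assuming avgmAP is their arithmetic mean.
reported_avg = (83.86 + 94.27) / 2
print(strict, relaxed, reported_avg)
```

The mean of the two reported values is about 89.065, which is consistent with the 89.06 in the table if avgmAP is indeed a plain average.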
 #### Summary