adrianSauer committed on
Commit a63a0b1 · verified · 1 Parent(s): b0a9ff3

End of training

Files changed (1)
  1. README.md +13 -11
README.md CHANGED
@@ -19,8 +19,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [glob-asr/wav2vec2-large-xls-r-300m-guarani-small](https://huggingface.co/glob-asr/wav2vec2-large-xls-r-300m-guarani-small) on the Common Voice 16 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3202
-- Cer: 7.2954
+- Loss: 0.3490
+- Cer: 7.9739
 
 ## Model description
 
@@ -40,9 +40,11 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 16
+- train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
 - lr_scheduler_warmup_steps: 50
@@ -53,16 +55,16 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Cer |
 |:-------------:|:------:|:----:|:---------------:|:------:|
-| 0.4174 | 1.0101 | 100 | 0.3535 | 8.1385 |
-| 0.3411 | 2.0202 | 200 | 0.3387 | 7.8574 |
-| 0.2905 | 3.0303 | 300 | 0.3278 | 7.6076 |
-| 0.2591 | 4.0404 | 400 | 0.3214 | 7.3734 |
-| 0.251 | 5.0505 | 500 | 0.3202 | 7.2954 |
+| 0.4171 | 1.0152 | 100 | 0.3798 | 8.5448 |
+| 0.3367 | 2.0305 | 200 | 0.3633 | 8.4451 |
+| 0.2914 | 3.0457 | 300 | 0.3564 | 8.1279 |
+| 0.261 | 4.0609 | 400 | 0.3494 | 7.9014 |
+| 0.2464 | 5.0761 | 500 | 0.3490 | 7.9739 |
 
 
 ### Framework versions
 
-- Transformers 4.40.1
-- Pytorch 2.2.1+cu121
-- Datasets 2.19.0
+- Transformers 4.44.0
+- Pytorch 2.3.1+cu121
+- Datasets 2.21.0
 - Tokenizers 0.19.1
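The hyperparameter change above halves the per-device batch size (16 → 8) but adds 2 gradient-accumulation steps, so the effective batch size per optimizer update stays at 16. A minimal sketch of that arithmetic and of the `constant_with_warmup` schedule listed in the card (hypothetical helper names, assuming single-device training; this is not code from the training run):

```python
def effective_batch_size(per_device: int, grad_accum_steps: int, num_devices: int = 1) -> int:
    # Number of samples contributing to each optimizer update.
    return per_device * grad_accum_steps * num_devices


def constant_with_warmup_lr(step: int, base_lr: float = 1e-05, warmup_steps: int = 50) -> float:
    # Linear ramp from 0 to base_lr over warmup_steps, then constant thereafter.
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    return base_lr


# Old run (per-device 16, no accumulation) and new run (per-device 8, accumulate 2)
# both update on 16 samples at a time:
assert effective_batch_size(16, 1) == effective_batch_size(8, 2) == 16
```

Keeping the effective batch size fixed while shrinking the per-device batch is a common way to fit the same training recipe into less GPU memory.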
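The `Cer` column in the results is the character error rate, reported here as a percentage: character-level edit distance between hypothesis and reference, divided by reference length. A minimal edit-distance-based sketch for illustration (not necessarily the exact implementation used during evaluation):

```python
def cer(reference: str, hypothesis: str) -> float:
    # Character error rate in percent: Levenshtein distance / len(reference) * 100.
    m, n = len(reference), len(hypothesis)
    prev = list(range(n + 1))  # distances for the empty reference prefix
    for i in range(1, m + 1):
        curr = [i] + [0] * n
        for j in range(1, n + 1):
            cost = 0 if reference[i - 1] == hypothesis[j - 1] else 1
            curr[j] = min(prev[j] + 1,        # deletion
                          curr[j - 1] + 1,    # insertion
                          prev[j - 1] + cost) # substitution (or match)
        prev = curr
    return 100.0 * prev[n] / m


# One wrong character in a 10-character reference → CER of 10.0
assert cer("abcdefghij", "abcdefghiz") == 10.0
```

On this scale, the new run's 7.9739 means roughly 8 character errors per 100 reference characters.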