ales committed on
Commit d30d1e5 · 1 Parent(s): 1df6116

update model card README.md

Files changed (2)
  1. README.md +22 -29
  2. train.log +2 -0
README.md CHANGED
@@ -1,41 +1,38 @@
  ---
- language:
- - be
  license: apache-2.0
  tags:
- - whisper-event
  - generated_from_trainer
  datasets:
- - mozilla-foundation/common_voice_11_0
+ - common_voice_11_0
  metrics:
  - wer
  model-index:
- - name: Whisper Tiny Belarusian
+ - name: whisper-tiny-be-test
    results:
    - task:
        name: Automatic Speech Recognition
        type: automatic-speech-recognition
      dataset:
-       name: mozilla-foundation/common_voice_11_0 be
-       type: mozilla-foundation/common_voice_11_0
+       name: common_voice_11_0
+       type: common_voice_11_0
        config: be
        split: validation
        args: be
      metrics:
      - name: Wer
        type: wer
-       value: 52.197802197802204
+       value: 61.72161172161172
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # Whisper Tiny Belarusian
+ # whisper-tiny-be-test

- This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the mozilla-foundation/common_voice_11_0 be dataset.
+ This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the common_voice_11_0 dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.5074
- - Wer: 52.1978
+ - Loss: 0.5790
+ - Wer: 61.7216

  ## Model description

@@ -54,34 +51,30 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 1e-05
+ - learning_rate: 0.0001
  - train_batch_size: 32
  - eval_batch_size: 32
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - training_steps: 300
+ - lr_scheduler_warmup_steps: 10
+ - training_steps: 100
  - mixed_precision_training: Native AMP

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Wer |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|
- | 2.4473 | 0.5 | 10 | 1.3675 | 95.4212 |
- | 1.256 | 1.0 | 20 | 0.9745 | 75.2747 |
- | 0.9934 | 0.3 | 30 | 0.8114 | 72.1612 |
- | 0.9568 | 0.4 | 40 | 0.7814 | 72.7106 |
- | 0.6856 | 0.5 | 50 | 0.7517 | 76.9231 |
- | 0.7808 | 0.6 | 60 | 0.6514 | 63.5531 |
- | 0.6826 | 0.7 | 70 | 0.6197 | 60.4396 |
- | 0.7832 | 0.8 | 80 | 0.6129 | 65.9341 |
- | 0.6031 | 0.9 | 90 | 0.5877 | 61.3553 |
- | 0.6678 | 1.0 | 100 | 0.5759 | 61.5385 |
- | 0.4611 | 0.07 | 110 | 0.5625 | 57.6923 |
- | 0.4451 | 0.13 | 120 | 0.5636 | 56.5934 |
- | 0.3615 | 0.2 | 130 | 0.5490 | 61.1722 |
- | 0.4055 | 0.27 | 140 | 0.5382 | 55.1282 |
- | 0.2946 | 0.33 | 150 | 0.5387 | 55.6777 |
+ | 2.5622 | 0.1 | 10 | 1.5402 | 94.5055 |
+ | 1.3719 | 0.2 | 20 | 1.0012 | 75.2747 |
+ | 0.9898 | 0.3 | 30 | 0.8217 | 72.7106 |
+ | 0.9742 | 0.4 | 40 | 0.7924 | 72.5275 |
+ | 0.6951 | 0.5 | 50 | 0.7628 | 76.1905 |
+ | 0.7824 | 0.6 | 60 | 0.6738 | 65.3846 |
+ | 0.6818 | 0.7 | 70 | 0.6389 | 60.0733 |
+ | 0.7823 | 0.8 | 80 | 0.6208 | 65.7509 |
+ | 0.5994 | 0.9 | 90 | 0.5901 | 61.9048 |
+ | 0.6647 | 1.0 | 100 | 0.5790 | 61.7216 |


  ### Framework versions
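
The updated hyperparameters (learning_rate 0.0001, lr_scheduler_type linear, lr_scheduler_warmup_steps 10, training_steps 100) describe a schedule that ramps up for 10 steps and then decays linearly to zero. A minimal sketch of that idealized curve follows. It assumes the usual linear-warmup formula, and `linear_lr` is a hypothetical helper written for illustration, not code from this repository. The rates logged in train.log need not match it step for step (mixed-precision training can skip optimizer updates, for example), so treat it as illustrative only.

```python
def linear_lr(step, base_lr=1e-4, warmup_steps=10, total_steps=100):
    """Idealized linear warmup followed by linear decay to zero.

    Defaults mirror the hyperparameters listed in the model card;
    the function name and signature are assumptions for this sketch.
    """
    if step < warmup_steps:
        # Warmup phase: scale linearly from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay phase: scale linearly from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for step in (0, 5, 10, 55, 100):
        print(step, linear_lr(step))
```
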
train.log CHANGED
@@ -151,3 +151,5 @@
  {'loss': 0.5994, 'learning_rate': 1.4444444444444444e-05, 'epoch': 0.9}
  {'eval_loss': 0.5900620818138123, 'eval_wer': 61.904761904761905, 'eval_runtime': 17.489, 'eval_samples_per_second': 3.659, 'eval_steps_per_second': 0.114, 'epoch': 0.9}
  {'loss': 0.6647, 'learning_rate': 3.3333333333333333e-06, 'epoch': 1.0}
+ {'eval_loss': 0.5789934992790222, 'eval_wer': 61.72161172161172, 'eval_runtime': 18.4962, 'eval_samples_per_second': 3.46, 'eval_steps_per_second': 0.108, 'epoch': 1.0}
+ {'train_runtime': 873.4716, 'train_samples_per_second': 3.664, 'train_steps_per_second': 0.114, 'train_loss': 1.0103698587417602, 'epoch': 1.0}
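
The entries in train.log are written as Python dict literals (single-quoted keys), so they are not valid JSON. One way to cross-check the README figures against the log is to parse each line with `ast.literal_eval`. The sketch below embeds the four step-level lines from this hunk. `parse_eval_records` is a hypothetical helper, and rounding to four decimals is an assumption chosen to match the table in the model card.

```python
import ast

# Lines copied from the train.log hunk above.
LOG_LINES = [
    "{'loss': 0.5994, 'learning_rate': 1.4444444444444444e-05, 'epoch': 0.9}",
    "{'eval_loss': 0.5900620818138123, 'eval_wer': 61.904761904761905, "
    "'eval_runtime': 17.489, 'eval_samples_per_second': 3.659, "
    "'eval_steps_per_second': 0.114, 'epoch': 0.9}",
    "{'loss': 0.6647, 'learning_rate': 3.3333333333333333e-06, 'epoch': 1.0}",
    "{'eval_loss': 0.5789934992790222, 'eval_wer': 61.72161172161172, "
    "'eval_runtime': 18.4962, 'eval_samples_per_second': 3.46, "
    "'eval_steps_per_second': 0.108, 'epoch': 1.0}",
]


def parse_eval_records(lines):
    """Return (epoch, eval_loss, eval_wer) for every evaluation entry."""
    records = []
    for line in lines:
        entry = ast.literal_eval(line.strip())  # safe parse of a dict literal
        if "eval_wer" in entry:
            records.append(
                (entry["epoch"], round(entry["eval_loss"], 4), round(entry["eval_wer"], 4))
            )
    return records


eval_records = parse_eval_records(LOG_LINES)

if __name__ == "__main__":
    for epoch, loss, wer in eval_records:
        print(f"epoch={epoch} loss={loss} wer={wer}")
```

The last record corresponds to the Loss and Wer values reported in the updated README.md.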