ciderstt committed
Commit b982514 · verified · 1 Parent(s): 3d33d44

End of training

Files changed (2)
  1. README.md +85 -0
  2. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,85 @@
---
library_name: transformers
language:
- nan
license: apache-2.0
base_model: openai/whisper-small
tags:
- whisper-event
- generated_from_trainer
datasets:
- mozilla-foundation/common_voice_11_0
metrics:
- wer
model-index:
- name: Whisper Small taiwanese
  results:
  - task:
      name: Automatic Speech Recognition
      type: automatic-speech-recognition
    dataset:
      name: Common Voice 11.0
      type: mozilla-foundation/common_voice_11_0
      config: nan-tw
      split: test
      args: nan-tw
    metrics:
    - name: Wer
      type: wer
      value: 103.93735044594301
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# Whisper Small taiwanese

This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
It achieves the following results on the evaluation set:
- Loss: 1.2406
- Wer: 103.9374
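
As a minimal usage sketch, the checkpoint can be loaded with the standard `transformers` ASR pipeline. The card does not state the final repository id, so the model path below is a placeholder to replace with the actual checkpoint location:

```python
# Minimal inference sketch for this fine-tuned Whisper checkpoint.
# NOTE: "ciderstt/whisper-small-taiwanese" is a placeholder repo id, not
# confirmed by this card; point it at the actual checkpoint path.
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="ciderstt/whisper-small-taiwanese",  # placeholder
    chunk_length_s=30,  # chunk long audio into 30 s windows
)

# Transcribe a local audio file (the pipeline resamples it automatically).
result = asr("sample.wav")
print(result["text"])
```
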
## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (an illustrative `Seq2SeqTrainingArguments` sketch follows the list):
- learning_rate: 1e-05
- train_batch_size: 64
- eval_batch_size: 8
- seed: 42
- optimizer: adamw_torch (AdamW) with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 500
- training_steps: 5000
- mixed_precision_training: Native AMP
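
Assuming the usual `Seq2SeqTrainer` setup that generates cards like this one, the values above roughly map onto the configuration below. The output directory, evaluation/save cadence, and anything else not listed above are assumptions, not taken from this card:

```python
# Illustrative Seq2SeqTrainingArguments mirroring the hyperparameters listed above.
# output_dir, eval/save cadence, and predict_with_generate are assumptions.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="./whisper-small-taiwanese",  # placeholder
    learning_rate=1e-5,
    per_device_train_batch_size=64,
    per_device_eval_batch_size=8,
    seed=42,
    optim="adamw_torch",
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    warmup_steps=500,
    max_steps=5000,
    fp16=True,               # "Native AMP" mixed precision; requires a CUDA device
    eval_strategy="steps",   # assumed: the results table reports eval every 1000 steps
    eval_steps=1000,
    save_steps=1000,
    predict_with_generate=True,
)
```
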
### Training results

| Training Loss | Epoch   | Step | Validation Loss | Wer      |
|:-------------:|:-------:|:----:|:---------------:|:--------:|
| 0.0007        | 39.005  | 1000 | 1.0996          | 97.8682  |
| 0.0003        | 79.005  | 2000 | 1.1532          | 100.1958 |
| 0.0001        | 119.005 | 3000 | 1.1976          | 102.4146 |
| 0.0001        | 159.005 | 4000 | 1.2206          | 105.9822 |
| 0.0001        | 199.005 | 5000 | 1.2406          | 103.9374 |
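
Note that a WER above 100 is possible: WER = (S + D + I) / N × 100, so hypotheses with more insertions and substitutions than there are reference words push the score past 100. The exact metric code used for this card is not stated; a minimal sketch with the `evaluate` library would look like this:

```python
# Minimal WER computation sketch using the `evaluate` library (assumed, not
# necessarily the code used for this card). Illustrates how WER can exceed 100%.
import evaluate

wer_metric = evaluate.load("wer")

references = ["hello world"]                          # toy reference (2 words)
predictions = ["well hello there cruel cruel world"]  # 4 insertions

wer = 100 * wer_metric.compute(predictions=predictions, references=references)
print(f"WER: {wer:.2f}")  # 4 errors / 2 reference words -> 200.00
```
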

### Framework versions

- Transformers 4.50.0.dev0
- Pytorch 2.5.1+cu124
- Datasets 3.3.2
- Tokenizers 0.21.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fccdead528ba6971ffc0d72c85953efa55cf8aaa6d74014daf29e337fae9ab85
+ oid sha256:b3af7912af9b5d5d03969d17fd3f46954b24f9f304a40841bed2f982a8ce5b47
  size 966995080