csikasote committed
Commit 91c9023 · verified · 1 Parent(s): a7f181a

Model save

Files changed (2)
  1. README.md +84 -0
  2. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,84 @@
+ ---
+ license: cc-by-nc-4.0
+ base_model: facebook/mms-1b-all
+ tags:
+ - generated_from_trainer
+ metrics:
+ - wer
+ model-index:
+ - name: mms-1b-bem-male-sv
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/cicasote/huggingface/runs/x8tbh9an)
+ # mms-1b-bem-male-sv
+
+ This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.1409
+ - Wer: 0.3498
+
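The `Wer` figure reported above is a word error rate: the word-level edit distance between a predicted transcript and its reference, divided by the number of reference words. The sketch below is a minimal pure-Python illustration of that definition only. The function name `word_error_rate` and the sample strings are hypothetical, and the evaluation for this run presumably used a library metric rather than this code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the first i-1 reference words
    # and the first j hypothesis words (row i-1 of the DP table).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # aligning i reference words with an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Illustrative strings only, not taken from the evaluation set.
print(word_error_rate("a b c d", "a x c d"))
```

Read this way, a WER of 0.3498 corresponds to roughly 35 word-level errors per 100 reference words.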
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 0.001
+ - train_batch_size: 8
+ - eval_batch_size: 8
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 500
+ - num_epochs: 5.0
+ - mixed_precision_training: Native AMP
+
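The `lr_scheduler_type: linear` and `lr_scheduler_warmup_steps: 500` settings describe a schedule that ramps the learning rate linearly from 0 to 0.001 over the first 500 optimizer steps, then decays it linearly to 0 by the end of training. A minimal plain-Python sketch of that shape follows. The card does not state the total step count: `TOTAL_STEPS = 4580` is an assumption extrapolated from the results table (step 4400 at epoch 4.8035, so about 916 steps per epoch over 5 epochs), and `linear_warmup_lr` is a hypothetical helper name.

```python
BASE_LR = 1e-3       # learning_rate from the card
WARMUP_STEPS = 500   # lr_scheduler_warmup_steps from the card
TOTAL_STEPS = 4580   # ASSUMPTION: ~916 steps/epoch * 5 epochs, not stated in the card


def linear_warmup_lr(step: int,
                     base_lr: float = BASE_LR,
                     warmup_steps: int = WARMUP_STEPS,
                     total_steps: int = TOTAL_STEPS) -> float:
    """Learning rate at a given optimizer step under linear warmup and decay."""
    if step < warmup_steps:
        # Warmup: scale linearly from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay: scale linearly from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Print the rate at a few steps that appear in the results table.
for step in (200, 400, 600, 2000, 4400):
    print(step, linear_warmup_lr(step))
```

Under these assumptions the first two evaluations (steps 200 and 400) fall inside the warmup phase, before the learning rate has reached its peak.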
+ ### Training results
+
+ | Training Loss | Epoch  | Step | Validation Loss | Wer    |
+ |:-------------:|:------:|:----:|:---------------:|:------:|
+ | No log        | 0.2183 | 200  | 0.1927          | 0.4257 |
+ | No log        | 0.4367 | 400  | 0.1713          | 0.3885 |
+ | 2.0358        | 0.6550 | 600  | 0.1760          | 0.3907 |
+ | 2.0358        | 0.8734 | 800  | 0.1819          | 0.4143 |
+ | 0.519         | 1.0917 | 1000 | 0.1611          | 0.3869 |
+ | 0.519         | 1.3100 | 1200 | 0.1550          | 0.3736 |
+ | 0.519         | 1.5284 | 1400 | 0.1538          | 0.3771 |
+ | 0.4764        | 1.7467 | 1600 | 0.1744          | 0.4176 |
+ | 0.4764        | 1.9651 | 1800 | 0.1598          | 0.3884 |
+ | 0.4501        | 2.1834 | 2000 | 0.1507          | 0.3577 |
+ | 0.4501        | 2.4017 | 2200 | 0.1535          | 0.3763 |
+ | 0.4501        | 2.6201 | 2400 | 0.1502          | 0.3649 |
+ | 0.4422        | 2.8384 | 2600 | 0.1457          | 0.3502 |
+ | 0.4422        | 3.0568 | 2800 | 0.1485          | 0.3580 |
+ | 0.4217        | 3.2751 | 3000 | 0.1480          | 0.3547 |
+ | 0.4217        | 3.4934 | 3200 | 0.1498          | 0.3666 |
+ | 0.4217        | 3.7118 | 3400 | 0.1458          | 0.3494 |
+ | 0.4144        | 3.9301 | 3600 | 0.1427          | 0.3574 |
+ | 0.4144        | 4.1485 | 3800 | 0.1445          | 0.3594 |
+ | 0.3926        | 4.3668 | 4000 | 0.1462          | 0.3666 |
+ | 0.3926        | 4.5852 | 4200 | 0.1432          | 0.3527 |
+ | 0.3926        | 4.8035 | 4400 | 0.1409          | 0.3498 |
+
+
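The headline numbers in the card (loss 0.1409, WER 0.3498) match the last evaluation row, at step 4400. The short sketch below re-reads the logged rows to check which steps minimise each metric. The `ROWS` list is copied from the table above as `(step, validation_loss, wer)` tuples; the variable names are illustrative and not part of the training code.

```python
# (step, validation_loss, wer), copied from the training results table.
ROWS = [
    (200, 0.1927, 0.4257), (400, 0.1713, 0.3885), (600, 0.1760, 0.3907),
    (800, 0.1819, 0.4143), (1000, 0.1611, 0.3869), (1200, 0.1550, 0.3736),
    (1400, 0.1538, 0.3771), (1600, 0.1744, 0.4176), (1800, 0.1598, 0.3884),
    (2000, 0.1507, 0.3577), (2200, 0.1535, 0.3763), (2400, 0.1502, 0.3649),
    (2600, 0.1457, 0.3502), (2800, 0.1485, 0.3580), (3000, 0.1480, 0.3547),
    (3200, 0.1498, 0.3666), (3400, 0.1458, 0.3494), (3600, 0.1427, 0.3574),
    (3800, 0.1445, 0.3594), (4000, 0.1462, 0.3666), (4200, 0.1432, 0.3527),
    (4400, 0.1409, 0.3498),
]

# Pick the evaluation step that minimises each metric.
best_by_loss = min(ROWS, key=lambda row: row[1])
best_by_wer = min(ROWS, key=lambda row: row[2])

print("lowest validation loss:", best_by_loss)
print("lowest WER:", best_by_wer)
```

On these rows, step 4400 has the lowest validation loss (0.1409), while step 3400 has a marginally lower WER (0.3494 versus 0.3498).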
+ ### Framework versions
+
+ - Transformers 4.43.0
+ - Pytorch 2.3.1+cu121
+ - Datasets 2.21.0
+ - Tokenizers 0.19.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:ebd95c7b3e010bf88d999f76c812ef9f134249170b34d5b83ca15084278d8a7f
+ oid sha256:5b364271d9d94ce457ae4a310ac7fdebf5ff34a873b795b8bc32300266145ecf
  size 3858890924
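
The `model.safetensors` change only swaps the Git LFS pointer: `oid` is the SHA-256 digest of the real weight file and `size` is its length in bytes, which is unchanged here. As a hedged sketch, a downloaded copy could be checked against the new pointer as shown below. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and any local path passed to `matches_pointer` is an assumption about where the file was saved.

```python
import hashlib
import os


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and digest."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Hash in chunks so a multi-gigabyte file is never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents as they appear on the new side of the diff.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:5b364271d9d94ce457ae4a310ac7fdebf5ff34a873b795b8bc32300266145ecf
size 3858890924
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

With a local copy available, `matches_pointer("model.safetensors", pointer)` would confirm that the download corresponds to this commit rather than to the parent.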