Model save

Files changed (3)

README.md +77 -0
model.safetensors +1 -1
runs/Jan03_09-57-35_srvrocgpu011.uct.ac.za/events.out.tfevents.1735891106.srvrocgpu011.uct.ac.za +2 -2

README.md ADDED

@@ -0,0 +1,77 @@

+---
+library_name: transformers
+license: cc-by-nc-4.0
+base_model: facebook/mms-1b-all
+tags:
+- generated_from_trainer
+metrics:
+- wer
+model-index:
+- name: mms-1b-swagen-combined-15hrs-model
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# mms-1b-swagen-combined-15hrs-model
+This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.2307
+- Wer: 0.1929
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0003
+- train_batch_size: 4
+- eval_batch_size: 4
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 8
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 100
+- num_epochs: 30.0
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Wer    |
+|:-------------:|:------:|:----:|:---------------:|:------:|
+| 14.8801       | 0.0797 | 100  | 0.7377          | 0.4426 |
+| 0.6766        | 0.1594 | 200  | 0.2688          | 0.2006 |
+| 0.5153        | 0.2391 | 300  | 0.2484          | 0.1975 |
+| 0.526         | 0.3189 | 400  | 0.2398          | 0.1949 |
+| 0.4874        | 0.3986 | 500  | 0.2398          | 0.1958 |
+| 0.4666        | 0.4783 | 600  | 0.2358          | 0.1909 |
+| 0.4406        | 0.5580 | 700  | 0.2391          | 0.1944 |
+| 0.4689        | 0.6377 | 800  | 0.2334          | 0.1926 |
+| 0.462         | 0.7174 | 900  | 0.2293          | 0.1927 |
+| 0.4407        | 0.7971 | 1000 | 0.2293          | 0.1931 |
+| 0.4567        | 0.8768 | 1100 | 0.2298          | 0.1928 |
+| 0.4711        | 0.9566 | 1200 | 0.2305          | 0.1972 |
+| 0.4444        | 1.0359 | 1300 | 0.2307          | 0.1929 |
+### Framework versions
+- Transformers 4.47.1
+- Pytorch 2.5.1+cu124
+- Datasets 3.2.0
+- Tokenizers 0.21.0
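
The batch-size and epoch figures in the card above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the commit: the variable names are illustrative, the numbers are copied from the hyperparameter list and the first row of the results table, and the steps-per-epoch figure is only approximate because the logged epoch values are rounded to four decimals.

```python
# Values copied from the model card's hyperparameter list.
train_batch_size = 4
gradient_accumulation_steps = 2
num_epochs = 30.0

# Effective batch size per optimizer step. The card reports 8, which matches
# 4 * 2 and therefore implies a single training device.
total_train_batch_size = train_batch_size * gradient_accumulation_steps

# First row of the results table: step 100 was logged at epoch 0.0797.
logged_step, logged_epoch = 100, 0.0797
steps_per_epoch = logged_step / logged_epoch  # approximate (rounded epoch)

# Rough size of the training set and of the full 30-epoch schedule.
approx_train_examples = steps_per_epoch * total_train_batch_size
approx_total_steps = steps_per_epoch * num_epochs

print(f"total_train_batch_size = {total_train_batch_size}")
print(f"steps per epoch       ~ {steps_per_epoch:.0f}")
print(f"training examples     ~ {approx_train_examples:.0f}")
print(f"steps for {num_epochs:.0f} epochs   ~ {approx_total_steps:.0f}")
```

Under these assumptions one epoch is roughly 1,255 optimizer steps, so the last logged row (step 1300, epoch 1.0359) sits just past the first of the 30 configured epochs.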

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e96510ebfb7a9586a05f4f1889b9dea582bbf7959c37f74a214d6932ab5ced5b
+oid sha256:0def71fe7cd6c49edee7f2fb4de67729df92bbdbdcc890aa91f92fe086224b06
 size 3858957536

runs/Jan03_09-57-35_srvrocgpu011.uct.ac.za/events.out.tfevents.1735891106.srvrocgpu011.uct.ac.za CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:03af34bc6dc919583ab24c5683916b85d8f8dfa47db6ef67ae852f4c7ad1a76b
-size 12913
+oid sha256:b3b04ec3882d33b8cfecee01e8faa3910cdd7a7db3a66eac835f93e7a327b329
+size 13796
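
The two binary files in this commit are stored as Git LFS pointers, which record only a spec version, a SHA-256 object ID, and a byte size. A downloaded file can be checked against its pointer along the following lines. This is a hedged sketch: `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names introduced here for illustration, not part of Git LFS or the Hugging Face tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and hash the pointer records."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so multi-gigabyte weights do not need to fit in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New pointer for the TensorBoard event file, copied from the diff above.
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:b3b04ec3882d33b8cfecee01e8faa3910cdd7a7db3a66eac835f93e7a327b329
size 13796
""")
print(new_pointer["algo"], new_pointer["size"])
```

Calling `matches_pointer(local_path, new_pointer)` on a local copy of the event file would then confirm that the download corresponds to this commit. Note that `model.safetensors` keeps the same size (3858957536 bytes) across the change, so only the hash distinguishes the old weights from the new ones.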