jnemecek commited on
Commit
229c45e
·
1 Parent(s): 14976b6

add details to dataset card

Browse files
Files changed (1) hide show
  1. README.md +13 -6
README.md CHANGED
@@ -7,12 +7,17 @@ model-index:
7
  results: []
8
  ---
9
 
10
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
- should probably proofread and complete it, then remove this comment. -->
 
 
 
 
 
12
 
13
  # wav2vec2-large-xls-r-300m-kaqchikel-with-bloom
14
 
15
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the Viña and Bloom audio datasets.
16
  It achieves the following results on the evaluation set:
17
  - Loss: 0.6700
18
  - Cer: 0.0854
@@ -20,18 +25,20 @@ It achieves the following results on the evaluation set:
20
 
21
  ## Model description
22
 
23
- More information needed
24
 
25
  ## Intended uses & limitations
26
 
27
- More information needed
28
 
29
  ## Training and evaluation data
30
 
31
- More information needed
32
 
33
  ## Training procedure
34
 
 
 
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
 
7
  results: []
8
  ---
9
 
10
+ ## Model Description
11
+
12
+ - **Homepage:** [SIL AI](https://ai.sil.org/)
13
+ - **Point of Contact:** [SIL AI email](mailto:[email protected])
14
+ - **Source Data:** [Bloom Library](https://bloomlibrary.org/) and [Viña Studios](https://www.vinyastudios.org)
15
+
16
+
17
 
18
  # wav2vec2-large-xls-r-300m-kaqchikel-with-bloom
19
 
20
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on a collection of audio from [Deditos](deditos.org) videos in Kaqchikel provided by [Viña Studios](www.vinyastudios.org) and Kaqchikel audio from audiobooks on [Bloom Library](bloomlibrary.org).
21
  It achieves the following results on the evaluation set:
22
  - Loss: 0.6700
23
  - Cer: 0.0854
 
25
 
26
  ## Model description
27
 
28
+ This model is a baseline model finetuned from [XLS-R 300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m). Users should refer to the original model for tutorials on using a trained model for inference.
29
 
30
  ## Intended uses & limitations
31
 
32
+ Users of this model must abide by the [UN Declarations on the Rights of Indigenous Peoples](https://www.un.org/development/desa/indigenouspeoples/declaration-on-the-rights-of-indigenous-peoples.html).
33
 
34
  ## Training and evaluation data
35
 
36
+ Training, Validation, and Test datasets were generated from the same corpus, ensuring that no duplicate files were used.
37
 
38
  ## Training procedure
39
 
40
+ Standard finetuning of XLS-R was used based on the examples in the [Hugging Face Transformers Github](https://github.com/huggingface/transformers/tree/main/examples/pytorch/speech-recognition)
41
+
42
  ### Training hyperparameters
43
 
44
  The following hyperparameters were used during training: