add details to dataset card
Browse files
README.md
CHANGED
@@ -7,12 +7,17 @@ model-index:
|
|
7 |
results: []
|
8 |
---
|
9 |
|
10 |
-
|
11 |
-
|
|
|
|
|
|
|
|
|
|
|
12 |
|
13 |
# wav2vec2-large-xls-r-300m-kaqchikel-with-bloom
|
14 |
|
15 |
-
This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on
|
16 |
It achieves the following results on the evaluation set:
|
17 |
- Loss: 0.6700
|
18 |
- Cer: 0.0854
|
@@ -20,18 +25,20 @@ It achieves the following results on the evaluation set:
|
|
20 |
|
21 |
## Model description
|
22 |
|
23 |
-
|
24 |
|
25 |
## Intended uses & limitations
|
26 |
|
27 |
-
|
28 |
|
29 |
## Training and evaluation data
|
30 |
|
31 |
-
|
32 |
|
33 |
## Training procedure
|
34 |
|
|
|
|
|
35 |
### Training hyperparameters
|
36 |
|
37 |
The following hyperparameters were used during training:
|
|
|
7 |
results: []
|
8 |
---
|
9 |
|
10 |
+
## Model Description
|
11 |
+
|
12 |
+
- **Homepage:** [SIL AI](https://ai.sil.org/)
|
13 |
+
- **Point of Contact:** [SIL AI email](mailto:[email protected])
|
14 |
+
- **Source Data:** [Bloom Library](https://bloomlibrary.org/) and [Viña Studios](https://www.vinyastudios.org)
|
15 |
+
|
16 |
+
|
17 |
|
18 |
# wav2vec2-large-xls-r-300m-kaqchikel-with-bloom
|
19 |
|
20 |
+
This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on a collection of audio from [Deditos](deditos.org) videos in Kaqchikel provided by [Viña Studios](www.vinyastudios.org) and Kaqchikel audio from audiobooks on [Bloom Library](bloomlibrary.org).
|
21 |
It achieves the following results on the evaluation set:
|
22 |
- Loss: 0.6700
|
23 |
- Cer: 0.0854
|
|
|
25 |
|
26 |
## Model description
|
27 |
|
28 |
+
This model is a baseline model finetuned from [XLS-R 300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m). Users should refer to the original model for tutorials on using a trained model for inference.
|
29 |
|
30 |
## Intended uses & limitations
|
31 |
|
32 |
+
Users of this model must abide by the [UN Declarations on the Rights of Indigenous Peoples](https://www.un.org/development/desa/indigenouspeoples/declaration-on-the-rights-of-indigenous-peoples.html).
|
33 |
|
34 |
## Training and evaluation data
|
35 |
|
36 |
+
Training, Validation, and Test datasets were generated from the same corpus, ensuring that no duplicate files were used.
|
37 |
|
38 |
## Training procedure
|
39 |
|
40 |
+
Standard finetuning of XLS-R was used based on the examples in the [Hugging Face Transformers Github](https://github.com/huggingface/transformers/tree/main/examples/pytorch/speech-recognition)
|
41 |
+
|
42 |
### Training hyperparameters
|
43 |
|
44 |
The following hyperparameters were used during training:
|