sil-ai/w2v2-kaqchikel

Automatic Speech Recognition

Generated from Trainer

jnemecek committed on Aug 24, 2022

Commit 229c45e · 1 Parent(s): 14976b6

add details to dataset card

Files changed (1)

README.md +13 -6

@@ -7,12 +7,17 @@ model-index:
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # wav2vec2-large-xls-r-300m-kaqchikel-with-bloom
-This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the Viña and Bloom audio datasets.
 It achieves the following results on the evaluation set:
 - Loss: 0.6700
 - Cer: 0.0854
@@ -20,18 +25,20 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:

   results: []
 ---
+## Model Description
+- **Homepage:** [SIL AI](https://ai.sil.org/)
+- **Point of Contact:** [SIL AI email](mailto:[email protected])
+- **Source Data:** [Bloom Library](https://bloomlibrary.org/) and [Viña Studios](https://www.vinyastudios.org)
 # wav2vec2-large-xls-r-300m-kaqchikel-with-bloom
+This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on a collection of audio from [Deditos](https://deditos.org) videos in Kaqchikel provided by [Viña Studios](https://www.vinyastudios.org) and Kaqchikel audio from audiobooks on [Bloom Library](https://bloomlibrary.org/).
 It achieves the following results on the evaluation set:
 - Loss: 0.6700
 - Cer: 0.0854
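
For readers unfamiliar with the `Cer` figure above, here is a minimal sketch of how a character error rate is commonly computed: total character-level edit distance divided by total reference characters. The card does not state which implementation produced its number, so the function names `edit_distance` and `cer` and the corpus-level aggregation are assumptions for illustration only.

```python
from typing import Sequence


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Corpus-level CER (assumed definition): total edits / total reference characters."""
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    return total_edits / total_chars
```

Under this definition, a CER of 0.0854 would correspond to roughly 8.5 character edits per 100 reference characters.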
 ## Model description
+This model is a baseline fine-tuned from [XLS-R 300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m). For tutorials on running inference with a trained model, refer to the original model.
 ## Intended uses & limitations
+Users of this model must abide by the [United Nations Declaration on the Rights of Indigenous Peoples](https://www.un.org/development/desa/indigenouspeoples/declaration-on-the-rights-of-indigenous-peoples.html).
 ## Training and evaluation data
+The training, validation, and test sets were generated from the same corpus, ensuring that no duplicate files were used.
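
The card does not describe how the splits were produced. As one possible illustration of splitting a single corpus with no duplicate files, the sketch below drops byte-identical files by content hash and then partitions the remainder. The function name `split_without_duplicates`, the SHA-256 check, the 80/10/10 ratios, and the fixed seed are all assumptions, not the authors' procedure.

```python
import hashlib
import random
from typing import Dict, List


def split_without_duplicates(
    files: Dict[str, bytes],
    seed: int = 0,
    val_frac: float = 0.1,   # assumed ratio, not taken from the card
    test_frac: float = 0.1,  # assumed ratio, not taken from the card
) -> Dict[str, List[str]]:
    """Split file names into train/validation/test after dropping byte-identical files."""
    seen = set()
    unique = []
    # Iterate in sorted order so the surviving copy of a duplicate is deterministic.
    for name in sorted(files):
        digest = hashlib.sha256(files[name]).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(name)

    # Seeded shuffle keeps the split reproducible.
    random.Random(seed).shuffle(unique)
    n_test = int(len(unique) * test_frac)
    n_val = int(len(unique) * val_frac)
    return {
        "test": unique[:n_test],
        "validation": unique[n_test:n_test + n_val],
        "train": unique[n_test + n_val:],
    }
```

Because each file name lands in exactly one slice of the shuffled list, no file can appear in more than one split.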
 ## Training procedure
+Standard fine-tuning of XLS-R was used, based on the examples in the [Hugging Face Transformers GitHub repository](https://github.com/huggingface/transformers/tree/main/examples/pytorch/speech-recognition).
 ### Training hyperparameters
 The following hyperparameters were used during training: