manandey
/

wav2vec2-large-xlsr-assamese

Automatic Speech Recognition

xlsr-fine-tuning-week

Inference Endpoints

Model card Files Files and versions Community

manandey commited on Mar 28, 2021

Commit

66faa1b

·

1 Parent(s): 84b83a9

Update README.md

Files changed (1) hide show

README.md +4 -3

README.md CHANGED Viewed

@@ -1,7 +1,9 @@
 ---
 language: as
 datasets:
-- common_voice
 tags:
 - audio
 - automatic-speech-recognition
@@ -23,7 +25,6 @@ model-index:
          type: wer
          value: 74.25
 ---
 # Wav2Vec2-Large-XLSR-53-Assamese
 Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) in Assamese using the [Common Voice](https://huggingface.co/datasets/common_voice)
@@ -78,7 +79,7 @@ chars_to_ignore_regex = '[\,\?\.\!\-\;\:\"\“\%\‘\”\�\'\।]'
 resampler = torchaudio.transforms.Resample(48_000, 16_000)
 # Preprocessing the datasets.
-# We need to read the aduio files as arrays
 def speech_file_to_array_fn(batch):
     batch["sentence"] = re.sub(chars_to_ignore_regex, '', batch["sentence"]).lower()

 ---
 language: as
 datasets:
+- common_voice
+metrics:
+- wer
 tags:
 - audio
 - automatic-speech-recognition
          type: wer
          value: 74.25
 ---
 # Wav2Vec2-Large-XLSR-53-Assamese
 Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) in Assamese using the [Common Voice](https://huggingface.co/datasets/common_voice)
 resampler = torchaudio.transforms.Resample(48_000, 16_000)
 # Preprocessing the datasets.
+# We need to read the audio files as arrays
 def speech_file_to_array_fn(batch):
     batch["sentence"] = re.sub(chars_to_ignore_regex, '', batch["sentence"]).lower()