update LM eval results
README.md
CHANGED
@@ -20,10 +20,16 @@ model-index:
       type: mozilla-foundation/common_voice_8_0
       args: ky
     metrics:
-    - name: Test WER
+    - name: Test WER (with LM)
+      type: wer
+      value: 19.01
+    - name: Test CER (with LM)
+      type: cer
+      value: 5.38
+    - name: Test WER (no LM)
       type: wer
       value: 31.28
-    - name: Test CER
+    - name: Test CER (no LM)
       type: cer
       value: 7.66
 ---
@@ -45,6 +51,8 @@ For a description of the model architecture, see [facebook/wav2vec2-xls-r-300m](
 
 The model vocabulary consists of the cyrillic alphabet with punctuation removed.
 
+The kenlm language model is built using the text of the train and invalidated corpus splits.
+
 ## Intended uses & limitations
 
 This model is expected to be of some utility for low-fidelity use cases such as:
mozilla-foundation_common_voice_8_0_ky_test_eval_results.txt
CHANGED
@@ -1,2 +1,2 @@
-WER: 0.
-CER: 0.
+WER: 0.19011371973587674
+CER: 0.05388927913480272
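
The results file stores the scores as fractions, while the README metadata reports percentages. The following is a minimal sketch of how the two relate, assuming the README figures were derived from these raw scores; the helper names `to_percent_truncated` and `relative_reduction` are hypothetical and do not come from the repository. The reported CER of 5.38 matches the raw score only when it is truncated to two decimals, since rounding would give 5.39.

```python
import math

# Raw scores copied from the updated eval results file (fractions, not percent).
RAW_WER = 0.19011371973587674
RAW_CER = 0.05388927913480272

# Percentages listed in the README metadata for decoding without the LM.
NO_LM_WER = 31.28
NO_LM_CER = 7.66


def to_percent_truncated(score: float, digits: int = 2) -> float:
    """Convert a fractional error rate to percent, truncating instead of rounding."""
    scale = 10 ** digits
    return math.floor(score * 100 * scale) / scale


def relative_reduction(before: float, after: float) -> float:
    """Relative error reduction in percent when going from `before` to `after`."""
    return (before - after) / before * 100


wer_lm = to_percent_truncated(RAW_WER)  # 19.01, as in the README
cer_lm = to_percent_truncated(RAW_CER)  # 5.38, as in the README

print(f"WER with LM: {wer_lm:.2f} (relative reduction {relative_reduction(NO_LM_WER, wer_lm):.1f}%)")
print(f"CER with LM: {cer_lm:.2f} (relative reduction {relative_reduction(NO_LM_CER, cer_lm):.1f}%)")
```

On these numbers, adding the language model cuts WER by roughly 39% relative and CER by roughly 30% relative.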