metadata
language: sv-SE
datasets:
- common_voice
- NST Swedish ASR Database
metrics:
- wer
tags:
- audio
- automatic-speech-recognition
- speech
- voxpopuli
license: cc-by-nc-4.0
model-index:
- name: Wav2vec 2.0 large VoxPopuli-sv swedish
results:
- task:
name: Speech Recognition
type: automatic-speech-recognition
dataset:
name: NST Swedish ASR Database
metrics:
- name: Test WER
type: wer
value: 5.192353080009441
- task:
name: Speech Recognition
type: automatic-speech-recognition
dataset:
name: Common Voice
type: common_voice
args: sv-SE
metrics:
- name: Test WER
type: wer
value: 17.37743757973392
Wav2vec 2.0 large-voxpopuli-sv-swedish
Finetuned version of Facebooks VoxPopuli-sv large model. WER for NST + Common Voice test set (2% of total sentences) is 5.19%. WER for Common Voice test set is 17.38%.