pongjin's picture
Update README.md
7e43bf4
metadata
license: apache-2.0
datasets:
  - pongjin/en_corpora_parliament_processed
language:
  - en
pipeline_tag: automatic-speech-recognition
metrics:
  - wer

This model has been referred to the following links

  1. https://huggingface.co/blog/wav2vec2-with-ngram
  2. https://huggingface.co/blog/fine-tune-wav2vec2-english

Thanks to patrickvonplaten Patrick von Platen

ํ•ด๋‹น ๋ชจ๋ธ์€ ํ•œ๊ตญ์ธ์˜ ์˜์–ด ๋ฐœํ™” ์ธ์‹ ์„ฑ๋Šฅ ๊ฐœ์„ ์„ ์œ„ํ•ด facebook/wav2vec2-base-960h ๋กœ ํŒŒ์ธํŠœ๋‹ํ•œ ๋ชจ๋ธ์— KenLM 5-gram ์„ ๋ถ™์ธ ASR + LM ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.

If you want to use LM, you must have kenlm installed https://github.com/kpu/kenlm

pip install https://github.com/kpu/kenlm/archive/master.zip

ํ•™์Šต ๋ฐ์ดํ„ฐ ์ถœ์ฒ˜ : https://aiopen.etri.re.kr/voiceModel

transformers==4.24.0
huggingface_hub==0.13.2

wer epoch batch lr weight_decay warmup_steps
0.32 10 16 1e-4 0.005 1000