codemurt commited on
Commit
c3393a1
·
verified ·
1 Parent(s): dcbc917

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -3,8 +3,6 @@ language:
3
  - udm
4
  ---
5
 
6
- Inspired by https://huggingface.co/slone/bert-tiny-char-ctc-bak-denoise
7
-
8
  # bert-tiny-char-ctc-udm-denoise
9
 
10
  This is a tiny BERT model for Udmurt, intended for fixing OCR errors.
@@ -28,4 +26,6 @@ fix_text("кыче мои солы оскылй!")
28
  # Кыӵе мон солы оскылӥ!
29
  ```
30
 
31
- It was trained on a parallel corpus (corrupted + fixed sentence) with CTC loss. On our test dataset, it reduces OCR errors by 50%.
 
 
 
3
  - udm
4
  ---
5
 
 
 
6
  # bert-tiny-char-ctc-udm-denoise
7
 
8
  This is a tiny BERT model for Udmurt, intended for fixing OCR errors.
 
26
  # Кыӵе мон солы оскылӥ!
27
  ```
28
 
29
+ It was trained on a parallel corpus (corrupted + fixed sentence) with CTC loss. On our test dataset, it reduces OCR errors by 50%.
30
+
31
+ Inspired by https://huggingface.co/slone/bert-tiny-char-ctc-bak-denoise