Commit
·
7cfaa21
1
Parent(s):
8ea0c2f
Create README.md
README.md
ADDED
@@ -0,0 +1,12 @@
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- ru
|
4 |
+
---
|
5 |
+
|
6 |
+
# Card for the ruM2M100-1.2B model
|
7 |
+
|
8 |
+
### Summary
|
9 |
+
|
10 |
+
The model corrects spelling errors and typos by normalizing every word in the text to standard Russian orthography.
|
11 |
+
The corrector was trained on top of the [M2M100-1.2B](https://huggingface.co/facebook/m2m100_1.2B) model.
|
12 |
+
The training corpus is a large dataset with “artificial” errors: it was assembled from the Russian-language Wikipedia and transcripts of Russian-language videos, and typos and spelling errors were then introduced into it automatically using the [SAGE](https://github.com/orgs/ai-forever/sage) library.
|
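The idea of building a training corpus by automatically injecting errors into clean text can be illustrated with a minimal sketch. This is not the SAGE API and not the procedure used for this model; the function names `corrupt_word` and `make_pair`, the set of edit operations, and the `rate` parameter are all assumptions chosen for illustration. It only shows how a (noisy source, clean target) pair might be produced from a correct sentence.

```python
import random

# Lowercase Russian alphabet used for random replacements (illustrative choice).
CYRILLIC = "абвгдежзийклмнопрстуфхцчшщъыьэюя"


def corrupt_word(word, rng):
    """Apply one random character-level edit to `word` (hypothetical helper)."""
    if len(word) < 2:
        return word  # too short to corrupt meaningfully
    i = rng.randrange(len(word) - 1)
    op = rng.choice(["swap", "delete", "duplicate", "replace"])
    if op == "swap" and word[i] != word[i + 1]:
        # Transpose two adjacent characters.
        return word[:i] + word[i + 1] + word[i] + word[i + 2:]
    if op == "duplicate":
        # Repeat the character at position i.
        return word[:i] + word[i] + word[i:]
    if op == "replace":
        # Substitute a different letter at position i.
        choices = [c for c in CYRILLIC if c != word[i].lower()]
        return word[:i] + rng.choice(choices) + word[i + 1:]
    # "delete", or a swap of two identical letters: drop the character.
    return word[:i] + word[i + 1:]


def make_pair(text, rate=0.3, seed=0):
    """Return (noisy, clean): each word is corrupted with probability `rate`."""
    rng = random.Random(seed)  # seeded so the corruption is reproducible
    noisy = [
        corrupt_word(w, rng) if rng.random() < rate else w
        for w in text.split()
    ]
    return " ".join(noisy), text


if __name__ == "__main__":
    noisy, clean = make_pair("Модель исправляет орфографические ошибки", rate=0.5)
    print("source:", noisy)
    print("target:", clean)
```

A corrector would then be trained to map the noisy source back to the clean target. Real error generators typically model keyboard layout and the error statistics of human-written text, which this sketch does not attempt.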