ycffm/MBart50-legalform-remover


ycffm committed on Nov 29, 2024

Commit 0a7455a · verified · 1 Parent(s): 9f38308

Update README.md

Files changed (1)

README.md +5 -5


@@ -118,15 +118,15 @@ You can use the code displayed above, or download the files from the directory a
 ### Training Data
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
-real_companies_1            26764 --real company names in EU languages, Russian and various other languages usually in LATIN format.
-real_companies_2            24790 --real company names in EU languages, Russian and various other languages usually in LATIN format.
-real_companies_arabic        2317 --real company names in Arabic
-real_companies_ea           20328 --real company names in Chinese, Korean, Japanese
-synthetic_companies_eu      20000 --synthetic company names in EU languages
 The entire dataset was split in 8-1-1 as training-validation-testing set
 A typical data entry is

 ### Training Data
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+real_companies_1             26764 --real company names in EU languages, Russian and various other languages usually in LATIN format.
+real_companies_2             24790 --real company names in EU languages, Russian and various other languages usually in LATIN format.
+real_companies_arabic         2317 --real company names in Arabic
+real_companies_ea            20328 --real company names in Chinese, Korean, Japanese
+synthetic_companies_eu       20000 --synthetic company names in EU languages
 The entire dataset was split in 8-1-1 as training-validation-testing set
 A typical data entry is