--- license: cc-by-nc-sa-4.0 language: - en - fi pipeline_tag: translation --- # Opus Tatoeba | English -> Finnish * dataset: opus * model: transformer-align * source language(s): eng * target language(s): fin * model: transformer-align * pre-processing: normalization + SentencePiece (spm32k,spm32k) * download: [opus-2021-02-19.zip](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-fin/opus-2021-02-19.zip) * test set translations: [opus-2021-02-19.test.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-fin/opus-2021-02-19.test.txt) * test set scores: [opus-2021-02-19.eval.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-fin/opus-2021-02-19.eval.txt) ## Benchmarks | testset | BLEU | chr-F | #sent | #words | BP | |---------|-------|-------|-------|--------|----| | newsdev2015-enfi.eng-fin | 21.6 | 0.556 | 1500 | 23375 | 1.000 | | newstest2015-enfi.eng-fin | 23.2 | 0.567 | 1370 | 19968 | 1.000 | | newstest2016-enfi.eng-fin | 24.9 | 0.578 | 3000 | 48116 | 0.986 | | newstest2017-enfi.eng-fin | 27.5 | 0.605 | 3002 | 45718 | 0.996 | | newstest2018-enfi.eng-fin | 18.4 | 0.532 | 3000 | 45475 | 1.000 | | newstest2019-enfi.eng-fin | 23.3 | 0.551 | 1997 | 38369 | 0.966 | | newstestB2016-enfi.eng-fin | 19.7 | 0.542 | 3000 | 45766 | 1.000 | | newstestB2017-enfi.eng-fin | 22.7 | 0.565 | 3002 | 45506 | 1.000 | | Tatoeba-test.eng-fin | 38.7 | 0.629 | 10000 | 60517 | 0.935 |