mirror of
https://github.com/Helsinki-NLP/OPUS-MT-train.git
synced 2024-12-02 20:48:08 +03:00
.. | ||
README.md |
opus+techiaith-2020-03-30.zip
- dataset: opus+techiaith
- model: transformer-align
- pre-processing: normalization + SentencePiece
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - download: opus+techiaith-2020-03-30.zip
- test set translations: opus+techiaith-2020-03-30.test.txt
- test set scores: opus+techiaith-2020-03-30.eval.txt
Benchmarks
testset | BLEU | chr-F |
---|---|---|
Tatoeba.en.ga | 18.6 | 0.323 |
opus+techiaith+bt-2020-04-03.zip
- dataset: opus+techiaith+bt
- model: transformer-align
- pre-processing: normalization + SentencePiece
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - download: opus+techiaith+bt-2020-04-03.zip
- test set translations: opus+techiaith+bt-2020-04-03.test.txt
- test set scores: opus+techiaith+bt-2020-04-03.eval.txt
Benchmarks
testset | BLEU | chr-F |
---|---|---|
Tatoeba.en.ga | 21.8 | 0.383 |
opus+techiaith+bt-2020-04-11.zip
- dataset: opus+techiaith+bt
- model: transformer-align
- pre-processing: normalization + SentencePiece
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - download: opus+techiaith+bt-2020-04-11.zip
- test set translations: opus+techiaith+bt-2020-04-11.test.txt
- test set scores: opus+techiaith+bt-2020-04-11.eval.txt
Benchmarks
testset | BLEU | chr-F |
---|---|---|
Tatoeba.en.ga | 22.1 | 0.385 |
opus+techiaith+bt-2020-04-24.zip
- dataset: opus+techiaith+bt
- model: transformer-align
- pre-processing: normalization + SentencePiece
- a sentence initial language token is required in the form of
>>id<<
(id = valid target language ID) - download: opus+techiaith+bt-2020-04-24.zip
- test set translations: opus+techiaith+bt-2020-04-24.test.txt
- test set scores: opus+techiaith+bt-2020-04-24.eval.txt
Benchmarks
testset | BLEU | chr-F |
---|---|---|
Tatoeba.en.ga | 22.8 | 0.404 |