# opus-2019-12-18.zip * dataset: opus * model: transformer-align * pre-processing: normalization + SentencePiece * download: [opus-2019-12-18.zip](https://object.pouta.csc.fi/OPUS-MT-models/hy-en/opus-2019-12-18.zip) * test set translations: [opus-2019-12-18.test.txt](https://object.pouta.csc.fi/OPUS-MT-models/hy-en/opus-2019-12-18.test.txt) * test set scores: [opus-2019-12-18.eval.txt](https://object.pouta.csc.fi/OPUS-MT-models/hy-en/opus-2019-12-18.eval.txt) ## Benchmarks | testset | BLEU | chr-F | |-----------------------|-------|-------| | Tatoeba.hy.en | 29.5 | 0.466 | # opus+bt-2020-05-23.zip * dataset: opus+bt * model: transformer-align * source language(s): hy * target language(s): en * model: transformer-align * pre-processing: normalization + SentencePiece (spm4k,spm4k) * download: [opus+bt-2020-05-23.zip](https://object.pouta.csc.fi/OPUS-MT-models/hy-en/opus+bt-2020-05-23.zip) * test set translations: [opus+bt-2020-05-23.test.txt](https://object.pouta.csc.fi/OPUS-MT-models/hy-en/opus+bt-2020-05-23.test.txt) * test set scores: [opus+bt-2020-05-23.eval.txt](https://object.pouta.csc.fi/OPUS-MT-models/hy-en/opus+bt-2020-05-23.eval.txt) ## Training data: opus+bt * hy-en: bible-uedin (12963) GNOME (128) JW300 (388439) KDE4 (270) OpenSubtitles (3204) QED (36597) Ubuntu (530) * hy-en: total size = 442131 * total size (opus+bt): 441909 ## Validation data * en-hy: Tatoeba * devset = top 1000 lines of Tatoeba.src.shuffled! * testset = next 1026 lines of Tatoeba.src.shuffled! * remaining lines are added to traindata ## Benchmarks | testset | BLEU | chr-F | |-----------------------|-------|-------| | Tatoeba.hy.en | 36.8 | 0.524 |