mirror of
https://github.com/Helsinki-NLP/OPUS-MT-train.git
synced 2024-09-11 20:27:19 +03:00
.. | ||
README.md |
opus-2019-12-04.zip
- dataset: opus
- model: transformer-align
- pre-processing: normalization + SentencePiece
- download: opus-2019-12-04.zip
- test set translations: opus-2019-12-04.test.txt
- test set scores: opus-2019-12-04.eval.txt
Benchmarks
testset | BLEU | chr-F |
---|---|---|
Tatoeba.de.fi | 40.1 | 0.624 |
goethe-2019-11-15.zip
- dataset: opus+goethe
- model: transformer
- pre-processing: normalization + tokenization + BPE
- download: goethe-2019-11-15.zip
- info: trained on OPUS and fine-tuned for 6 epochs on data from the Goethe Institute
Benchmarks
testset | BLEU | chr-F |
---|---|---|
goethe.de.fi | 39.26 |
goethe-2020-01-07.zip
- dataset: opus+goethe
- model: transformer
- pre-processing: normalization + tokenization + BPE
- download: goethe-2019-11-15.zip
- info: trained on OPUS and fine-tuned for 3 epochs on data from the Goethe Institute without duplicates
Benchmarks
testset | BLEU | chr-F |
---|---|---|
goethe.de.fi | 38.57 |
opus-2020-01-08.zip
- dataset: opus
- model: transformer-align
- pre-processing: normalization + SentencePiece
- download: opus-2020-01-08.zip
- test set translations: opus-2020-01-08.test.txt
- test set scores: opus-2020-01-08.eval.txt
Benchmarks
testset | BLEU | chr-F |
---|---|---|
Tatoeba.de.fi | 40.0 | 0.628 |