Commit Graph

28 Commits

Author SHA1 Message Date
Traubert
1da54c4155 Fix typo 2020-09-15 15:02:20 +03:00
Jörg Tiedemann
1a6e29275d dev data is now uniq to avoid overlaps with test data 2020-09-09 23:21:07 +03:00
Jörg Tiedemann
a47c292152 pivoting and documentation 2020-09-05 22:19:00 +03:00
Jörg Tiedemann
ad828c3124 started tutorial and fixes to backtranslate makefile 2020-09-05 00:16:22 +03:00
Tiedemann
96eaad2d05 added possibility to fetch moses file from ObjectStore (instead of reading with opus_read) 2020-09-03 22:04:44 +03:00
Tiedemann
2332732577 make compatible with mac osx and include submodules for required tools 2020-09-02 15:52:34 +03:00
Joerg Tiedemann
2c04e48dbe fixed an important bug in data merging 2020-08-28 11:52:46 +03:00
Joerg Tiedemann
e31550a3ad enabled fetching OPUS data instead of reading local files if necessary 2020-08-28 10:53:11 +03:00
Joerg Tiedemann
94eeec13eb take away dependence on local OPUS files for finding data 2020-08-27 22:36:50 +03:00
Joerg Tiedemann
4c35456038 cleanup in data makefile 2020-08-26 00:44:02 +03:00
Joerg Tiedemann
1b913277b3 tatoeba language group models with various sample sizews 2020-07-25 22:52:33 +03:00
Joerg Tiedemann
e141772b34 fixed multilingual tatoeba evaluation 2020-06-11 00:54:40 +03:00
Joerg Tiedemann
b7691875c2 tatoeba models now operational 2020-06-09 00:12:16 +03:00
Joerg Tiedemann
035cca7c1a fixed tatoeba model scripts 2020-06-08 17:24:39 +03:00
Joerg Tiedemann
e07eb14984 fit-data-size fixed 2020-06-08 14:14:55 +03:00
Joerg Tiedemann
6cb9959e82 tatoeba challenge model scripts updated 2020-06-06 20:49:54 +03:00
Joerg Tiedemann
edaf361803 multilingual tatoeba models and some documentation added 2020-06-03 15:39:18 +03:00
Joerg Tiedemann
eeaef7768c tatoeba models added 2020-06-03 00:16:21 +03:00
Joerg Tiedemann
ec43fcd30a fixed a bug in eval-testsets 2020-05-29 14:43:36 +03:00
Joerg Tiedemann
716d7b52c1 fixed testset names and backtranslation sentence splitting 2020-05-20 23:19:48 +03:00
Joerg Tiedemann
04d72ff8ed fixes with pivoting 2020-05-18 21:36:53 +03:00
Joerg Tiedemann
b01b4f22c3 pivot-based translations added 2020-05-17 22:43:05 +03:00
Joerg Tiedemann
1246bcd271 added some size info to train data README 2020-05-17 01:21:57 +03:00
Joerg Tiedemann
cb3b77573e make it possible to exclude certain data sets 2020-05-14 10:36:46 +03:00
Joerg Tiedemann
7ef908dcd7 translate with backtranslations 2020-05-13 00:41:07 +03:00
Joerg Tiedemann
e4455e510a a bit more info added for data sets 2020-05-09 22:33:33 +03:00
Joerg Tiedemann
d4b71e0261 fixed includes in backtranslate/evaluate/finetune makefiles 2020-05-07 22:51:31 +03:00
Joerg Tiedemann
6b8e69269a better division of the massive tasks makefile 2020-05-03 20:27:55 +03:00