Commit Graph

46 Commits

Author SHA1 Message Date
Joerg Tiedemann
dee0f6b951 elg project stuff and changes done on mahti 2022-03-17 21:02:11 +02:00
Joerg Tiedemann
8f92bc84c7 Merge branch 'master' of github.com:Helsinki-NLP/OPUS-MT-train 2021-12-10 19:21:26 +02:00
Joerg Tiedemann
17ecdf2719 cleanup 2021-12-10 19:19:51 +02:00
Joerg Tiedemann
e2c2d3808e fixed renaming of LOADMODS to LOAD_ENV 2021-12-10 09:36:28 +02:00
Joerg Tiedemann
6537fdea13 backtranslation for Tatoeba data 2021-02-25 17:17:21 +02:00
Joerg Tiedemann
f9a44bdb99 merged 2020-09-18 23:08:56 +03:00
Joerg Tiedemann
87a5354de5 changes to tatoeba recipes 2020-09-18 23:05:46 +03:00
Jörg Tiedemann
3367ad2e34 documentation of low-resource languages 2020-09-06 23:56:16 +03:00
Jörg Tiedemann
ad828c3124 started tutorial and fixes to backtranslate makefile 2020-09-05 00:16:22 +03:00
Tiedemann Jörg
d11f74ce41 added bpe submodule 2020-09-04 15:34:20 +03:00
Tiedemann
2332732577 make compatible with mac osx and include submodules for required tools 2020-09-02 15:52:34 +03:00
Joerg Tiedemann
639bd2adda started documentation of project specific models 2020-08-28 15:51:37 +03:00
Joerg Tiedemann
e31550a3ad enabled fetching OPUS data instead of reading local files if necessary 2020-08-28 10:53:11 +03:00
Joerg Tiedemann
d7252e32b7 tatoeba monolingual data 2020-08-05 00:00:24 +03:00
Joerg Tiedemann
c9fcb7f35d tatoeba langgroup models 2020-08-02 11:38:42 +03:00
Joerg Tiedemann
1b913277b3 tatoeba language group models with various sample sizews 2020-07-25 22:52:33 +03:00
Joerg Tiedemann
ec43fcd30a fixed a bug in eval-testsets 2020-05-29 14:43:36 +03:00
Joerg Tiedemann
716d7b52c1 fixed testset names and backtranslation sentence splitting 2020-05-20 23:19:48 +03:00
Joerg Tiedemann
04d72ff8ed fixes with pivoting 2020-05-18 21:36:53 +03:00
Joerg Tiedemann
b01b4f22c3 pivot-based translations added 2020-05-17 22:43:05 +03:00
Joerg Tiedemann
1246bcd271 added some size info to train data README 2020-05-17 01:21:57 +03:00
Joerg Tiedemann
37a83a9eba information about license for pre-trained models added 2020-05-15 20:01:07 +03:00
Joerg Tiedemann
7ef908dcd7 translate with backtranslations 2020-05-13 00:41:07 +03:00
Joerg Tiedemann
d4b71e0261 fixed includes in backtranslate/evaluate/finetune makefiles 2020-05-07 22:51:31 +03:00
Joerg Tiedemann
3f292fd7b8 all models 2020-04-27 13:56:40 +03:00
Joerg Tiedemann
9ba784419e updates celtic model 2020-04-24 13:30:16 +03:00
Joerg Tiedemann
ea2b283ad4 new sami model 2020-04-19 19:48:01 +03:00
Joerg Tiedemann
58f042d127 add local config parameters 2020-04-18 21:40:52 +03:00
Joerg Tiedemann
d49a791cc7 some new models 2020-04-11 14:50:39 +03:00
Joerg Tiedemann
f508bb4df6 use only latest backtranslation 2020-04-01 20:18:06 +03:00
Joerg Tiedemann
24fd67cc99 sami model update 2020-03-29 11:21:39 +03:00
Joerg Tiedemann
08c17af2ee sami 2020-03-27 22:30:51 +02:00
Joerg Tiedemann
f4fdb304a5 sami language task added 2020-03-26 22:50:21 +02:00
Joerg Tiedemann
93f03a1fe7 backtranslation data for multilingual models 2020-03-24 23:47:57 +02:00
Joerg Tiedemann
87551ac387 target for extracting text from all wikis 2020-03-20 15:32:29 +02:00
Joerg Tiedemann
fd6db4e93a new marian and fixed path to mono lang check in backtranslation 2020-03-19 20:42:27 +02:00
Joerg Tiedemann
0e893a06e0 finetuning for fi-en 2020-02-14 00:12:55 +02:00
Joerg Tiedemann
870804f4ee finetuning anc backtranslations 2020-02-11 23:20:11 +02:00
Joerg Tiedemann
ee8c27e3db removed punctuation normalisation and added language filter 2020-02-08 00:19:21 +02:00
Joerg Tiedemann
106b06aa4c avoid uploading linked dist files 2020-01-29 21:46:18 +02:00
Joerg Tiedemann
bb5532ab71 new models 2020-01-24 13:39:21 +02:00
Joerg Tiedemann
bb98f03df5 backtranslate bugfix 2020-01-22 13:33:28 +02:00
Joerg Tiedemann
f32ddd06ce allwikis 2020-01-20 23:37:40 +02:00
Joerg Tiedemann
2887762198 bugfixing and optimising makefiles 2020-01-19 19:00:13 +02:00
Joerg Tiedemann
e2ed3d85d1 finetuning and backtranslation 2020-01-12 01:10:53 +02:00
Joerg Tiedemann
fe16a0c4dd backtranslation scripts 2020-01-11 00:29:06 +02:00