Commit Graph

46 Commits

Author SHA1 Message Date
Joerg Tiedemann
4cc0ccb18d fixing many bugs with tatoeba model recipes 2022-02-07 20:55:31 +02:00
Joerg Tiedemann
bc54b403cd student model quantisation finetuning added 2022-01-18 14:41:17 +02:00
Joerg Tiedemann
17ecdf2719 cleanup 2021-12-10 19:19:51 +02:00
Joerg Tiedemann
a941317cef some cleanup 2021-12-01 00:39:00 +02:00
Joerg Tiedemann
9e6a9f1f86 ome cleanup and adjustments for mahti 2021-11-12 11:17:40 +02:00
Joerg Tiedemann
0017e83e6b renamed variable for loading environment 2021-11-11 19:21:35 +02:00
Joerg Tiedemann
08ed366914 allas storage commands 2021-11-04 09:57:48 +02:00
Joerg Tiedemann
72e1bcb7ec fixed multithreading issues with data recipe 2021-08-09 22:19:05 +03:00
Joerg Tiedemann
3da042c3cf latest changes to makefiles 2021-08-09 14:06:10 +03:00
Joerg Tiedemann
bb39c060c0 added recipe for refreshing release info 2021-03-13 00:29:23 +02:00
Joerg Tiedemann
4305ad01b9 tatoeba model result lists added 2021-01-18 14:53:54 +02:00
Traubert
21433867ab If we include lib/config.mk before prerequisites are made, make fails
Therefore, the install target needs to come first (though this is not
necessarily sufficient).
2020-09-24 12:25:24 +03:00
Jörg Tiedemann
c2798e9758 plain text vocab files from spm models 2020-09-13 22:17:21 +03:00
Jörg Tiedemann
c0cb356417 added acknowledgements 2020-09-12 12:01:02 +03:00
Jörg Tiedemann
16eef8e45d moved project makefiles to lib/projects 2020-09-10 12:12:44 +03:00
Jörg Tiedemann
ad828c3124 started tutorial and fixes to backtranslate makefile 2020-09-05 00:16:22 +03:00
Tiedemann
1435b7849a moved allas recipes to a different makefile 2020-09-02 16:35:35 +03:00
Joerg Tiedemann
cde8e65a5b new buckets for fetch and store, uncompressed now and follow-links 2020-09-01 16:15:01 +03:00
Joerg Tiedemann
e31550a3ad enabled fetching OPUS data instead of reading local files if necessary 2020-08-28 10:53:11 +03:00
Joerg Tiedemann
308bf647f0 fetchdata src and dest dir 2020-08-25 22:07:32 +03:00
Joerg Tiedemann
1e23566f30 fixed fetch and store from and to allas 2020-08-23 10:08:06 +03:00
Joerg Tiedemann
0e27198048 store and fetch work data 2020-08-22 23:51:37 +03:00
Joerg Tiedemann
e141772b34 fixed multilingual tatoeba evaluation 2020-06-11 00:54:40 +03:00
Joerg Tiedemann
edaf361803 multilingual tatoeba models and some documentation added 2020-06-03 15:39:18 +03:00
Joerg Tiedemann
eeaef7768c tatoeba models added 2020-06-03 00:16:21 +03:00
Joerg Tiedemann
ec43fcd30a fixed a bug in eval-testsets 2020-05-29 14:43:36 +03:00
Joerg Tiedemann
d0a217cf40 wikimatrix models added 2020-05-21 20:51:38 +03:00
Joerg Tiedemann
716d7b52c1 fixed testset names and backtranslation sentence splitting 2020-05-20 23:19:48 +03:00
Joerg Tiedemann
b01b4f22c3 pivot-based translations added 2020-05-17 22:43:05 +03:00
Joerg Tiedemann
1246bcd271 added some size info to train data README 2020-05-17 01:21:57 +03:00
Joerg Tiedemann
198c779e91 make cascade job for train + backtranslate + retrain 2020-05-15 20:59:05 +03:00
Joerg Tiedemann
37a83a9eba information about license for pre-trained models added 2020-05-15 20:01:07 +03:00
Joerg Tiedemann
7ef908dcd7 translate with backtranslations 2020-05-13 00:41:07 +03:00
Joerg Tiedemann
e4455e510a a bit more info added for data sets 2020-05-09 22:33:33 +03:00
Joerg Tiedemann
5404f515aa new makefile structure 2020-05-03 21:46:30 +03:00
Joerg Tiedemann
6b8e69269a better division of the massive tasks makefile 2020-05-03 20:27:55 +03:00
Joerg Tiedemann
294175f0fe fixed sami models 2020-04-18 01:05:02 +03:00
Joerg Tiedemann
fd6db4e93a new marian and fixed path to mono lang check in backtranslation 2020-03-19 20:42:27 +02:00
Joerg Tiedemann
d13a9461f0 simplification evaluation with BLEU 2020-03-01 00:25:05 +02:00
Joerg Tiedemann
44182291dc skip word alignment if not necessary 2020-02-25 09:00:24 +02:00
Joerg Tiedemann
f32ddd06ce allwikis 2020-01-20 23:37:40 +02:00
Joerg Tiedemann
f97bc1895c fixed model names 2020-01-20 00:37:24 +02:00
Joerg Tiedemann
2887762198 bugfixing and optimising makefiles 2020-01-19 19:00:13 +02:00
Joerg Tiedemann
596cae8922 train with backtranslations 2020-01-18 20:37:01 +02:00
Joerg Tiedemann
58690950b0 all models = opus 2020-01-15 23:18:07 +02:00
Joerg Tiedemann
b36d9a3e22 initial import 2020-01-10 16:45:42 +02:00