Commit Graph

28 Commits

Author SHA1 Message Date
Jörg Tiedemann
a61cf48443 add option to skip sentence piecce vocabs but use marian_vocab instead 2020-09-16 19:33:19 +03:00
Jörg Tiedemann
666b2b8462 internal sentence piece models in transformers 2020-09-12 16:16:01 +03:00
Jörg Tiedemann
c0cb356417 added acknowledgements 2020-09-12 12:01:02 +03:00
Jörg Tiedemann
16eef8e45d moved project makefiles to lib/projects 2020-09-10 12:12:44 +03:00
Jörg Tiedemann
1a6e29275d dev data is now uniq to avoid overlaps with test data 2020-09-09 23:21:07 +03:00
Jörg Tiedemann
3735af4ec1 more documentation 2020-09-07 23:00:01 +03:00
Jörg Tiedemann
3367ad2e34 documentation of low-resource languages 2020-09-06 23:56:16 +03:00
Jörg Tiedemann
a47c292152 pivoting and documentation 2020-09-05 22:19:00 +03:00
Jörg Tiedemann
ad828c3124 started tutorial and fixes to backtranslate makefile 2020-09-05 00:16:22 +03:00
Tiedemann
96eaad2d05 added possibility to fetch moses file from ObjectStore (instead of reading with opus_read) 2020-09-03 22:04:44 +03:00
Tiedemann
7e97f4bc19 setup and installation information added 2020-09-02 16:49:22 +03:00
Tiedemann
1435b7849a moved allas recipes to a different makefile 2020-09-02 16:35:35 +03:00
Joerg Tiedemann
cde8e65a5b new buckets for fetch and store, uncompressed now and follow-links 2020-09-01 16:15:01 +03:00
Joerg Tiedemann
1a279ce6f1 started documentation of project specific models 2020-08-28 15:53:23 +03:00
Joerg Tiedemann
639bd2adda started documentation of project specific models 2020-08-28 15:51:37 +03:00
Joerg Tiedemann
e31550a3ad enabled fetching OPUS data instead of reading local files if necessary 2020-08-28 10:53:11 +03:00
Joerg Tiedemann
94eeec13eb take away dependence on local OPUS files for finding data 2020-08-27 22:36:50 +03:00
Joerg Tiedemann
831ee89f76 fixed bug in env.mk 2020-08-26 22:18:12 +03:00
Joerg Tiedemann
596dd993a5 more documentation 2020-08-26 21:45:03 +03:00
Joerg Tiedemann
a8b54f5311 some info about training added 2020-08-26 15:12:38 +03:00
Joerg Tiedemann
2f8a37cc92 more details about data compilation added 2020-08-26 14:31:50 +03:00
Joerg Tiedemann
dac6070069 started some more documentation 2020-08-26 09:59:24 +03:00
Joerg Tiedemann
5493aeddb4 fixed a problem with lang group targets 2020-07-14 21:40:49 +03:00
Joerg Tiedemann
ec6d7c7142 tatoeba langgroups 2020-07-04 23:37:39 +03:00
Joerg Tiedemann
7df91a9eaa language group jobs with some more documentation 2020-06-29 12:26:45 +03:00
Joerg Tiedemann
4e18da6e4c fix chinese/korean/japanese language codes 2020-06-17 22:02:39 +03:00
Joerg Tiedemann
6cb9959e82 tatoeba challenge model scripts updated 2020-06-06 20:49:54 +03:00
Joerg Tiedemann
edaf361803 multilingual tatoeba models and some documentation added 2020-06-03 15:39:18 +03:00