Commit Graph

26 Commits

Author | SHA1 | Message | Date
Joerg Tiedemann | 4cc0ccb18d | fixing many bugs with tatoeba model recipes | 2022-02-07 20:55:31 +02:00
Joerg Tiedemann | 176977df32 | released langpairs in tatoeba | 2022-02-05 13:40:55 +02:00
Joerg Tiedemann | ed1bde6ac5 | fix in reverse-data | 2022-01-06 23:48:34 +02:00
Joerg Tiedemann | 08ed366914 | allas storage commands | 2021-11-04 09:57:48 +02:00
Joerg Tiedemann | 71174062d6 | fix vocab yaml script added | 2021-11-02 18:38:28 +02:00
Joerg Tiedemann | f4ffae653b | create valid yaml files from vocab | 2021-10-05 17:43:46 +03:00
Joerg Tiedemann | 6db5b3b716 | fixed a problem with langlabel files | 2021-09-13 00:07:51 +03:00
Joerg Tiedemann | 72e1bcb7ec | fixed multithreading issues with data recipe | 2021-08-09 22:19:05 +03:00
Joerg Tiedemann | 200662863e | added recipes for tatoeba models other than English | 2021-05-04 08:49:16 +03:00
Joerg Tiedemann | cde8f0d0af | balance dev data in multiligual models and a bug fixed in preprocess script | 2021-03-30 00:00:28 +03:00
Joerg Tiedemann | bb39c060c0 | added recipe for refreshing release info | 2021-03-13 00:29:23 +02:00
Joerg Tiedemann | 6537fdea13 | backtranslation for Tatoeba data | 2021-02-25 17:17:21 +02:00
Joerg Tiedemann | 53c5680268 | fixed tatoeba group recipes | 2021-02-16 20:36:00 +02:00
Joerg Tiedemann | 3c6793045b | moved results table generation for tatoeba models | 2021-02-15 20:35:29 +02:00
Jörg Tiedemann | 666b2b8462 | internal sentence piece models in transformers | 2020-09-12 16:16:01 +03:00
Jörg Tiedemann | ddafb43d66 | removed dependence on moses tools in preprocessing script for released spm packages | 2020-09-12 14:42:10 +03:00
Joerg Tiedemann | 4e18da6e4c | fix chinese/korean/japanese language codes | 2020-06-17 22:02:39 +03:00
Joerg Tiedemann | e141772b34 | fixed multilingual tatoeba evaluation | 2020-06-11 00:54:40 +03:00
Joerg Tiedemann | e07eb14984 | fit-data-size fixed | 2020-06-08 14:14:55 +03:00
Joerg Tiedemann | 6cb9959e82 | tatoeba challenge model scripts updated | 2020-06-06 20:49:54 +03:00
Joerg Tiedemann | edaf361803 | multilingual tatoeba models and some documentation added | 2020-06-03 15:39:18 +03:00
Joerg Tiedemann | eeaef7768c | tatoeba models added | 2020-06-03 00:16:21 +03:00
Joerg Tiedemann | b01b4f22c3 | pivot-based translations added | 2020-05-17 22:43:05 +03:00
Joerg Tiedemann | d49a791cc7 | some new models | 2020-04-11 14:50:39 +03:00
Joerg Tiedemann | 3f57e4f873 | lang specific cleanup scripts are now possible | 2020-02-29 18:23:08 +02:00
Joerg Tiedemann | 0ff0e625d5 | train text simplification model | 2020-02-29 17:59:27 +02:00