Joerg Tiedemann
|
54526e1339
|
more sp models
|
2022-06-02 00:30:13 +03:00 |
|
Joerg Tiedemann
|
d4aca060c7
|
latest spm models online
|
2022-05-28 00:17:52 +03:00 |
|
Joerg Tiedemann
|
1f2f928a67
|
elg updates
|
2022-03-20 21:15:49 +02:00 |
|
Joerg Tiedemann
|
4cc0ccb18d
|
fixing many bugs with tatoeba model recipes
|
2022-02-07 20:55:31 +02:00 |
|
Joerg Tiedemann
|
176977df32
|
released langpairs in tatoeba
|
2022-02-05 13:40:55 +02:00 |
|
Joerg Tiedemann
|
ed1bde6ac5
|
fix in reverse-data
|
2022-01-06 23:48:34 +02:00 |
|
Joerg Tiedemann
|
08ed366914
|
allas storage commands
|
2021-11-04 09:57:48 +02:00 |
|
Joerg Tiedemann
|
71174062d6
|
fix vocab yaml script added
|
2021-11-02 18:38:28 +02:00 |
|
Joerg Tiedemann
|
f4ffae653b
|
create valid yaml files from vocab
|
2021-10-05 17:43:46 +03:00 |
|
Joerg Tiedemann
|
6db5b3b716
|
fixed a problem with langlabel files
|
2021-09-13 00:07:51 +03:00 |
|
Joerg Tiedemann
|
72e1bcb7ec
|
fixed multithreading issues with data recipe
|
2021-08-09 22:19:05 +03:00 |
|
Joerg Tiedemann
|
200662863e
|
added recipes for tatoeba models other than English
|
2021-05-04 08:49:16 +03:00 |
|
Joerg Tiedemann
|
cde8f0d0af
|
balance dev data in multiligual models and a bug fixed in preprocess script
|
2021-03-30 00:00:28 +03:00 |
|
Joerg Tiedemann
|
bb39c060c0
|
added recipe for refreshing release info
|
2021-03-13 00:29:23 +02:00 |
|
Joerg Tiedemann
|
6537fdea13
|
backtranslation for Tatoeba data
|
2021-02-25 17:17:21 +02:00 |
|
Joerg Tiedemann
|
53c5680268
|
fixed tatoeba group recipes
|
2021-02-16 20:36:00 +02:00 |
|
Joerg Tiedemann
|
3c6793045b
|
moved results table generation for tatoeba models
|
2021-02-15 20:35:29 +02:00 |
|
Jörg Tiedemann
|
666b2b8462
|
internal sentence piece models in transformers
|
2020-09-12 16:16:01 +03:00 |
|
Jörg Tiedemann
|
ddafb43d66
|
removed dependence on moses tools in preprocessing script for released spm packages
|
2020-09-12 14:42:10 +03:00 |
|
Joerg Tiedemann
|
4e18da6e4c
|
fix chinese/korean/japanese language codes
|
2020-06-17 22:02:39 +03:00 |
|
Joerg Tiedemann
|
e141772b34
|
fixed multilingual tatoeba evaluation
|
2020-06-11 00:54:40 +03:00 |
|
Joerg Tiedemann
|
e07eb14984
|
fit-data-size fixed
|
2020-06-08 14:14:55 +03:00 |
|
Joerg Tiedemann
|
6cb9959e82
|
tatoeba challenge model scripts updated
|
2020-06-06 20:49:54 +03:00 |
|
Joerg Tiedemann
|
edaf361803
|
multilingual tatoeba models and some documentation added
|
2020-06-03 15:39:18 +03:00 |
|
Joerg Tiedemann
|
eeaef7768c
|
tatoeba models added
|
2020-06-03 00:16:21 +03:00 |
|
Joerg Tiedemann
|
b01b4f22c3
|
pivot-based translations added
|
2020-05-17 22:43:05 +03:00 |
|
Joerg Tiedemann
|
d49a791cc7
|
some new models
|
2020-04-11 14:50:39 +03:00 |
|
Joerg Tiedemann
|
3f57e4f873
|
lang specific cleanup scripts are now possible
|
2020-02-29 18:23:08 +02:00 |
|
Joerg Tiedemann
|
0ff0e625d5
|
train text simplification model
|
2020-02-29 17:59:27 +02:00 |
|