Joerg Tiedemann
|
dee0f6b951
|
elg project stuff and changes done on mahti
|
2022-03-17 21:02:11 +02:00 |
|
Joerg Tiedemann
|
8f92bc84c7
|
Merge branch 'master' of github.com:Helsinki-NLP/OPUS-MT-train
|
2021-12-10 19:21:26 +02:00 |
|
Joerg Tiedemann
|
17ecdf2719
|
cleanup
|
2021-12-10 19:19:51 +02:00 |
|
Joerg Tiedemann
|
e2c2d3808e
|
fixed renaming of LOADMODS to LOAD_ENV
|
2021-12-10 09:36:28 +02:00 |
|
Joerg Tiedemann
|
6537fdea13
|
backtranslation for Tatoeba data
|
2021-02-25 17:17:21 +02:00 |
|
Joerg Tiedemann
|
f9a44bdb99
|
merged
|
2020-09-18 23:08:56 +03:00 |
|
Joerg Tiedemann
|
87a5354de5
|
changes to tatoeba recipes
|
2020-09-18 23:05:46 +03:00 |
|
Jörg Tiedemann
|
3367ad2e34
|
documentation of low-resource languages
|
2020-09-06 23:56:16 +03:00 |
|
Jörg Tiedemann
|
ad828c3124
|
started tutorial and fixes to backtranslate makefile
|
2020-09-05 00:16:22 +03:00 |
|
Tiedemann Jörg
|
d11f74ce41
|
added bpe submodule
|
2020-09-04 15:34:20 +03:00 |
|
Tiedemann
|
2332732577
|
make compatible with mac osx and include submodules for required tools
|
2020-09-02 15:52:34 +03:00 |
|
Joerg Tiedemann
|
639bd2adda
|
started documentation of project specific models
|
2020-08-28 15:51:37 +03:00 |
|
Joerg Tiedemann
|
e31550a3ad
|
enabled fetching OPUS data instead of reading local files if necessary
|
2020-08-28 10:53:11 +03:00 |
|
Joerg Tiedemann
|
d7252e32b7
|
tatoeba monolingual data
|
2020-08-05 00:00:24 +03:00 |
|
Joerg Tiedemann
|
c9fcb7f35d
|
tatoeba langgroup models
|
2020-08-02 11:38:42 +03:00 |
|
Joerg Tiedemann
|
1b913277b3
|
tatoeba language group models with various sample sizews
|
2020-07-25 22:52:33 +03:00 |
|
Joerg Tiedemann
|
ec43fcd30a
|
fixed a bug in eval-testsets
|
2020-05-29 14:43:36 +03:00 |
|
Joerg Tiedemann
|
716d7b52c1
|
fixed testset names and backtranslation sentence splitting
|
2020-05-20 23:19:48 +03:00 |
|
Joerg Tiedemann
|
04d72ff8ed
|
fixes with pivoting
|
2020-05-18 21:36:53 +03:00 |
|
Joerg Tiedemann
|
b01b4f22c3
|
pivot-based translations added
|
2020-05-17 22:43:05 +03:00 |
|
Joerg Tiedemann
|
1246bcd271
|
added some size info to train data README
|
2020-05-17 01:21:57 +03:00 |
|
Joerg Tiedemann
|
37a83a9eba
|
information about license for pre-trained models added
|
2020-05-15 20:01:07 +03:00 |
|
Joerg Tiedemann
|
7ef908dcd7
|
translate with backtranslations
|
2020-05-13 00:41:07 +03:00 |
|
Joerg Tiedemann
|
d4b71e0261
|
fixed includes in backtranslate/evaluate/finetune makefiles
|
2020-05-07 22:51:31 +03:00 |
|
Joerg Tiedemann
|
3f292fd7b8
|
all models
|
2020-04-27 13:56:40 +03:00 |
|
Joerg Tiedemann
|
9ba784419e
|
updates celtic model
|
2020-04-24 13:30:16 +03:00 |
|
Joerg Tiedemann
|
ea2b283ad4
|
new sami model
|
2020-04-19 19:48:01 +03:00 |
|
Joerg Tiedemann
|
58f042d127
|
add local config parameters
|
2020-04-18 21:40:52 +03:00 |
|
Joerg Tiedemann
|
d49a791cc7
|
some new models
|
2020-04-11 14:50:39 +03:00 |
|
Joerg Tiedemann
|
f508bb4df6
|
use only latest backtranslation
|
2020-04-01 20:18:06 +03:00 |
|
Joerg Tiedemann
|
24fd67cc99
|
sami model update
|
2020-03-29 11:21:39 +03:00 |
|
Joerg Tiedemann
|
08c17af2ee
|
sami
|
2020-03-27 22:30:51 +02:00 |
|
Joerg Tiedemann
|
f4fdb304a5
|
sami language task added
|
2020-03-26 22:50:21 +02:00 |
|
Joerg Tiedemann
|
93f03a1fe7
|
backtranslation data for multilingual models
|
2020-03-24 23:47:57 +02:00 |
|
Joerg Tiedemann
|
87551ac387
|
target for extracting text from all wikis
|
2020-03-20 15:32:29 +02:00 |
|
Joerg Tiedemann
|
fd6db4e93a
|
new marian and fixed path to mono lang check in backtranslation
|
2020-03-19 20:42:27 +02:00 |
|
Joerg Tiedemann
|
0e893a06e0
|
finetuning for fi-en
|
2020-02-14 00:12:55 +02:00 |
|
Joerg Tiedemann
|
870804f4ee
|
finetuning anc backtranslations
|
2020-02-11 23:20:11 +02:00 |
|
Joerg Tiedemann
|
ee8c27e3db
|
removed punctuation normalisation and added language filter
|
2020-02-08 00:19:21 +02:00 |
|
Joerg Tiedemann
|
106b06aa4c
|
avoid uploading linked dist files
|
2020-01-29 21:46:18 +02:00 |
|
Joerg Tiedemann
|
bb5532ab71
|
new models
|
2020-01-24 13:39:21 +02:00 |
|
Joerg Tiedemann
|
bb98f03df5
|
backtranslate bugfix
|
2020-01-22 13:33:28 +02:00 |
|
Joerg Tiedemann
|
f32ddd06ce
|
allwikis
|
2020-01-20 23:37:40 +02:00 |
|
Joerg Tiedemann
|
2887762198
|
bugfixing and optimising makefiles
|
2020-01-19 19:00:13 +02:00 |
|
Joerg Tiedemann
|
e2ed3d85d1
|
finetuning and backtranslation
|
2020-01-12 01:10:53 +02:00 |
|
Joerg Tiedemann
|
fe16a0c4dd
|
backtranslation scripts
|
2020-01-11 00:29:06 +02:00 |
|