OPUS-MT-train/scripts
2023-04-17 23:45:34 +03:00
Name | Last commit message | Last commit date
cleanup | lang specific cleanup scripts are now possible | 2020-02-29 18:23:08 +02:00
evaluate | fixed a problem with langlabel files | 2021-09-13 00:07:51 +03:00
filter | fixed multithreading issues with data recipe | 2021-08-09 22:19:05 +03:00
bitext_filter.pl | fixing many bugs with tatoeba model recipes | 2022-02-07 20:55:31 +02:00
data-sample-sizes.pl | more sp models | 2022-06-02 00:30:13 +03:00
detect_chinese_script.pl | fix chinese/korean/japanese language codes | 2020-06-17 22:02:39 +03:00
fit-data-size.pl | released langpairs in tatoeba | 2022-02-05 13:40:55 +02:00
fix_vocab.py | allas storage commands | 2021-11-04 09:57:48 +02:00
large-context.pl | fixed multilingual tatoeba evaluation | 2020-06-11 00:54:40 +03:00
normalize-scores.py | elg updates | 2022-03-20 21:15:49 +02:00
pivot-bt.pl | added recipe for refreshing release info | 2021-03-13 00:29:23 +02:00
postprocess-bpe.sh | removed dependence on moses tools in preprocessing script for released spm packages | 2020-09-12 14:42:10 +03:00
postprocess-spm.sh | 24x12 transformer model added | 2023-03-20 23:55:58 +02:00
postprocess-txt.sh | internal sentence piece models in transformers | 2020-09-12 16:16:01 +03:00
preprocess-bpe-multi-target.sh | 24x12 transformer model added | 2023-04-17 23:45:34 +03:00
preprocess-bpe.sh | 24x12 transformer model added | 2023-04-17 23:45:34 +03:00
preprocess-spm-multi-target.sh | added recipes for tatoeba models other than English | 2021-05-04 08:49:16 +03:00
preprocess-spm.sh | added recipes for tatoeba models other than English | 2021-05-04 08:49:16 +03:00
preprocess-txt-multi-target.sh | 24x12 transformer model added | 2023-04-17 23:45:34 +03:00
preprocess-txt.sh | 24x12 transformer model added | 2023-04-17 23:45:34 +03:00
readme2yaml.pl | fixed tatoeba group recipes | 2021-02-16 20:36:00 +02:00
verify-wordalign.pl | fixed multilingual tatoeba evaluation | 2020-06-11 00:54:40 +03:00
vocab2yaml.py | create valid yaml files from vocab | 2021-10-05 17:43:46 +03:00