cleanup | lang specific cleanup scripts are now possible | 2020-02-29 18:23:08 +02:00
evaluate | fixed a problem with langlabel files | 2021-09-13 00:07:51 +03:00
filter | fixed multithreading issues with data recipe | 2021-08-09 22:19:05 +03:00
bitext_filter.pl | fixing many bugs with tatoeba model recipes | 2022-02-07 20:55:31 +02:00
data-sample-sizes.pl | more sp models | 2022-06-02 00:30:13 +03:00
detect_chinese_script.pl | fix chinese/korean/japanese language codes | 2020-06-17 22:02:39 +03:00
fit-data-size.pl | released langpairs in tatoeba | 2022-02-05 13:40:55 +02:00
fix_vocab.py | allas storage commands | 2021-11-04 09:57:48 +02:00
large-context.pl | fixed multilingual tatoeba evaluation | 2020-06-11 00:54:40 +03:00
normalize-scores.py | elg updates | 2022-03-20 21:15:49 +02:00
pivot-bt.pl | added recipe for refreshing release info | 2021-03-13 00:29:23 +02:00
postprocess-bpe.sh | removed dependence on moses tools in preprocessing script for released spm packages | 2020-09-12 14:42:10 +03:00
postprocess-spm.sh | 24x12 transformer model added | 2023-03-20 23:55:58 +02:00
postprocess-txt.sh | internal sentence piece models in transformers | 2020-09-12 16:16:01 +03:00
preprocess-bpe-multi-target.sh | 24x12 transformer model added | 2023-04-17 23:45:34 +03:00
preprocess-bpe.sh | 24x12 transformer model added | 2023-04-17 23:45:34 +03:00
preprocess-spm-multi-target.sh | added recipes for tatoeba models other than English | 2021-05-04 08:49:16 +03:00
preprocess-spm.sh | added recipes for tatoeba models other than English | 2021-05-04 08:49:16 +03:00
preprocess-txt-multi-target.sh | 24x12 transformer model added | 2023-04-17 23:45:34 +03:00
preprocess-txt.sh | 24x12 transformer model added | 2023-04-17 23:45:34 +03:00
readme2yaml.pl | fixed tatoeba group recipes | 2021-02-16 20:36:00 +02:00
verify-wordalign.pl | fixed multilingual tatoeba evaluation | 2020-06-11 00:54:40 +03:00
vocab2yaml.py | create valid yaml files from vocab | 2021-10-05 17:43:46 +03:00