lang specific cleanup scripts are now possible

This commit is contained in:
Joerg Tiedemann 2020-02-29 18:23:08 +02:00
parent f5111a27a7
commit 3f57e4f873

View File

@ -0,0 +1,6 @@
# Language-specific cleanup scripts
Cleanup scripts can be collected in language-specific sub-directories.
Each script should read lines from STDIN and print to STDOUT and should produce the same number of lines in the output as there are in the input (to make sure that sentence alignment doesn't break). Parameters are not supported by the data processing pipeline implemented in Makefile.data that uses those scripts. By default the makefile will use all exectuable files in the language sub-directory.