mosesdecoder/scripts/analysis

Put any scripts useful for human analysis of MT output here.

sentence-by-sentence.pl [EVH]: show a color-coded comparison of sentences in reference translation(s)/system output(s)/(truth)
-- show all sentences given, with non-matching words in the system output marked, BLEU scores given by sentence, and matching n-grams shown in a table
-- requires all input files to be UTF-8 encoded (you can convert a Latin-1 file with `cat FILE | perl -n -e 'binmode(STDOUT, ":utf8"); print;' > FILE.utf8`; do not run this on a file that is already UTF-8, or it will be encoded twice)
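
As a hedged alternative to the Perl one-liner above, which reads raw bytes and so in effect treats FILE as Latin-1, the conversion can be sketched with `iconv` so that the source encoding is named explicitly. The helper name `to_utf8` and the `.utf8` output suffix are illustrative assumptions, not part of these scripts.

```shell
# to_utf8 SRC_ENCODING FILE
# Hypothetical helper: converts FILE from SRC_ENCODING to UTF-8 and
# writes the result to FILE.utf8, leaving FILE itself untouched.
to_utf8() {
    iconv -f "$1" -t UTF-8 "$2" > "$2.utf8"
}
```

For example, `to_utf8 ISO-8859-1 ref.txt` would write `ref.txt.utf8` (the file name is hypothetical). iconv typically exits with a non-zero status when the input is not valid in the named source encoding, which makes a wrong guess easier to notice.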

show-phrases-used.pl [EVH]: draw a colorful diagram of which source phrases map to which target phrases
-- requires the Perl GD module, which in turn requires that gd be installed and in LD_LIBRARY_PATH
-- show average length of source phrases used for each sentence and overall
-- command-line options -r for reference and -s for source; lone filenames are taken to be system outputs