mosesdecoder/scripts/analysis/README

Put any scripts useful for human analysis of MT output here.

sentence-by-sentence.pl [EVH]: show comparison of sentences in reference translation(s)/system output(s)/(truth) in colorful format
-- show all sentences given, with non-matching words in the system output marked, BLEU scores given by sentence, and matching n-grams shown in a table
-- requires all input files be utf8-encoded (you can convert a file with `cat FILE | perl -n -e 'binmode(STDOUT, ":utf8"); print;' > FILE.utf8`)

show-phrases-used.pl [EVH]: draw colorful diagram of which source phrases map to which target phrases
-- requires the Perl GD module, which in turn requires that gd be installed and in LD_LIBRARY_PATH
-- show average length of source phrases used for each sentence and overall
-- command-line options -r for reference and -s for source; lone filenames are taken to be system outputs
added a placeholder git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@408 1f5c12ca-751b-0410-a591-d2e778427230 2006-07-31 20:39:33 +04:00			`Put any scripts useful for human analysis of MT output here.`
added my script to the docs git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@724 1f5c12ca-751b-0410-a591-d2e778427230 2006-08-14 20:13:29 +04:00
modified sentence-by-sentence to handle multiple outputs; edited cache handling in newsmtgui (should increase speed and decrease errors) git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@767 1f5c12ca-751b-0410-a591-d2e778427230 2006-08-16 18:49:10 +04:00			`sentence-by-sentence.pl [EVH]: show comparison of sentences in reference translation(s)/system output(s)/(truth) in colorful format`
adding show-phrases-used git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@768 1f5c12ca-751b-0410-a591-d2e778427230 2006-08-16 18:51:04 +04:00			`-- show all sentences given, with non-matching words in the system output marked, BLEU scores given by sentence, and matching n-grams shown in a table`
added my script to the docs git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@724 1f5c12ca-751b-0410-a591-d2e778427230 2006-08-14 20:13:29 +04:00			-- requires all input files be utf8-encoded (you can convert a file with `cat FILE \| perl -n -e 'binmode(STDOUT, ":utf8"); print;' > FILE.utf8`)
adding show-phrases-used git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@768 1f5c12ca-751b-0410-a591-d2e778427230 2006-08-16 18:51:04 +04:00
			`show-phrases-used.pl [EVH]: draw colorful diagram of which source phrases map to which target phrases`
updating docs git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@771 1f5c12ca-751b-0410-a591-d2e778427230 2006-08-16 20:37:11 +04:00			`-- requires the Perl GD module, which in turn requires that gd be installed and in LD_LIBRARY_PATH`
adding show-phrases-used git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@768 1f5c12ca-751b-0410-a591-d2e778427230 2006-08-16 18:51:04 +04:00			`-- show average length of source phrases used for each sentence and overall`
			`-- command-line options -r for reference and -s for source; lone filenames are taken to be system outputs`