mirror of
https://github.com/moses-smt/mosesdecoder.git
synced 2024-12-27 05:55:02 +03:00
2b4e61d826
Compiles, not tested.

Conflicts:
    Jamroot
    OnDiskPt/PhraseNode.h
    OnDiskPt/TargetPhrase.cpp
    OnDiskPt/TargetPhrase.h
    OnDiskPt/TargetPhraseCollection.cpp
    mert/BleuScorer.cpp
    mert/Data.cpp
    mert/FeatureData.cpp
    moses-chart-cmd/src/Main.cpp
    moses/src/AlignmentInfo.h
    moses/src/ChartManager.cpp
    moses/src/LM/Ken.cpp
    moses/src/LM/Ken.h
    moses/src/LMList.h
    moses/src/LexicalReordering.h
    moses/src/PhraseDictionaryTree.h
    moses/src/ScoreIndexManager.h
    moses/src/StaticData.h
    moses/src/TargetPhrase.h
    moses/src/Word.cpp
    scripts/ems/experiment.meta
    scripts/ems/experiment.perl
    scripts/training/train-model.perl

perllib
smtgui
bootstrap-hypothesis-difference-significance.pl
extract-target-trees.py
nontranslated_words.pl
oov.pl
README
sentence-by-sentence.pl
sg2dot.perl
show-phrases-used.pl
suspicious_tokenization.pl
weight-scan-summarize.sh
weight-scan.pl

Put any scripts useful for human analysis of MT output here.

sentence-by-sentence.pl [EVH]: show comparison of sentences in reference translation(s)/system output(s)/(truth) in colorful format
-- show all sentences given, with non-matching words in the system output marked, BLEU scores given by sentence, and matching n-grams shown in a table
-- requires all input files be utf8-encoded (you can convert a file with `cat FILE | perl -n -e 'binmode(STDOUT, ":utf8"); print;' > FILE.utf8`)

show-phrases-used.pl [EVH]: draw colorful diagram of which source phrases map to which target phrases
-- requires the Perl GD module, which in turn requires that gd be installed and in LD_LIBRARY_PATH
-- show average length of source phrases used for each sentence and overall
-- command-line options -r for reference and -s for source; lone filenames are taken to be system outputs
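
As a rough illustration of the per-sentence comparison that sentence-by-sentence.pl reports (matching n-grams and non-matching words), here is a minimal Python sketch. It is not the script's actual implementation: the function names `ngrams`, `matching_ngrams` and `unmatched_words` are hypothetical, tokenization is assumed to be plain whitespace splitting, and only a single reference is handled.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def matching_ngrams(system, reference, max_n=4):
    """For each n up to max_n, return the n-grams shared by a system
    sentence and one reference, with counts clipped to the smaller side."""
    sys_tokens = system.split()   # assumption: whitespace tokenization
    ref_tokens = reference.split()
    matches = {}
    for n in range(1, max_n + 1):
        # Counter intersection keeps the minimum count of each n-gram
        matches[n] = ngrams(sys_tokens, n) & ngrams(ref_tokens, n)
    return matches


def unmatched_words(system, reference):
    """Words of the system output that do not occur in the reference."""
    ref_vocab = set(reference.split())
    return [w for w in system.split() if w not in ref_vocab]


if __name__ == "__main__":
    ref = "the cat sat on the mat"
    hyp = "the cat is on the mat"
    for n, counts in matching_ngrams(hyp, ref).items():
        print(n, sum(counts.values()), sorted(counts))
    print("unmatched:", unmatched_words(hyp, ref))
```

The clipped counts are the same quantities that go into the n-gram precision terms of a sentence-level BLEU score, so a table of them per sentence gives a quick view of where a system output diverges from the reference.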