Commit Graph

13671 Commits

Author SHA1 Message Date
Ulrich Germann
a49b76be3f Quick hack to make moses not stumble over double-dash parameter specifications. 2015-03-11 23:32:06 +00:00
Philipp Koehn
530d0f5a11 some more better defaults for recaser 2015-03-11 17:56:02 +00:00
Philipp Koehn
2ce45229f8 better default configuration for recaser 2015-03-11 17:52:30 +00:00
Philipp Koehn
1632c5f39d proper handling of specified configuration file 2015-03-11 16:49:20 +00:00
Matthias Huck
534a894c0b glue rules with stripped BitPar labels 2015-03-10 22:02:21 +00:00
Matthias Huck
01bed83cf9 GHKM extraction: option to strip non-terminal labels from BitPar syntactic parses right during extraction (i.e., remove any suffix starting with a hyphen from the label) 2015-03-10 21:25:32 +00:00
Hieu Hoang
2fe8bccd2b remove visual studio and xcode project files. No longer maintained 2015-03-10 16:19:13 +00:00
Hieu Hoang
1705e29212 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-03-10 16:05:13 +00:00
Hieu Hoang
ee6948b168 eclipse 2015-03-10 16:04:30 +00:00
Phil Williams
c7cf33ee05 parse-en-egret.perl: use "ROOT" instead of "TOP" as label of root tree node
This is to match the label Egret assigns to the root vertices of forests.
2015-03-10 15:43:14 +00:00
Hieu Hoang
ad73919979 merge with private branch 2015-03-10 15:28:45 +00:00
Phil Williams
77faaaea6c Add truecase-egret.sh
This is currently just a wrapper for Travatar's tree-converter tool.
2015-03-10 14:36:28 +00:00
Phil Williams
f7b4d403e3 Add parse-en-egret.perl wrapper script. 2015-03-10 14:32:59 +00:00
Phil Williams
9e88f794e6 Add phrase-extract/postprocess-egret-forests
This performs some minor transformations to Egret forests: escaping of
Moses special characters; removal of "^g" suffixes from constituent labels;
and marking of slash/hyphen split points (using @ characters).
2015-03-10 13:51:30 +00:00
Phil Williams
9e2eb702dc EMS: add TRAINING:use-syntax-input-weight-feature option 2015-03-10 11:40:49 +00:00
Phil Williams
91abb69cdf train-model.perl: add -use-syntax-input-weight-feature option
Currently only used for forest input.
2015-03-10 11:39:14 +00:00
Phil Williams
e8a7163f0d Add SyntaxInputWeight feature function
Currently only used for forest input.
2015-03-10 11:07:04 +00:00
Ulrich Germann
137b07a486 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-03-10 10:41:47 +00:00
Ulrich Germann
51824355f9 Sampling now keeps track of counts for hierarchical lexicalized reordering. 2015-03-10 10:41:41 +00:00
Phil Williams
7eba58b942 EMS: add TRAINING:dont-tune-glue-grammar option
Adds -dont-tune-glue-grammar to train-model.perl command during config file
generation step.  This is preferable to manually adding -dont-tune-glue-grammar
to TRAINING:training-options because changing its value won't trigger a re-run
of dependent steps that don't really need re-running (like word alignment).
2015-03-10 10:20:19 +00:00
Phil Williams
e79644540c train-model.perl: add -dont-tune-glue-grammar option 2015-03-10 09:53:12 +00:00
Phil Williams
fd3dcb7bb0 filter-model-given-input.pl: add -[no]StripXml and -SyntaxFilterCmd options
-noStripXml is required for tree and forest input in STSG-based models.

-SyntaxFilterCmd can be used to set the command for filtering rule tables in
syntax-based models.  The default is to use

    $SCRIPTS_ROOTDIR/../bin/filter-rule-table

The option -MinNonInitialRuleCount is deprecated.
2015-03-10 08:57:56 +00:00
Phil Williams
70bef90b36 train-model.perl: add -score-command option
This matches the existing -extract-command option.  Given the argument value
<name>, train-model.perl will use the score program in

  $SCRIPTS_ROOTDIR/../bin/<name>

The default value is "score".
2015-03-10 08:48:54 +00:00
Matthias Huck
25f5470216 GHKM: write target parts-of-speech as a factor 2015-03-09 21:54:03 +00:00
Matthias Huck
524ed4406e pragma once 2015-03-09 21:44:54 +00:00
Matthias Huck
559077f6f8 some moderate modifications in phrase-extract/score-main.cpp
(e.g., use Moses::Scan<>() rather than atof()/atoi())
2015-03-09 18:49:32 +00:00
Matthias Huck
973fd98052 conservative update of some old code in phrase-extract/consolidate-main.cpp 2015-03-09 18:47:28 +00:00
Matthias Huck
0c79e19ff9 consolidate properties: fixing bug from commit b08d3ed 2015-03-09 18:44:02 +00:00
Ulrich Germann
ccf4cb838c Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-03-09 01:27:02 +00:00
Hieu Hoang
b08d3ed0fe merge with private branch. Add --Count arg 2015-03-09 00:47:51 +00:00
Ulrich Germann
524376fad4 Code cleanup. 2015-03-09 00:34:47 +00:00
Ulrich Germann
c4885d87c0 Code cleanup 2015-03-09 00:34:18 +00:00
Ulrich Germann
c1d2313a66 Moved caching of lexical reordering scores from class TranslationOption to class TargetPhrase.
This was done so that phrase tables can add this information (if available) as extra annotation
to TargetPhrases, in preparation of providing lexical reordering models with sampling phrase tables.
2015-03-09 00:30:01 +00:00
Ulrich Germann
ddea89312e Commented out an #include that triggered a 'deprecated header' warning. 2015-03-09 00:08:31 +00:00
Hieu Hoang
afe8b4162e link with openmp for srilm 2015-03-08 21:25:24 +00:00
Hieu Hoang
987d3e9282 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-03-08 21:23:24 +00:00
Hieu Hoang
c08ebc9dae link with openmp for srilm 2015-03-08 21:23:13 +00:00
Hieu Hoang
44d4d94b44 eclipse 2015-03-08 20:26:16 +00:00
Matthias Huck
99b8f65fb1 GHKM: POS factor in glue rules: target side only 2015-03-06 16:47:44 +00:00
Ales Tamchyna
9266d65304 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-03-06 13:44:46 +01:00
Ales Tamchyna
9224779695 include scores of source-context features in topt collection verbose output 2015-03-06 13:44:38 +01:00
Rico Sennrich
b3d2b11a36 make BilingualLM play nice with new chart decoding score deltas 2015-03-06 08:52:13 +00:00
Matthias Huck
aa077ab66c GHKM extraction / consolidate: write most frequent POS sequence from property to factor (for usage with a POS LM) 2015-03-05 22:25:32 +00:00
Hieu Hoang
cb2e1b8a40 separate variables into lines. Easier to merge with other branches 2015-03-05 21:37:30 +00:00
Hieu Hoang
0f5556f6d9 separate variables into lines. Easier to merge with other branches 2015-03-05 21:28:51 +00:00
Rico Sennrich
2431f514dd fix EMS bug from dca8dd: cleaning step was skipped 2015-03-05 10:55:35 +00:00
Rico Sennrich
47c460fe1d remove unused variable 2015-03-05 08:31:50 +00:00
Ulrich Germann
596c4c1673 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-03-05 00:36:54 +00:00
Ulrich Germann
612e014e79 Organized command line options thematically.
This is the first step in migration to the use of boost::program_options for GNU/Linux-style handling of command line options.
2015-03-05 00:34:02 +00:00
Matthias Huck
773a16b5fd POS property in glue rules 2015-03-04 23:05:45 +00:00