Ulrich Germann
a49b76be3f
Quick hack to make moses not stumble over double-dash parameter specifications.
2015-03-11 23:32:06 +00:00
Philipp Koehn
530d0f5a11
some more better defaults for recaser
2015-03-11 17:56:02 +00:00
Philipp Koehn
2ce45229f8
better default configuration for recaser
2015-03-11 17:52:30 +00:00
Philipp Koehn
1632c5f39d
proper handling of specified configuration file
2015-03-11 16:49:20 +00:00
Matthias Huck
534a894c0b
glue rules with stripped BitPar labels
2015-03-10 22:02:21 +00:00
Matthias Huck
01bed83cf9
GHKM extraction: option to strip non-terminal labels from BitPar syntactic parses right during extraction (i.e., remove any suffix starting with a hyphen from the label)
2015-03-10 21:25:32 +00:00
Hieu Hoang
2fe8bccd2b
remove visual studio and xcode project files. No longer maintained
2015-03-10 16:19:13 +00:00
Hieu Hoang
1705e29212
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2015-03-10 16:05:13 +00:00
Hieu Hoang
ee6948b168
eclipse
2015-03-10 16:04:30 +00:00
Phil Williams
c7cf33ee05
parse-en-egret.perl: use "ROOT" instead of "TOP" as label of root tree node
This is to match the label Egret assigns to the root vertices of forests.
2015-03-10 15:43:14 +00:00
Hieu Hoang
ad73919979
merge with private branch
2015-03-10 15:28:45 +00:00
Phil Williams
77faaaea6c
Add truecase-egret.sh
This is currently just a wrapper for Travatar's tree-converter tool.
2015-03-10 14:36:28 +00:00
Phil Williams
f7b4d403e3
Add parse-en-egret.perl wrapper script.
2015-03-10 14:32:59 +00:00
Phil Williams
9e88f794e6
Add phrase-extract/postprocess-egret-forests
This performs some minor transformations to Egret forests: escaping of
Moses special characters; removal of "^g" suffixes from constituent labels;
and marking of slash/hyphen split points (using @ characters).
2015-03-10 13:51:30 +00:00
Phil Williams
9e2eb702dc
EMS: add TRAINING:use-syntax-input-weight-feature option
2015-03-10 11:40:49 +00:00
Phil Williams
91abb69cdf
train-model.perl: add -use-syntax-input-weight-feature option
Currently only used for forest input.
2015-03-10 11:39:14 +00:00
Phil Williams
e8a7163f0d
Add SyntaxInputWeight feature function
Currently only used for forest input.
2015-03-10 11:07:04 +00:00
Ulrich Germann
137b07a486
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-03-10 10:41:47 +00:00
Ulrich Germann
51824355f9
Sampling now keeps track of counts for hierarchical lexicalized reordering.
2015-03-10 10:41:41 +00:00
Phil Williams
7eba58b942
EMS: add TRAINING:dont-tune-glue-grammar option
Adds -dont-tune-glue-grammar to train-model.perl command during config file
generation step. This is preferable to manually adding -dont-tune-glue-grammar
to TRAINING:training-options because changing its value won't trigger a re-run
of dependent steps that don't really need re-running (like word alignment).
2015-03-10 10:20:19 +00:00
Phil Williams
e79644540c
train-model.perl: add -dont-tune-glue-grammar option
2015-03-10 09:53:12 +00:00
Phil Williams
fd3dcb7bb0
filter-model-given-input.pl: add -[no]StripXml and -SyntaxFilterCmd options
-noStripXml is required for tree and forest input in STSG-based models.
-SyntaxFilterCmd can be used to set the command for filtering rule tables in
syntax-based models. The default is to use
$SCRIPTS_ROOTDIR/../bin/filter-rule-table
The option -MinNonInitialRuleCount is deprecated.
2015-03-10 08:57:56 +00:00
Phil Williams
70bef90b36
train-model.perl: add -score-command option
This matches the existing -extract-command option. Given the argument value
<name>, train-model.perl will use the score program in
$SCRIPTS_ROOTDIR/../bin/<name>
The default value is "score".
2015-03-10 08:48:54 +00:00
Matthias Huck
25f5470216
GHKM: write target parts-of-speech as a factor
2015-03-09 21:54:03 +00:00
Matthias Huck
524ed4406e
pragma once
2015-03-09 21:44:54 +00:00
Matthias Huck
559077f6f8
some moderate modifications in phrase-extract/score-main.cpp
(e.g., use Moses::Scan<>() rather than atof()/atoi())
2015-03-09 18:49:32 +00:00
Matthias Huck
973fd98052
conservative update of some old code in phrase-extract/consolidate-main.cpp
2015-03-09 18:47:28 +00:00
Matthias Huck
0c79e19ff9
consolidate properties: fixing bug from commit b08d3ed
2015-03-09 18:44:02 +00:00
Ulrich Germann
ccf4cb838c
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-03-09 01:27:02 +00:00
Hieu Hoang
b08d3ed0fe
merge with private branch. Add --Count arg
2015-03-09 00:47:51 +00:00
Ulrich Germann
524376fad4
Code cleanup.
2015-03-09 00:34:47 +00:00
Ulrich Germann
c4885d87c0
Code cleanup
2015-03-09 00:34:18 +00:00
Ulrich Germann
c1d2313a66
Moved caching of lexical reordering scores from class TranslationOption to class TargetPhrase.
This was done so that phrase tables can add this information (if available) as extra annotation
to TargetPhrases, in preparation of providing lexical reordering models with sampling phrase tables.
2015-03-09 00:30:01 +00:00
Ulrich Germann
ddea89312e
Commented out an #include that triggered a 'deprecated header' warning.
2015-03-09 00:08:31 +00:00
Hieu Hoang
afe8b4162e
link with openmp for srilm
2015-03-08 21:25:24 +00:00
Hieu Hoang
987d3e9282
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2015-03-08 21:23:24 +00:00
Hieu Hoang
c08ebc9dae
link with openmp for srilm
2015-03-08 21:23:13 +00:00
Hieu Hoang
44d4d94b44
eclipse
2015-03-08 20:26:16 +00:00
Matthias Huck
99b8f65fb1
GHKM: POS factor in glue rules: target side only
2015-03-06 16:47:44 +00:00
Ales Tamchyna
9266d65304
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2015-03-06 13:44:46 +01:00
Ales Tamchyna
9224779695
include scores of source-context features in topt collection verbose output
2015-03-06 13:44:38 +01:00
Rico Sennrich
b3d2b11a36
make BilingualLM play nice with new chart decoding score deltas
2015-03-06 08:52:13 +00:00
Matthias Huck
aa077ab66c
GHKM extraction / consolidate: write most frequent POS sequence from property to factor (for usage with a POS LM)
2015-03-05 22:25:32 +00:00
Hieu Hoang
cb2e1b8a40
separate variables into lines. Easier to merge with other branches
2015-03-05 21:37:30 +00:00
Hieu Hoang
0f5556f6d9
separate variables into lines. Easier to merge with other branches
2015-03-05 21:28:51 +00:00
Rico Sennrich
2431f514dd
fix EMS bug from dca8dd: cleaning step was skipped
2015-03-05 10:55:35 +00:00
Rico Sennrich
47c460fe1d
remove unused variable
2015-03-05 08:31:50 +00:00
Ulrich Germann
596c4c1673
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-03-05 00:36:54 +00:00
Ulrich Germann
612e014e79
Organized command line options thematically.
This is the first step in migration to the use of boost::program_options for GNU/Linux-style handling of command line options.
2015-03-05 00:34:02 +00:00
Matthias Huck
773a16b5fd
POS property in glue rules
2015-03-04 23:05:45 +00:00