13964 Commits

Author SHA1 Message Date
Hieu Hoang
5fefb0da47 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-05-05 12:02:13 +04:00
Hieu Hoang
8f272e04a9 output debugging messages to stderr, not stdout 2015-05-05 12:01:21 +04:00
Nicola Bertoldi
6085a60494 bug fix 2015-05-04 09:51:12 +02:00
Nicola Bertoldi
90a982e579 merge remote into local 2015-05-04 09:42:44 +02:00
Nicola Bertoldi
c4f04670c2 made ProbingPT constructor compliant with PhraseDictionary signature 2015-05-04 09:25:50 +02:00
Nicola Bertoldi
85a4e48df4 made use of boost::shared_mutex dependent from WITH_THREADS 2015-05-04 09:24:36 +02:00
Ulrich Germann
2b671e67dd Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-05-03 15:09:45 +01:00
Ulrich Germann
6d40c6e209 Bug fix wrt. lexical reordering of OOV words in combination with Mmsapt. 2015-05-03 15:09:26 +01:00
Hieu Hoang
d456d9229e add multi-bleu-detok. Like multi-bleu scoring but will detokenize/post-process before scoring 2015-05-03 14:07:12 +04:00
Hieu Hoang
e5f76ee99e Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-05-03 11:50:31 +04:00
Hieu Hoang
73ae7d7e20 option not to use parallel 2015-05-03 11:50:10 +04:00
Hieu Hoang
402b958d90 some astyle versions don't like prefixed commas 2015-05-03 10:03:38 +04:00
Hieu Hoang
cc8c6b7b10 beautify 2015-05-02 11:45:24 +01:00
Philipp Koehn
a4a7c14593 allow breaking up training data for fast align (to avoid memory blowups for very large corpora) 2015-05-01 17:47:08 -04:00
Philipp Koehn
de6a9bd1b3 minor updates to factor scripts; brown-cluster may now run other scripts (e.g., truecaser) before assigning classes 2015-05-01 17:46:14 -04:00
Philipp Koehn
b369699661 various small changes, mostly related to better compliance with grid engine 2015-05-01 17:44:18 -04:00
Jeroen Vermeulen
09c982c1de Remove bad initialization.
Setting lastLine[0] when lastLine is empty probably doesn't do anything, but
in C++11 it is definitely undefined. The value wasn't used anyway!
2015-05-01 18:42:04 +07:00
XapaJIaMnu
dab63e8076 Initial work to add support for profiling mode in the testsuite 2015-04-30 23:46:50 +01:00
Matthias Huck
4ee8f2dec1 sentence-bleu less greedy regarding memory
Don't load all references, read them line by line.
Corpora with millions of sentences can now be evaluated without consuming gigabytes of RAM.
2015-04-30 22:26:30 +01:00
Matthias Huck
1d86b8fde7 .gitignore 2015-04-30 19:46:03 +01:00
Matthias Huck
34d1d3a904 sentence-bleu-nbest 2015-04-30 19:44:29 +01:00
Rico Sennrich
e98a2fc980 fix interpolation for LM with parser in pre-processing 2015-04-30 15:46:33 +01:00
Hieu Hoang
1278b8f5a7 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-04-30 15:35:34 +04:00
Hieu Hoang
a6d34a660d another madamira wrapper. Just uses the tokenized file it outputs 2015-04-30 15:35:15 +04:00
Hieu Hoang
15e4b16f49 delete unused var 2015-04-30 14:01:03 +04:00
Hieu Hoang
ebc5a51d32 Merge pull request #111 from unhammer/extract-perl-safewait
die if the forked extract exited with error
2015-04-30 11:45:25 +04:00
Hieu Hoang
1c99b2b2b8 Merge pull request #110 from unhammer/extract-perl-abspath-when-ln
avoid bad symlinks in extract-parallel
2015-04-30 11:41:03 +04:00
Kevin Brubeck Unhammer
2af2f2ef36 avoid bad symlinks in extract-parallel
train-model seems to pass a non-absolute path for the
model/aligned-argument, and then extract-parallel creates a bad symlink
2015-04-30 09:36:59 +02:00
Kevin Brubeck Unhammer
c116fa0dbf die if the forked extract exited with error
Should we pass on bad exit codes from RunFork to those waitpids as well?
Seems like the right thing, though I don't know the code.
2015-04-30 09:33:37 +02:00
Nicola Bertoldi
3400b622c0 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-04-30 08:35:41 +02:00
Jeroen Vermeulen
eca5824100 Remove trailing whitespace in C++ files. 2015-04-30 12:05:11 +07:00
Jeroen Vermeulen
85acdc62b1 Make build instructions a bit less discouraging. 2015-04-30 11:45:59 +07:00
Ulrich Germann
324b1a9b56 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-04-29 20:20:54 +01:00
Ulrich Germann
e4f5c69109 One step closer to eliminating the requirement to provide num-features=... in the config file.
Some FFs (Mmsapt, LexicalReordering, many single-value FFs) provide this number during "registration";
when missing, a default weight vector of uniform 1.0 is automatically generated. This eliminates the
need for the user to figure out what the exact number of features is for each FF, which can get complicated,
e.g. in the case of Mmsapt/PhraseDictionaryBitextSampling.
2015-04-29 20:16:52 +01:00
Ulrich Germann
c76f1c338d Uninitialized variable. 2015-04-29 20:16:43 +01:00
Ulrich Germann
d99479d5a3 Bug fix. 2015-04-29 20:11:34 +01:00
Ulrich Germann
8b2e5ea0be No try { ... } catch { ... } in debug mode. 2015-04-29 20:07:03 +01:00
Ulrich Germann
1c37f92d77 Missing return value. 2015-04-29 20:06:08 +01:00
Jeroen Vermeulen
616b589da3 Fix a bunch of compiler warnings.
Warnings are useful, but only if there are few!
2015-04-29 21:18:51 +07:00
Hieu Hoang
8f9bf7ea38 add -config 2015-04-28 15:03:59 +04:00
Hieu Hoang
7c19add821 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-04-28 12:30:19 +04:00
Hieu Hoang
b7792b227a script to convert arabic to bw, and vice versa 2015-04-28 12:29:58 +04:00
Hieu Hoang
e6951aa9f2 codelite 2015-04-27 23:01:14 +04:00
Hieu Hoang
0ee4feecb5 add extra parser argument to ChartCellCollection to support FF::EvaluateGivenAllOtherTransOpts() 2015-04-27 22:41:47 +04:00
Ulrich Germann
315610c02a Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-04-27 16:39:40 +01:00
Ulrich Germann
0c74610fc2 Merge branch 'master' of https://github.com/ugermann/mosesdecoder 2015-04-27 16:37:48 +01:00
Ulrich Germann
ba6e17ce26 Code reorganization. 2015-04-27 16:31:22 +01:00
Ulrich Germann
37bb1de9ed Unused variable. 2015-04-27 16:30:59 +01:00
Hieu Hoang
8adad4fc2e exec permission 2015-04-27 17:39:49 +04:00
Hieu Hoang
a47fc00635 option to output factors 2015-04-27 17:35:19 +04:00