Commit Graph

11693 Commits

Author SHA1 Message Date
Hieu Hoang
2ea2224ba1 output source side of rule, if available 2014-04-07 12:00:13 +01:00
Hieu Hoang
09dc0e9913 add FF/SetSourcePhrase 2014-04-06 16:57:54 +01:00
Ulrich Germann
872facd171 Avoid errors in truecaser if input isn't factored and contains vertical bars. 2014-04-05 15:39:00 +01:00
Hieu Hoang
a17bead267 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2014-04-05 14:48:41 +01:00
Hieu Hoang
b7b502fbf5 eclipse 2014-04-04 22:34:38 +01:00
Rico Sennrich
395285d879 more logging for incremental chart search 2014-04-04 15:54:48 +01:00
Hieu Hoang
59098ac664 merge 2014-04-04 11:20:06 +01:00
Hieu Hoang
ee03e0f0e3 meteor scorer requires FDStream. Only gcc 2014-04-03 23:35:04 +01:00
Hieu Hoang
afb3d01344 eclipse 2014-04-03 23:27:08 +01:00
Hieu Hoang
b5784c9c77 eclipse 2014-04-03 23:23:21 +01:00
Hieu Hoang
5297620e8e eclipse 2014-04-03 23:15:05 +01:00
Hieu Hoang
f6956bfe99 eclipse 2014-04-03 23:14:19 +01:00
Ulrich Germann
b1b9f6cfa2 Additions / changes to Moses server so that it can be used for Moses tuning. 2014-04-03 20:38:14 +01:00
Ulrich Germann
467b5d93e8 Additions / changes to mosesserver.cpp so that it can be used in parameter tuning. 2014-04-03 20:35:26 +01:00
Ulrich Germann
aa1d942175 Reformatting for readability. 2014-04-03 20:35:02 +01:00
Ulrich Germann
8c04078d14 Reformatting for readability. 2014-04-03 20:33:40 +01:00
Hieu Hoang
67b1cbce13 error in constrained decoding 2014-04-03 18:03:58 +01:00
Rico Sennrich
63ca61ba0b tmcombine: don't crash if model contains sparse features or other data
(no adaptation; tmcombine will copy sparse features / data from first model)
2014-03-27 15:27:10 +00:00
Rico Sennrich
1f435340f0 faster pruning in chart decoding 2014-03-26 11:23:59 +00:00
Ulrich Germann
f63606a92b 1. Renamed PhraseDictionary member function release() to Release().
2. Added option to check phrase prefixes for early stopping of
   InputPath generation when translating confusion networks.
2014-03-25 01:49:24 +00:00
Ulrich Germann
80b031769d (1) Bug fixes related to locking. (2) pstats cache disabled when translating confusion networks. 2014-03-25 01:48:49 +00:00
Ulrich Germann
84299006da Added another constructor to TSA_tree_iterator. 2014-03-25 01:43:00 +00:00
Ulrich Germann
f5d969ab37 Added missing library for custom-pt.cc 2014-03-25 01:42:06 +00:00
Ulrich Germann
344b7a0759 Changed verbosity level for "read confusion net with format ..." to 2. 2014-03-25 01:41:21 +00:00
Hieu Hoang
4a91739ad5 eclipse 2014-03-24 16:09:28 +00:00
Hieu Hoang
d9fb9ce23c eclipse 2014-03-24 16:05:17 +00:00
Hieu Hoang
4ccb001b29 eclipse 2014-03-24 16:03:44 +00:00
Rico Sennrich
c8682e9420 target-syntax: use SoftMatchingFeature to assign non-terminal to unknown words 2014-03-24 14:57:24 +00:00
Ulrich Germann
728ceb4e4c Added missing #include. 2014-03-24 14:51:51 +00:00
Ulrich Germann
14e52c9b94 Errors occurring when accessing data files now throw exceptions. 2014-03-24 13:39:15 +00:00
Ulrich Germann
ed825cd234 Bug fix: dependency .../UG/generic//generic was missing for custom-pt 2014-03-24 13:33:46 +00:00
Ulrich Germann
20aa9dd189 Adapted to configurable map type in pstats. 2014-03-24 13:32:41 +00:00
Ulrich Germann
794defa238 Numerous bug fixes; implemented caching and improved life cycle
management for TargetPhraseCollections.
2014-03-24 13:30:22 +00:00
Ulrich Germann
7fd87943ea Reinstated pstats caching. 2014-03-24 13:29:05 +00:00
germann
2063da274d Made member Tokenindex.ridx mutable. 2014-03-24 12:34:59 +00:00
germann
5ec4436b07 Initial check-in. 2014-03-24 12:33:17 +00:00
germann
c12c25770a Added virtual member function release(TargetPhraseCollection*) to
PhraseDictionary, which gets called by the destructor of InputPath to
release TargetPhraseCollections that aren't needed any more.
2014-03-24 12:29:46 +00:00
germann
042e840045 Added mmlib as a dependency for FF_Factory.o 2014-03-24 12:23:18 +00:00
germann
f7ec2ee09a 1. Fixed order of initializations in constructor so gcc won't complain.
2. Added call  to PhraseDictionary.release(TargetPhraseCollection*) to
   release used TargetPhraseCollection's.
2014-03-24 10:55:47 +00:00
germann
ec6fd21603 Make sure m_numInputScores is initialized properly. 2014-03-24 10:53:29 +00:00
Phil Williams
cea86d6750 Transliteration support for syntax models. 2014-03-21 22:13:38 +00:00
Rico Sennrich
ba52fa163b use | as default escape sequence for "|" (for consistency with tokenizer.perl) 2014-03-21 19:19:03 +00:00
Ulrich Germann
1f32bd0d08 Fixed header inclusions. 2014-03-21 16:49:44 +00:00
Rico Sennrich
fb16df8c00 typo in last commit 2014-03-21 11:59:41 +00:00
Rico Sennrich
45630a5851 various optimizations to make CYK+ parser several times faster and eat less memory.
speed-up of decoding depends on how much time is spent in parser:
10-50% speed-up for string-to-tree systems observed (more on long sentences and with high max-chart-span).

if you only use hiero or string-to-tree models (but none with source syntax), use compile-option --unlabelled-source for (small) efficiency gains.
2014-03-21 11:12:24 +00:00
Ulrich Germann
4685aeadd2 Bug fix. 2014-03-21 11:11:24 +00:00
Ulrich Germann
d5c96eff35 Minor edit of a comment. 2014-03-21 11:08:37 +00:00
Ulrich Germann
f030986f51 Bug fixes. Added some sanity checks. 2014-03-21 11:07:17 +00:00
Ulrich Germann
f1d39b50fc Added support for Mmsapt. 2014-03-21 11:03:45 +00:00
Ulrich Germann
43e6c68586 Changed default method for PhraseScoreFwd. Added minor debugging message. 2014-03-19 20:54:49 +00:00