11736 Commits

Author SHA1 Message Date
Hieu Hoang
67b1cbce13 error in constrained decoding 2014-04-03 18:03:58 +01:00
Rico Sennrich
63ca61ba0b tmcombine: don't crash if model contains sparse features or other data
(no adaptation; tmcombine will copy sparse features / data from first model)
2014-03-27 15:27:10 +00:00
Rico Sennrich
1f435340f0 faster pruning in chart decoding 2014-03-26 11:23:59 +00:00
Ulrich Germann
f63606a92b 1. Renamed PhraseDictionary member function release() to Release().
2. Added option to check phrase prefixes for early stopping of
   InputPath generation when translating confusion networks.
2014-03-25 01:49:24 +00:00
Ulrich Germann
80b031769d (1) Bug fixes related to locking. (2) pstats cache disabled when translating confusion networks. 2014-03-25 01:48:49 +00:00
Ulrich Germann
84299006da Added another constructor to TSA_tree_iterator. 2014-03-25 01:43:00 +00:00
Ulrich Germann
f5d969ab37 Added missing library for custom-pt.cc 2014-03-25 01:42:06 +00:00
Ulrich Germann
344b7a0759 Changed verbosity level for "read confusion net with format ..." to 2. 2014-03-25 01:41:21 +00:00
Hieu Hoang
4a91739ad5 eclipse 2014-03-24 16:09:28 +00:00
Hieu Hoang
d9fb9ce23c eclipse 2014-03-24 16:05:17 +00:00
Hieu Hoang
4ccb001b29 eclipse 2014-03-24 16:03:44 +00:00
Rico Sennrich
c8682e9420 target-syntax: use SoftMatchingFeature to assign non-terminal to unknown words 2014-03-24 14:57:24 +00:00
Ulrich Germann
728ceb4e4c Added missing #include. 2014-03-24 14:51:51 +00:00
Ulrich Germann
14e52c9b94 Errors occurring when accessing data files now throw exceptions. 2014-03-24 13:39:15 +00:00
Ulrich Germann
ed825cd234 Bug fix: dependency .../UG/generic//generic was missing for custom-pt 2014-03-24 13:33:46 +00:00
Ulrich Germann
20aa9dd189 Adapted to configurable map type in pstats. 2014-03-24 13:32:41 +00:00
Ulrich Germann
794defa238 Numerous bug fixes; implemented caching and improved life cycle
management for TargetPhraseCollections.
2014-03-24 13:30:22 +00:00
Ulrich Germann
7fd87943ea Reinstated pstats caching. 2014-03-24 13:29:05 +00:00
germann
2063da274d Made member Tokenindex.ridx mutable. 2014-03-24 12:34:59 +00:00
germann
5ec4436b07 Initial check-in. 2014-03-24 12:33:17 +00:00
germann
c12c25770a Added virtual member function release(TargetPhraseCollection*) to
PhraseDictionary, which gets called by the destructor of InputPath to
release TargetPhraseCollections that aren't needed any more.
2014-03-24 12:29:46 +00:00
germann
042e840045 Added mmlib as a dependency for FF_Factory.o 2014-03-24 12:23:18 +00:00
germann
f7ec2ee09a 1. Fixed order of initializations in constructor so gcc won't complain.
2. Added call to PhraseDictionary.release(TargetPhraseCollection*) to
   release used TargetPhraseCollections.
2014-03-24 10:55:47 +00:00
germann
ec6fd21603 Make sure m_numInputScores is initialized properly. 2014-03-24 10:53:29 +00:00
Phil Williams
cea86d6750 Transliteration support for syntax models. 2014-03-21 22:13:38 +00:00
Rico Sennrich
ba52fa163b use &#124; as default escape sequence for "|" (for consistency with tokenizer.perl) 2014-03-21 19:19:03 +00:00
Ulrich Germann
1f32bd0d08 Fixed header inclusions. 2014-03-21 16:49:44 +00:00
Rico Sennrich
fb16df8c00 typo in last commit 2014-03-21 11:59:41 +00:00
Rico Sennrich
45630a5851 various optimizations to make CYK+ parser several times faster and eat less memory.
speed-up of decoding depends on how much time is spent in parser:
10-50% speed-up for string-to-tree systems observed (more on long sentences and with high max-chart-span).

if you only use hiero or string-to-tree models (but none with source syntax), use compile-option --unlabelled-source for (small) efficiency gains.
2014-03-21 11:12:24 +00:00
Ulrich Germann
4685aeadd2 Bug fix. 2014-03-21 11:11:24 +00:00
Ulrich Germann
d5c96eff35 Minor edit of a comment. 2014-03-21 11:08:37 +00:00
Ulrich Germann
f030986f51 Bug fixes. Added some sanity checks. 2014-03-21 11:07:17 +00:00
Ulrich Germann
f1d39b50fc Added support for Mmsapt. 2014-03-21 11:03:45 +00:00
Ulrich Germann
43e6c68586 Changed default method for PhraseScoreFwd. Added minor debugging message. 2014-03-19 20:54:49 +00:00
Ulrich Germann
7e344d1c53 1. Added variant of CreateRuleLookupManager as required by the new
pure virtual function in PhraseTable.h (from the master branch).

2. Bug fix in pool_stats.
2014-03-19 20:43:47 +00:00
Ulrich Germann
c91fb5cc84 Added mmlex-lookup utility program. 2014-03-19 20:41:29 +00:00
Hieu Hoang
1c6061e781 bug when adding reordering model to ini file /jian zhang 2014-03-19 00:27:59 +00:00
Ulrich Germann
ef974cd6ad Merge branch 'master' into dynamic-phrase-tables 2014-03-18 18:25:35 +00:00
Ulrich Germann
740c9dc096 Memory-mapped stuff is now compiled only with --with-mm. 2014-03-18 18:23:59 +00:00
Ulrich Germann
ef2c012470 Merge branch 'master' into dynamic-phrase-tables
Conflicts:
	moses/Jamfile
2014-03-18 18:23:23 +00:00
Ulrich Germann
febb2afc4d Initial check-in. 2014-03-18 12:23:53 +00:00
Ulrich Germann
f20220066b Added choice of denominator for PhraseScoreFwd. 2014-03-18 12:23:31 +00:00
Ulrich Germann
a8eb6645c7 Bug fix. Added sanity check when adding data to dynamic suffix array. 2014-03-18 12:22:29 +00:00
Ulrich Germann
0562415ac0 Added program calc-coverage. 2014-03-18 12:21:12 +00:00
Ulrich Germann
e55dfa26b3 Added program calc-coverage. 2014-03-18 12:20:55 +00:00
Ulrich Germann
4aa88aaf2c Bug fix in constructor. 2014-03-18 12:20:29 +00:00
Ulrich Germann
a11b79175b Added function fill_token_seq. 2014-03-18 12:19:35 +00:00
Ulrich Germann
3f9cefe44e Removed some debugging messages.
Moved fill_token_seq to tpt_tokenindex.h.
2014-03-18 12:18:05 +00:00
Ulrich Germann
394a40b934 Added program calc-coverage. 2014-03-18 12:17:07 +00:00
Kenneth Heafield
fc61084743 Compiler warning 2014-03-17 07:18:11 -07:00