102 Commits

Author SHA1 Message Date
Michael Denkowski
13942b77ab Add alias PhraseDictionaryBitextSampling 2014-08-05 14:47:07 -04:00
Hieu Hoang
e863592f40 TargetPhrase to have pointer to the phrase table that creates it 2014-08-04 19:28:04 +01:00
hieu
5741ef2635 compile error in gcc 4.4 2014-07-30 18:01:51 +01:00
Ulrich Germann
f9d167345a Changed feature and parameter names for Mmsapt / PhraseDictionaryBitextSampling as requested by PK. 2014-07-29 13:57:00 +01:00
Hieu Hoang
1347b153ee compiles with c++11. Used by oxlm 2014-07-17 23:13:06 +01:00
Ulrich Germann
4d41211c2c Major overhaul of Mmsapt. Reorganization of old and addition of new features in phrase tables. Many critical bug fixes. 2014-07-09 02:41:28 +01:00
Ulrich Germann
e1c9405b23 Utilities to check gain in phrase coverage by dynamic augmentation of the phrase table in a post-editing scenario. 2014-07-09 02:39:28 +01:00
Ulrich Germann
556e112366 Major bug fix in Mmsapt.combine_pstats. 2014-06-30 00:32:11 +01:00
Ulrich Germann
9eaf506ace Renamed lookup_mmsapt.cc to ptable-lookup.cc. 2014-06-27 01:10:14 +01:00
Ulrich Germann
9dfb354041 Phrase probability smoothing can now be turned off in ug_bitext.cc by setting the confidence threshold to 0. 2014-06-27 01:07:23 +01:00
Ulrich Germann
40ab68b3d2 Adapted lookup_mmsapt to binary phrase tables; renamed the utility to ptable-lookup 2014-06-27 01:06:00 +01:00
Ulrich Germann
2ac93eca5c PScoreBwd can now scale backward marginals as specified by the user. 2014-06-15 21:11:33 +01:00
Ulrich Germann
1a3d7d3266 Reorganization of phrase scorers in Mmsapt. 2014-06-14 13:03:31 +01:00
Ulrich Germann
c3db1a3a67 Better labeling of feature values in lookup_mmsapt. 2014-06-14 13:02:55 +01:00
Ulrich Germann
135ce20509 Smoothing parameter in LexicalPhraseScorer2 is now configurable. 2014-06-14 13:02:08 +01:00
Ulrich Germann
450c8c5b80 Moved phrase scorers to a separate file. 2014-06-14 12:58:33 +01:00
Ulrich Germann
ead3fc565e Initial check-in. 2014-06-12 18:30:41 +01:00
Ulrich Germann
5116f0072b Minor edits to ug_bitext.h. Added min_diverse to ug_bitext::job to ensure minimum number of translation alternatives before sampling stops. 2014-06-05 01:50:55 +01:00
Ulrich Germann
b92d599727 Bug fix in mmlex-lookup. 2014-06-05 01:48:11 +01:00
Ulrich Germann
2f109621bf Added configurable options and SetTableLimit to Mmsapt. 2014-06-05 01:47:29 +01:00
Ulrich Germann
a40fcbae02 Added utility lookup_mmsapt 2014-06-05 01:45:34 +01:00
Ulrich Germann
ce853731ae Added mmsapt lookup utility. 2014-06-05 01:38:55 +01:00
Ulrich Germann
8dee1725fb Removed Phrase penalty as a built-in feature function. 2014-06-03 21:36:04 +01:00
Ulrich Germann
9cda39f802 Major bug fix in setting ug_bitext::jstats.my_cnt2. 2014-06-03 15:48:27 +01:00
Ulrich Germann
8c9a26a3be Added phrase feature function for log counts. 2014-06-03 15:45:29 +01:00
Ulrich Germann
0fc80660f3 TSA_tree_iterator::approxOccurrenceCount() now returns exact count for small values. 2014-06-03 15:44:20 +01:00
Ulrich Germann
a248349da3 Added more phrase table features and configuration options to Mmsapt. 2014-06-03 15:41:59 +01:00
Ulrich Germann
ce435d22bf Mmsapt now adds the phrase penalty only if num_features specified
on the respective line in moses.ini is odd.
2014-06-01 18:23:00 +01:00
Ulrich Germann
c452875343 Mmsapt/ug_bitext:
- Sampling now uses a static seed for consistency between runs.
  (To guarantee consistency, specify workers=1 on the Mmsapt line in
  moses.ini; otherwise the randomness of thread scheduling may cause
  inconsistencies between runs.)
- Denominator for fwd phrase probabilities can now be chosen via an
  external argument.
2014-05-31 14:29:44 +01:00
Ulrich Germann
380b2da2d0 Bug fix in, and in connection with, Mmsapt::combine_pstats(). 2014-05-19 21:34:58 +01:00
Ulrich Germann
f63606a92b 1. Renamed PhraseDictionary member function release() to Release().
2. Added option to check phrase prefixes for early stopping of
   InputPath generation when translating confusion networks.
2014-03-25 01:49:24 +00:00
Ulrich Germann
80b031769d (1) Bug fixes related to locking. (2) pstats cache disabled when translating confusion networks. 2014-03-25 01:48:49 +00:00
Ulrich Germann
84299006da Added another constructor to TSA_tree_iterator. 2014-03-25 01:43:00 +00:00
Ulrich Germann
f5d969ab37 Added missing library for custom-pt.cc 2014-03-25 01:42:06 +00:00
Ulrich Germann
728ceb4e4c Added missing #include. 2014-03-24 14:51:51 +00:00
Ulrich Germann
14e52c9b94 Errors occurring when accessing data files now throw exceptions. 2014-03-24 13:39:15 +00:00
Ulrich Germann
ed825cd234 Bug fix: dependency .../UG/generic//generic was missing for custom-pt 2014-03-24 13:33:46 +00:00
Ulrich Germann
20aa9dd189 Adapted to configurable map type in pstats. 2014-03-24 13:32:41 +00:00
Ulrich Germann
794defa238 Numerous bug fixes; implemented caching and improved life cycle
management for TargetPhraseCollections.
2014-03-24 13:30:22 +00:00
Ulrich Germann
7fd87943ea Reinstated pstats caching. 2014-03-24 13:29:05 +00:00
germann
2063da274d Made member Tokenindex.ridx mutable. 2014-03-24 12:34:59 +00:00
germann
5ec4436b07 Initial check-in. 2014-03-24 12:33:17 +00:00
Ulrich Germann
1f32bd0d08 Fixed header inclusions. 2014-03-21 16:49:44 +00:00
Ulrich Germann
4685aeadd2 Bug fix. 2014-03-21 11:11:24 +00:00
Ulrich Germann
d5c96eff35 Minor edit of a comment. 2014-03-21 11:08:37 +00:00
Ulrich Germann
f030986f51 Bug fixes. Added some sanity checks. 2014-03-21 11:07:17 +00:00
Ulrich Germann
43e6c68586 Changed default method for PhraseScoreFwd. Added minor debugging message. 2014-03-19 20:54:49 +00:00
Ulrich Germann
7e344d1c53 1. Added variant of CreateRuleLookupManager as required by the new
pure virtual function in PhraseTable.h (from the master branch).

2. Bug fix in pool_stats.
2014-03-19 20:43:47 +00:00
Ulrich Germann
c91fb5cc84 Added mmlex-lookup utility program. 2014-03-19 20:41:29 +00:00
Ulrich Germann
febb2afc4d Initial check-in. 2014-03-18 12:23:53 +00:00