Michael Denkowski
13942b77ab
Add alias PhraseDictionaryBitextSampling
2014-08-05 14:47:07 -04:00
Hieu Hoang
e863592f40
TargetPhrase to have pointer to the phrase table that creates it
2014-08-04 19:28:04 +01:00
hieu
5741ef2635
compile error in gcc 4.4
2014-07-30 18:01:51 +01:00
Ulrich Germann
f9d167345a
Changed feature and parameter names for Mmsapt / PhraseDictionaryBitextSampling as requested by PK.
2014-07-29 13:57:00 +01:00
Hieu Hoang
1347b153ee
compiles with c++11. Used by oxlm
2014-07-17 23:13:06 +01:00
Ulrich Germann
4d41211c2c
Major overhaul of Mmsapt. Reorganization of old and addition of new features in phrase tables. Many critical bug fixes.
2014-07-09 02:41:28 +01:00
Ulrich Germann
e1c9405b23
Utilities to check gain in phrase coverage by dynamic augmentation of the phrase table in a post-editing scenario.
2014-07-09 02:39:28 +01:00
Ulrich Germann
556e112366
Major bug fix in Mmsapt.combine_pstats.
2014-06-30 00:32:11 +01:00
Ulrich Germann
9eaf506ace
Renamed lookup_mmsapt.cc to ptable-lookup.cc.
2014-06-27 01:10:14 +01:00
Ulrich Germann
9dfb354041
Phrase probability smoothing can now be turned off in ug_bitext.cc by setting the confidence threshold to 0.
2014-06-27 01:07:23 +01:00
Ulrich Germann
40ab68b3d2
Adapted lookup_mmsapt to binary phrase tables; renamed the utility to ptable-lookup
2014-06-27 01:06:00 +01:00
Ulrich Germann
2ac93eca5c
PScoreBwd can now scales backward marginals according as specified by user.
2014-06-15 21:11:33 +01:00
Ulrich Germann
1a3d7d3266
Reorganization of phrase scorers in Mmsapt.
2014-06-14 13:03:31 +01:00
Ulrich Germann
c3db1a3a67
Better labeling of feature values in lookup_mmsapt.
2014-06-14 13:02:55 +01:00
Ulrich Germann
135ce20509
Smoothing parameter in LexicalPhraseScorer2 is now a configurable paramter.
2014-06-14 13:02:08 +01:00
Ulrich Germann
450c8c5b80
Moved phrase scorers to a separate file.
2014-06-14 12:58:33 +01:00
Ulrich Germann
ead3fc565e
Initial check-in.
2014-06-12 18:30:41 +01:00
Ulrich Germann
5116f0072b
Minor edits to ug_bitext.h. Added min_diverse to ug_bitext::job to ensure minimum number of translation alternatives before sampling stops.
2014-06-05 01:50:55 +01:00
Ulrich Germann
b92d599727
Bug fix in mmlex-lookup.
2014-06-05 01:48:11 +01:00
Ulrich Germann
2f109621bf
Added configurable options and SetTableLimit to Mmsapt.
2014-06-05 01:47:29 +01:00
Ulrich Germann
a40fcbae02
Added utility lookup_mmsapt
2014-06-05 01:45:34 +01:00
Ulrich Germann
ce853731ae
Added mmsapt lookup utility.
2014-06-05 01:38:55 +01:00
Ulrich Germann
8dee1725fb
Removed Phrase penalty as a built-in feature function.
2014-06-03 21:36:04 +01:00
Ulrich Germann
9cda39f802
Major bug fix in setting ug_bitext::jstats.my_cnt2.
2014-06-03 15:48:27 +01:00
Ulrich Germann
8c9a26a3be
Added phrase feature function for log counts.
2014-06-03 15:45:29 +01:00
Ulrich Germann
0fc80660f3
TSA_tree_iterator::approxOccurrenceCount() now returns exact count for small values.
2014-06-03 15:44:20 +01:00
Ulrich Germann
a248349da3
Added more phrase table features and configuration options to Mmsapt.
2014-06-03 15:41:59 +01:00
Ulrich Germann
ce435d22bf
Mmsapt now adds the phrase penalty only if num_features specified
...
on the respective line in moses.ini is odd.
2014-06-01 18:23:00 +01:00
Ulrich Germann
c452875343
Mmsapt/ug_bitext:
...
- Sampling now uses a static seed for consistency between runs.
- (In order to have consistency guarantees, specify workers=1 in the
- Mmsapt line in moses.ini; otherwise the randomness of thread scheduling may
cause inconsistencies between runs.
- Denominator for fwd phrase probabilities can now be chosen via an
external argument.
2014-05-31 14:29:44 +01:00
Ulrich Germann
380b2da2d0
Bug fix in and in connection with Mmsapt::combine_pstats().
2014-05-19 21:34:58 +01:00
Ulrich Germann
f63606a92b
1. Renamed PhraseDictionary member function release() to Release().
...
2. Added option to check phrase prefixes for early stopping of
InputPath generation when translating confusion networks.
2014-03-25 01:49:24 +00:00
Ulrich Germann
80b031769d
(1) Bug fixes related to locking. (2) pstats cache disabled when translating confusion networks.
2014-03-25 01:48:49 +00:00
Ulrich Germann
84299006da
Added another constructor to TSA_tree_iterator.
2014-03-25 01:43:00 +00:00
Ulrich Germann
f5d969ab37
Added missing library for custom-pt.cc
2014-03-25 01:42:06 +00:00
Ulrich Germann
728ceb4e4c
Added missing #include.
2014-03-24 14:51:51 +00:00
Ulrich Germann
14e52c9b94
Errors occurring when accessing data files now throw exceptions.
2014-03-24 13:39:15 +00:00
Ulrich Germann
ed825cd234
Bug fix: dependency .../UG/generic//generic was missing for custom-pt
2014-03-24 13:33:46 +00:00
Ulrich Germann
20aa9dd189
Adapted to configurable map type in pstats.
2014-03-24 13:32:41 +00:00
Ulrich Germann
794defa238
Numerous bug fixes; implemented caching and improved life cycle
...
management for TargetPhraseCollections.
2014-03-24 13:30:22 +00:00
Ulrich Germann
7fd87943ea
Reinstated pstats caching.
2014-03-24 13:29:05 +00:00
germann
2063da274d
Made member Tokenindex.ridx mutable.
2014-03-24 12:34:59 +00:00
germann
5ec4436b07
Initial check-in.
2014-03-24 12:33:17 +00:00
Ulrich Germann
1f32bd0d08
Fixed header inclusions.
2014-03-21 16:49:44 +00:00
Ulrich Germann
4685aeadd2
Bug fix.
2014-03-21 11:11:24 +00:00
Ulrich Germann
d5c96eff35
Minor edit of a comment.
2014-03-21 11:08:37 +00:00
Ulrich Germann
f030986f51
Bug fixes. Added some sanity checks.
2014-03-21 11:07:17 +00:00
Ulrich Germann
43e6c68586
Changed default method for PhraseScoreFwd. Added minor debugging message.
2014-03-19 20:54:49 +00:00
Ulrich Germann
7e344d1c53
1. Added variant of CreateRuleLookupManager variant as required by the new
...
pure virtual function in PhraseTable.h (from the master branch).
2. Bug fix in pool_stats.
2014-03-19 20:43:47 +00:00
Ulrich Germann
c91fb5cc84
Added mmlex-lookup utility program.
2014-03-19 20:41:29 +00:00
Ulrich Germann
febb2afc4d
Initial check-in.
2014-03-18 12:23:53 +00:00