Commit Graph

34 Commits

Author SHA1 Message Date
Ulrich Germann
1b4b3a5103 Mmsapt: btfix now instatiated via intrusive pointer
... to prevent deletion while Mmsapt is live.
2015-06-05 16:27:49 +01:00
Jeroen Vermeulen
eca5824100 Remove trailing whitespace in C++ files. 2015-04-30 12:05:11 +07:00
Ulrich Germann
e4f5c69109 One step closer to eliminating the requirement to provide num-features=... in the config file.
Some FF (Mmsapt, LexicalReordering, Many single-value FF) provide this number during "registration";
when missing, a default weight vector of uniform 1.0 is automatically generated. This eliminates the
need for the user to figure out what the exact number of features is for each FF, which can get complicated,
e.g. in the case of Mmsapt/PhraseDictionaryBitextSampling.
2015-04-29 20:16:52 +01:00
Ulrich Germann
2c0851099b Work on integrating hierarchical lexicalized reordering models with sampled phrase tables. 2015-04-21 17:48:48 +01:00
Ulrich Germann
46e31a285c - Code refactoring for Bitext class.
- Bug fixes and conceptual improvements in biased sampling. The sampling now
  tries to stick to the bias, even when an unsuitable corpus dominates
  the occurrences.
2015-04-05 14:29:00 +01:00
Ulrich Germann
93ce2423df 1. A context string for biased sampling in Mmsapt can now be provided on the
command line with --context-string. Not available in server mode yet.
2. Numerous bug fixes related to biased sampling.
3. Biased sampling now checks that the sampling sticks to the bias. If
   the distribution of samples deviates too much from the bias, samples
   whose selection would push the sample distribution even further from the bias
   are not considered, even if that means that fewer samples are chosen in total.
2015-04-03 16:16:52 +01:00
Ulrich Germann
fcbfc5a535 Feature functions and the constructors of TranslationOptionCollections
now have access to the current translation task.

This was done to allow context-sensitive processing (if provided by the FF).
2015-03-30 01:20:17 +01:00
Ulrich Germann
2a66a55c85 Added document map (maps from sentences to document ids) to Bitext class.
Minor overhaul to the bias regime, which allows to specify bias by document
name (as provided in the document map) rather than by sentence in the static
parallel corpus.
2015-03-15 13:32:09 +00:00
Ulrich Germann
07202c544c Added ptable-describe-features to list features used by PhraseDictionaryBitextSampling. 2014-10-25 12:06:38 -07:00
Ulrich Germann
1d834e2b48 Fixed bug with respect to adding check option to Mmsapt::Load(). 2014-09-10 18:51:20 +02:00
Ulrich Germann
a58c7ceb18 Fixed issues with ambiguity in typedef of uint64_t (conflict between boost typedef and stdint typedef). 2014-09-10 12:07:57 +02:00
Ulrich Germann
31578d4915 Finished code for bias loading from Mmsapt config file. 2014-09-09 18:07:26 +01:00
Ulrich Germann
cda94c7d85 Fix in biased sampling. Started code on loading and using bias in Mmsapt. 2014-09-09 17:45:48 +01:00
Ulrich Germann
f86fa65a6f Added utility count-ptable-features to count features in Mmsapt given a moses.ini config line. 2014-09-08 16:56:45 +01:00
Ulrich Germann
a86d49fc88 Added bias to bitext sampling. 2014-09-08 09:26:08 +01:00
Michael Denkowski
3304030a4e Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2014-09-04 11:19:32 -04:00
Michael Denkowski
6c33bc99dc Option to add TM-specific word and phrase counts 2014-09-04 11:17:42 -04:00
Michael Denkowski
756bcf0f15 Option to add TM-specific word and phrase counts 2014-09-04 01:49:26 -04:00
Ulrich Germann
f32a313a05 Mmsapt now uses timespec on linux, timeval om MacOS for time stamps. 2014-08-05 02:22:20 +01:00
Ulrich Germann
4d41211c2c Major overhaul of Mmsapt. Reorganization of old and addition of new features in phrase tables. Many critical bug fixes. 2014-07-09 02:41:28 +01:00
Ulrich Germann
2ac93eca5c PScoreBwd can now scales backward marginals according as specified by user. 2014-06-15 21:11:33 +01:00
Ulrich Germann
1a3d7d3266 Reorganization of phrase scorers in Mmsapt. 2014-06-14 13:03:31 +01:00
Ulrich Germann
2f109621bf Added configurable options and SetTableLimit to Mmsapt. 2014-06-05 01:47:29 +01:00
Ulrich Germann
8dee1725fb Removed Phrase penalty as a built-in feature function. 2014-06-03 21:36:04 +01:00
Ulrich Germann
a248349da3 Added more phrase table features and configuration options to Mmsapt. 2014-06-03 15:41:59 +01:00
Ulrich Germann
f63606a92b 1. Renamed PhraseDictionary member function release() to Release().
2. Added option to check phrase prefixes for early stopping of
   InputPath generation when translating confusion networks.
2014-03-25 01:49:24 +00:00
Ulrich Germann
794defa238 Numerous bug fixes; implemented caching and improved life cycle
management for TargetPhraseCollections.
2014-03-24 13:30:22 +00:00
Ulrich Germann
7e344d1c53 1. Added variant of CreateRuleLookupManager variant as required by the new
pure virtual function in PhraseTable.h (from the master branch).

2. Bug fix in pool_stats.
2014-03-19 20:43:47 +00:00
Ulrich Germann
94657fd589 Work in progress. 2014-03-12 23:13:44 +00:00
Ulrich Germann
9cf86f6191 Added class Alignment as a friend and wlex21 and COOCraw for development purposes while working on word alignment issues. 2014-03-10 11:57:40 +00:00
Ulrich Germann
1252700c44 Removed constructor with both description and config line. 2014-02-22 00:25:02 +00:00
Ulrich Germann
9bcc315644 Added phrase-based word alignment to mmsapt (work in progress!). 2014-02-20 23:25:36 +00:00
Ulrich Germann
5f8ae20d01 Added dynamicly updatable corpus; updated or added query functions. 2014-02-08 17:56:48 +00:00
Hieu Hoang
4285a6c7df move Uli's new pt into it's own folder. Comment out bjam compile. Doesn't work on MacOSX yet 2013-09-26 10:10:12 +01:00