Ulrich Germann
1b4b3a5103
Mmsapt: btfix now instatiated via intrusive pointer
...
... to prevent deletion while Mmsapt is live.
2015-06-05 16:27:49 +01:00
Jeroen Vermeulen
eca5824100
Remove trailing whitespace in C++ files.
2015-04-30 12:05:11 +07:00
Ulrich Germann
e4f5c69109
One step closer to eliminating the requirement to provide num-features=... in the config file.
...
Some FF (Mmsapt, LexicalReordering, Many single-value FF) provide this number during "registration";
when missing, a default weight vector of uniform 1.0 is automatically generated. This eliminates the
need for the user to figure out what the exact number of features is for each FF, which can get complicated,
e.g. in the case of Mmsapt/PhraseDictionaryBitextSampling.
2015-04-29 20:16:52 +01:00
Ulrich Germann
2c0851099b
Work on integrating hierarchical lexicalized reordering models with sampled phrase tables.
2015-04-21 17:48:48 +01:00
Ulrich Germann
46e31a285c
- Code refactoring for Bitext class.
...
- Bug fixes and conceptual improvements in biased sampling. The sampling now
tries to stick to the bias, even when an unsuitable corpus dominates
the occurrences.
2015-04-05 14:29:00 +01:00
Ulrich Germann
93ce2423df
1. A context string for biased sampling in Mmsapt can now be provided on the
...
command line with --context-string. Not available in server mode yet.
2. Numerous bug fixes related to biased sampling.
3. Biased sampling now checks that the sampling sticks to the bias. If
the distribution of samples deviates too much from the bias, samples
whose selection would push the sample distribution even further from the bias
are not considered, even if that means that fewer samples are chosen in total.
2015-04-03 16:16:52 +01:00
Ulrich Germann
fcbfc5a535
Feature functions and the constructors of TranslationOptionCollections
...
now have access to the current translation task.
This was done to allow context-sensitive processing (if provided by the FF).
2015-03-30 01:20:17 +01:00
Ulrich Germann
2a66a55c85
Added document map (maps from sentences to document ids) to Bitext class.
...
Minor overhaul to the bias regime, which allows to specify bias by document
name (as provided in the document map) rather than by sentence in the static
parallel corpus.
2015-03-15 13:32:09 +00:00
Ulrich Germann
07202c544c
Added ptable-describe-features to list features used by PhraseDictionaryBitextSampling.
2014-10-25 12:06:38 -07:00
Ulrich Germann
1d834e2b48
Fixed bug with respect to adding check option to Mmsapt::Load().
2014-09-10 18:51:20 +02:00
Ulrich Germann
a58c7ceb18
Fixed issues with ambiguity in typedef of uint64_t (conflict between boost typedef and stdint typedef).
2014-09-10 12:07:57 +02:00
Ulrich Germann
31578d4915
Finished code for bias loading from Mmsapt config file.
2014-09-09 18:07:26 +01:00
Ulrich Germann
cda94c7d85
Fix in biased sampling. Started code on loading and using bias in Mmsapt.
2014-09-09 17:45:48 +01:00
Ulrich Germann
f86fa65a6f
Added utility count-ptable-features to count features in Mmsapt given a moses.ini config line.
2014-09-08 16:56:45 +01:00
Ulrich Germann
a86d49fc88
Added bias to bitext sampling.
2014-09-08 09:26:08 +01:00
Michael Denkowski
3304030a4e
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2014-09-04 11:19:32 -04:00
Michael Denkowski
6c33bc99dc
Option to add TM-specific word and phrase counts
2014-09-04 11:17:42 -04:00
Michael Denkowski
756bcf0f15
Option to add TM-specific word and phrase counts
2014-09-04 01:49:26 -04:00
Ulrich Germann
f32a313a05
Mmsapt now uses timespec on linux, timeval om MacOS for time stamps.
2014-08-05 02:22:20 +01:00
Ulrich Germann
4d41211c2c
Major overhaul of Mmsapt. Reorganization of old and addition of new features in phrase tables. Many critical bug fixes.
2014-07-09 02:41:28 +01:00
Ulrich Germann
2ac93eca5c
PScoreBwd can now scales backward marginals according as specified by user.
2014-06-15 21:11:33 +01:00
Ulrich Germann
1a3d7d3266
Reorganization of phrase scorers in Mmsapt.
2014-06-14 13:03:31 +01:00
Ulrich Germann
2f109621bf
Added configurable options and SetTableLimit to Mmsapt.
2014-06-05 01:47:29 +01:00
Ulrich Germann
8dee1725fb
Removed Phrase penalty as a built-in feature function.
2014-06-03 21:36:04 +01:00
Ulrich Germann
a248349da3
Added more phrase table features and configuration options to Mmsapt.
2014-06-03 15:41:59 +01:00
Ulrich Germann
f63606a92b
1. Renamed PhraseDictionary member function release() to Release().
...
2. Added option to check phrase prefixes for early stopping of
InputPath generation when translating confusion networks.
2014-03-25 01:49:24 +00:00
Ulrich Germann
794defa238
Numerous bug fixes; implemented caching and improved life cycle
...
management for TargetPhraseCollections.
2014-03-24 13:30:22 +00:00
Ulrich Germann
7e344d1c53
1. Added variant of CreateRuleLookupManager variant as required by the new
...
pure virtual function in PhraseTable.h (from the master branch).
2. Bug fix in pool_stats.
2014-03-19 20:43:47 +00:00
Ulrich Germann
94657fd589
Work in progress.
2014-03-12 23:13:44 +00:00
Ulrich Germann
9cf86f6191
Added class Alignment as a friend and wlex21 and COOCraw for development purposes while working on word alignment issues.
2014-03-10 11:57:40 +00:00
Ulrich Germann
1252700c44
Removed constructor with both description and config line.
2014-02-22 00:25:02 +00:00
Ulrich Germann
9bcc315644
Added phrase-based word alignment to mmsapt (work in progress!).
2014-02-20 23:25:36 +00:00
Ulrich Germann
5f8ae20d01
Added dynamicly updatable corpus; updated or added query functions.
2014-02-08 17:56:48 +00:00
Hieu Hoang
4285a6c7df
move Uli's new pt into it's own folder. Comment out bjam compile. Doesn't work on MacOSX yet
2013-09-26 10:10:12 +01:00