Commit Graph

638 Commits

Author SHA1 Message Date
Ulrich Germann
9af3a61678 Added try-align2. 2014-09-08 09:26:08 +01:00
Ulrich Germann
a028fec7af Work in progress. 2014-09-08 09:26:08 +01:00
Michael Denkowski
3304030a4e Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2014-09-04 11:19:32 -04:00
Michael Denkowski
6c33bc99dc Option to add TM-specific word and phrase counts 2014-09-04 11:17:42 -04:00
Michael Denkowski
756bcf0f15 Option to add TM-specific word and phrase counts 2014-09-04 01:49:26 -04:00
Rico Sennrich
2a46e8ccea parse chart compression for faster CYK+ parsing with syntax systems. 2014-09-01 18:16:22 +01:00
Michael Denkowski
1c45d780d4 all-restrict mode for MultiModel (restrict to phrases in first model) 2014-08-26 13:43:23 -04:00
Hieu Hoang
97e5a30d3a compiles with clang on osx 2014-08-25 18:07:42 +01:00
Michael Denkowski
da0ed4df81 tunable=false option for mmsapt 2014-08-18 19:22:50 -04:00
Michael Denkowski
93e99be108 Mode to pass through "all" scores in MultiModel 2014-08-18 17:57:05 -04:00
Nicola Bertoldi
77e9e91b08 minor fixes 2014-08-18 19:13:51 +02:00
Hieu Hoang
00a338d576 clang only function 2014-08-14 16:44:20 +01:00
Hieu Hoang
303387f9ac compiles with clang on osx 2014-08-14 16:17:21 +01:00
Hieu Hoang
fcbd64b3ac eclipse 2014-08-14 14:04:25 +01:00
Hieu Hoang
2bbaf69409 Merge branch 'master' into bo-safe 2014-08-13 18:52:14 +01:00
Hieu Hoang
94c44c03d5 merge 2014-08-13 18:03:05 +01:00
Hieu Hoang
18c1c4a132 method rename 2014-08-08 18:11:30 +01:00
Hieu Hoang
efa5befb16 method rename 2014-08-08 15:59:34 +01:00
Ulrich Germann
95b04d2558 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2014-08-05 21:28:06 +01:00
Ulrich Germann
5480499309 Fixed (?) problem with multiple identical extractable target phrases per source phrase occurrence. 2014-08-05 21:26:29 +01:00
Michael Denkowski
13942b77ab Add alias PhraseDictionaryBitextSampling 2014-08-05 14:47:07 -04:00
Ulrich Germann
f32a313a05 Mmsapt now uses timespec on Linux, timeval on MacOS for time stamps. 2014-08-05 02:22:20 +01:00
Hieu Hoang
11471de9b8 mac osx 2014-08-04 18:50:10 +01:00
Ulrich Germann
c269abb083 Added num_read_write.cc to fakelib mm. 2014-08-04 17:52:08 +01:00
Ulrich Germann
9fad5d3eb0 Eliminated dependence on endian.h and related byte swapping on big-endian machines. 2014-08-04 17:52:08 +01:00
Hieu Hoang
3f29ed10f1 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2014-08-05 11:00:01 +01:00
Hieu Hoang
84d6b25802 TargetPhrase to have pointer to the phrase table that creates it 2014-08-05 10:59:48 +01:00
Hieu Hoang
f447a23067 TargetPhrase to have pointer to the phrase table that creates it 2014-08-05 10:26:42 +01:00
Hieu Hoang
e863592f40 TargetPhrase to have pointer to the phrase table that creates it 2014-08-04 19:28:04 +01:00
Hieu Hoang
abe68be588 initialise m_container 2014-08-04 15:59:32 +01:00
Hieu Hoang
3f3912772d initialise m_container 2014-08-04 15:46:40 +01:00
Hieu Hoang
5f90ccdb13 initialise m_container 2014-08-04 15:20:22 +01:00
Marcin Junczys-Dowmunt
5c9017c632 Forgot to add SetFeaturesToApply 2014-08-03 19:44:43 +02:00
Marcin Junczys-Dowmunt
ff6ed8cd21 Fixed segfault for features depending on factors not in phrase table (i.e. added by generation models) 2014-08-03 18:03:42 +02:00
Hieu Hoang
688bf4c061 each target phrase knows what decode graph created it 2014-08-02 17:15:01 +01:00
hieu
5741ef2635 compile error in gcc 4.4 2014-07-30 18:01:51 +01:00
Ulrich Germann
f9d167345a Changed feature and parameter names for Mmsapt / PhraseDictionaryBitextSampling as requested by PK. 2014-07-29 13:57:00 +01:00
Ulrich Germann
6a1beb770d Cleanup work to get rid of compiler warnings. 2014-07-29 13:51:44 +01:00
Nicola Bertoldi
1063012892 added a flag to disable the decaying in the cache 2014-07-22 11:25:03 +02:00
Nicola Bertoldi
02bf6d5d5e fixes for file loading and precomputation of ascores 2014-07-22 09:45:41 +02:00
Hieu Hoang
b10760f428 delete PhraseTableImplementation. Old enum 2014-07-18 20:36:53 +01:00
Hieu Hoang
1347b153ee compiles with c++11. Used by oxlm 2014-07-17 23:13:06 +01:00
Ulrich Germann
f06b145735 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2014-07-10 17:24:42 +01:00
Ulrich Germann
4d41211c2c Major overhaul of Mmsapt. Reorganization of old and addition of new features in phrase tables. Many critical bug fixes. 2014-07-09 02:41:28 +01:00
Ulrich Germann
e1c9405b23 Utilities to check gain in phrase coverage by dynamic augmentation of the phrase table in a post-editing scenario. 2014-07-09 02:39:28 +01:00
Rico Sennrich
bdf8d1a405 CreateFromString no longer requires factorDelimiter 2014-07-04 10:28:24 +01:00
Ulrich Germann
556e112366 Major bug fix in Mmsapt.combine_pstats. 2014-06-30 00:32:11 +01:00
Ulrich Germann
5c77f17833 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2014-06-27 01:12:22 +01:00
Ulrich Germann
9eaf506ace Renamed lookup_mmsapt.cc to ptable-lookup.cc. 2014-06-27 01:10:14 +01:00
Ulrich Germann
1e50ac587d Replaced TRACE_ERR(..) with VERBOSE(1,...) in PhraseDictionaryTree.cpp 2014-06-27 01:09:50 +01:00
Ulrich Germann
9dfb354041 Phrase probability smoothing can now be turned off in ug_bitext.cc by setting the confidence threshold to 0. 2014-06-27 01:07:23 +01:00
Ulrich Germann
40ab68b3d2 Adapted lookup_mmsapt to binary phrase tables; renamed the utility to ptable-lookup 2014-06-27 01:06:00 +01:00
Hieu Hoang
f2d3f57d07 Merge branch 'hieu' of ../mosesdecoder.hieu 2014-06-16 10:57:02 +01:00
Ulrich Germann
2ac93eca5c PScoreBwd can now scale backward marginals as specified by user. 2014-06-15 21:11:33 +01:00
Ulrich Germann
5ff14f9798 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2014-06-14 13:29:52 +01:00
Ulrich Germann
1a3d7d3266 Reorganization of phrase scorers in Mmsapt. 2014-06-14 13:03:31 +01:00
Ulrich Germann
c3db1a3a67 Better labeling of feature values in lookup_mmsapt. 2014-06-14 13:02:55 +01:00
Ulrich Germann
135ce20509 Smoothing parameter in LexicalPhraseScorer2 is now a configurable parameter. 2014-06-14 13:02:08 +01:00
Ulrich Germann
450c8c5b80 Moved phrase scorers to a separate file. 2014-06-14 12:58:33 +01:00
Hieu Hoang
eb78782c5d merge with master 2014-06-13 10:35:35 +01:00
Hieu Hoang
9e43830c6e bjam for ProbingPT 2014-06-13 10:20:30 +01:00
Hieu Hoang
2c6cdb5889 bjam for ProbingPT 2014-06-13 09:48:32 +01:00
Hieu Hoang
4ebcfc2142 bjam for ProbingPT 2014-06-12 18:38:04 +01:00
Ulrich Germann
ead3fc565e Initial check-in. 2014-06-12 18:30:41 +01:00
Hieu Hoang
ad4825ad44 integrated probing pt into decoder 2014-06-11 14:25:18 +01:00
Hieu Hoang
253f8e35fd integrated probing pt into decoder 2014-06-11 13:52:20 +01:00
XapaJIaMnu
095877742c Add ProbingPT files to Moses 2014-06-11 11:47:00 +01:00
Hieu Hoang
341969e61d Merge ../mosesdecoder into hieu 2014-06-10 18:04:55 +01:00
Ulrich Germann
106a66eb17 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2014-06-10 14:28:59 +01:00
Hieu Hoang
3c6a31128d Merge ../mosesdecoder into hieu 2014-06-08 17:07:41 +01:00
Hieu Hoang
d68257c34d use standard c++ getline instead of old Moses SAFE_GETLINE 2014-06-08 16:37:12 +01:00
Hieu Hoang
45ed0a5b1f Merge ../mosesdecoder into hieu 2014-06-08 13:22:34 +01:00
Nicola Bertoldi
38477ac89e made compliant with last changes in master 2014-06-08 11:57:32 +02:00
Nicola Bertoldi
1fe4eb0528 beautify 2014-06-08 09:44:59 +02:00
Nicola Bertoldi
4d75c889f1 merged master into dynamic-models 2014-06-08 09:39:37 +02:00
Matthias Huck
a19aaada06 formatting 2014-06-06 20:16:40 +01:00
Matthias Huck
2ec50239e7 rule-limit=0: unlimited 2014-06-06 20:05:59 +01:00
Hieu Hoang
ce2a69ba25 Merge ../mosesdecoder into hieu 2014-06-05 17:18:26 +01:00
Hieu Hoang
d29bf716a4 minor change to debugging output. Memory pt used by both pb and chart-based models 2014-06-05 17:15:08 +01:00
Ulrich Germann
5116f0072b Minor edits to ug_bitext.h. Added min_diverse to ug_bitext::job to ensure minimum number of translation alternatives before sampling stops. 2014-06-05 01:50:55 +01:00
Ulrich Germann
b92d599727 Bug fix in mmlex-lookup. 2014-06-05 01:48:11 +01:00
Ulrich Germann
2f109621bf Added configurable options and SetTableLimit to Mmsapt. 2014-06-05 01:47:29 +01:00
Ulrich Germann
a40fcbae02 Added utility lookup_mmsapt 2014-06-05 01:45:34 +01:00
Ulrich Germann
ce853731ae Added mmsapt lookup utility. 2014-06-05 01:38:55 +01:00
Hieu Hoang
8e2c4a89c0 Merge ../mosesdecoder into hieu 2014-06-04 12:53:06 +01:00
Ulrich Germann
8dee1725fb Removed Phrase penalty as a built-in feature function. 2014-06-03 21:36:04 +01:00
Ulrich Germann
9c27dc08e9 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
Conflicts:
	moses/FF/ConstrainedDecoding.cpp
2014-06-03 18:55:53 +01:00
Ulrich Germann
9cda39f802 Major bug fix in setting ug_bitext::jstats.my_cnt2. 2014-06-03 15:48:27 +01:00
Ulrich Germann
8c9a26a3be Added phrase feature function for log counts. 2014-06-03 15:45:29 +01:00
Ulrich Germann
0fc80660f3 TSA_tree_iterator::approxOccurrenceCount() now returns exact count for small values. 2014-06-03 15:44:20 +01:00
Ulrich Germann
a248349da3 Added more phrase table features and configuration options to Mmsapt. 2014-06-03 15:41:59 +01:00
Ulrich Germann
ce435d22bf Mmsapt now adds the phrase penalty only if num_features specified
on the respective line in moses.ini is odd.
2014-06-01 18:23:00 +01:00
Ulrich Germann
249c40ec74 Eliminated factor delimiter from the parameters of
Phrase.CreateFromString() and Sentence.CreateFromString(),
as it was never used in those functions anyway ---
Word.CreateFromString() retrieves the factor delimiter
from StaticData directly.
2014-05-31 14:33:31 +01:00
Ulrich Germann
c452875343 Mmsapt/ug_bitext:
- Sampling now uses a static seed for consistency between runs.
- (In order to have consistency guarantees, specify workers=1 in the
  Mmsapt line in moses.ini; otherwise the randomness of thread scheduling may
  cause inconsistencies between runs.)
- Denominator for fwd phrase probabilities can now be chosen via an
  external argument.
2014-05-31 14:29:44 +01:00
Ulrich Germann
380b2da2d0 Bug fix in and in connection with Mmsapt::combine_pstats(). 2014-05-19 21:34:58 +01:00
Nicola Bertoldi
10eed17d32 beautify 2014-05-19 15:35:45 +02:00
Nicola Bertoldi
2f3cd5e2fe beautify 2014-05-19 15:34:27 +02:00
Nicola Bertoldi
6611905cfb changes to allow the usage of multiple dynamic cache-based translation models simultaneously 2014-05-19 15:26:17 +02:00
Nicola Bertoldi
72afca907d added facility to store word-alignment in dynamic cache-based translation model 2014-05-17 22:37:22 +02:00
Hieu Hoang
35c812f5df Merge ../mosesdecoder into hieu 2014-05-13 18:24:17 +01:00