9608 Commits

Author SHA1 Message Date
Tetsuo Kiso
6c04c4ad9c Add more tests to the Data class. 2012-12-07 02:46:59 +09:00
Tetsuo Kiso
c7f6e38326 Use FilePiece to load N-best lists.
Since FilePiece is friendly with StringPiece.
2012-12-07 02:39:02 +09:00
Tetsuo Kiso
38e145e556 Use util::TokenIter to tokenize n-best lists.
Reduce creating std::string objects, too. In both ScoreArray
and FeatureArray classes, the private members to track sentence
indices (namely, "m_index") were unnecessarily declared as
std::string, but it's better to directly declare them as 'int'.
2012-12-07 01:39:22 +09:00
Tetsuo Kiso
cd3fb3b831 Untabify. 2012-12-06 23:46:22 +09:00
Tetsuo Kiso
ac045a11c1 Speed up N-gram counts when running extractor.
By replacing std::map with boost::unordered_map.

Runtime of extractor on 100-best lists of 2679 sentences:

Before:
real    0m35.314s
user    0m34.030s
sys     0m1.280s

After:
real    0m26.729s
user    0m25.420s
sys     0m1.310s
2012-12-06 22:08:33 +09:00
Hieu Hoang
55f65c3104 race condition in chart decoding with -T arg 2012-12-03 14:57:33 +00:00
phikoehn
ab2effb6fe train MML in-/out-of-domain language models with same vocabulary 2012-12-01 13:46:59 +00:00
phikoehn
269883fedd Merge branch 'master' of git://github.com/moses-smt/mosesdecoder 2012-12-01 13:45:00 +00:00
phikoehn
0c5d000192 my change to weight-wt 2012-12-01 13:44:57 +00:00
Marcin Junczys-Dowmunt
205cea8644 Allow .minlexr suffix and bugfix 2012-12-01 00:38:20 +01:00
Eva Hasler
650d31fe73 don't need to specify weight-wt 2012-11-30 18:04:50 +00:00
Hieu Hoang
a07f71d095 race condition on letter sed cache. Requires locking 2012-11-30 17:15:32 +00:00
Hieu Hoang
7abb3c878a remove locking. Make wordIndex variable local 2012-11-30 13:50:59 +00:00
Hieu Hoang
5fd9cbb529 delete reference to numpy. Doesn't need it 2012-11-30 10:28:51 +00:00
Hieu Hoang
017bbe78e8 forgotten misc programs for Compact pt 2012-11-30 09:49:36 +00:00
phikoehn
338b7656a6 ooops 2012-11-30 07:36:59 +00:00
phikoehn
84cb04c05a fixes and extensions to modified Moore-Lewis filtering, now works with domain features 2012-11-30 07:28:31 +00:00
phikoehn
1f7ee0e6c5 change of settings for sigtest filtering 2012-11-29 23:44:10 +00:00
Hieu Hoang
d4ead15066 fuzzy match phrase-table is multi-threaded 2012-11-29 15:27:38 +00:00
Hieu Hoang
9aad7c65c9 move CompactPt to TranslationModel/ 2012-11-27 18:04:01 +00:00
Hieu Hoang
152064086f Merge branch 'master' of github.com:moses-smt/mosesdecoder 2012-11-27 17:33:42 +00:00
Hieu Hoang
b317ac1a34 compile error on misc programs 2012-11-27 17:33:04 +00:00
Hieu Hoang
bc1e96730d move CKY+Parser to TranslationModel/ 2012-11-27 17:23:31 +00:00
Hieu Hoang
ae8a48b022 move Score3Parser to TranslationModel/ 2012-11-27 17:09:23 +00:00
Hieu Hoang
1aae9aa23c move RuleTable to TranslationModel/ 2012-11-27 16:57:23 +00:00
Hieu Hoang
6bf2870f18 move the rest of DynSA to TranslationModel/ 2012-11-27 16:31:42 +00:00
Hieu Hoang
4d8e4ae6d8 move DynSAInclude to TranslationModel/ 2012-11-27 16:16:30 +00:00
Barry Haddow
f0e12912e7 mml-score.py. Support for combining with domain features. 2012-11-27 15:58:55 +00:00
Hieu Hoang
75108c0aaf minor debug messages 2012-11-27 15:39:08 +00:00
Hieu Hoang
0b54d32038 move fuzzy-match to TranslationModel/ 2012-11-27 15:36:24 +00:00
Hieu Hoang
59449f2925 make TranslationModel subdirectory and move files from moses/ into it 2012-11-27 15:08:31 +00:00
Hieu Hoang
a4cbaa51af multithreaded fuzzy match 2012-11-27 13:17:44 +00:00
Hieu Hoang
b30b2ffd03 compile error in fuzzy match 2012-11-27 12:29:36 +00:00
Hieu Hoang
9e08ef8c0d multithreaded fuzzy match 2012-11-27 10:42:28 +00:00
Hieu Hoang
ff1b210fe8 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2012-11-26 23:10:40 +00:00
Hieu Hoang
ea2ad978ef eclipse files 2012-11-26 23:10:23 +00:00
phikoehn
51afd322a1 Merge branch 'master' of git://github.com/moses-smt/mosesdecoder 2012-11-24 20:13:30 +00:00
phikoehn
b5d08745a5 extensions to modified moore-lewis filtering, bug fixes 2012-11-24 20:13:14 +00:00
marcinj
bb76c156cf Added another consistency check for false positives 2012-11-24 17:58:07 +01:00
phikoehn
ea610a0558 added modified-moore-lewis from Barry Haddow into EMS 2012-11-24 12:43:13 +00:00
Hieu Hoang
7b36e11276 make fuzzy match pt threadable by marking it as non-threadable! TODO - do it properly 2012-11-23 18:52:06 +00:00
Hieu Hoang
cb95548871 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2012-11-23 17:38:57 +00:00
phikoehn
d4cebb008a added ems support for sigtest-filter 2012-11-23 17:35:13 +00:00
Hieu Hoang
65457aa1ac load source corpus along with suffix array. Use vocab from SA exclusively 2012-11-23 17:33:00 +00:00
Hieu Hoang
d0cf8f47db order of lexical probability has flipped 2012-11-22 17:37:36 +00:00
Hieu Hoang
487822ed14 don't write to stdout 2012-11-22 15:08:00 +00:00
Hieu Hoang
b2997025f0 add test misc perl file 2012-11-21 16:41:05 +00:00
phikoehn
c2a96fcc33 adjust to irstlm changes 2012-11-20 17:19:17 +00:00
phikoehn
5cd614ecd8 adjust to irstlm changes 2012-11-20 17:18:57 +00:00
Hieu Hoang
7d6d91a2e8 move zmert to contrib folder 2012-11-20 16:42:49 +00:00