Ulrich Germann
|
febb2afc4d
|
Initial check-in.
|
2014-03-18 12:23:53 +00:00 |
|
Ulrich Germann
|
f20220066b
|
Added choice of denominator for PhraseScoreFwd.
|
2014-03-18 12:23:31 +00:00 |
|
Ulrich Germann
|
a8eb6645c7
|
Bug fix. Added sanity check when adding data to dynamic suffix array.
|
2014-03-18 12:22:29 +00:00 |
|
Ulrich Germann
|
0562415ac0
|
Added program calc-coverage.
|
2014-03-18 12:21:12 +00:00 |
|
Ulrich Germann
|
e55dfa26b3
|
Added program calc-coverage.
|
2014-03-18 12:20:55 +00:00 |
|
Ulrich Germann
|
4aa88aaf2c
|
Bug fix in constructor.
|
2014-03-18 12:20:29 +00:00 |
|
Ulrich Germann
|
a11b79175b
|
Added function fill_token_seq.
|
2014-03-18 12:19:35 +00:00 |
|
Ulrich Germann
|
3f9cefe44e
|
Removed some debugging messages.
Moved fill_token_seq to tpt_tokenindex.h.
|
2014-03-18 12:18:05 +00:00 |
|
Ulrich Germann
|
e0f95fee06
|
Bug fixes in dynamic phrase tables.
|
2014-03-14 02:42:38 +00:00 |
|
Ulrich Germann
|
ce75b58f6f
|
Routine check-in.
|
2014-03-13 13:41:32 +00:00 |
|
Ulrich Germann
|
94657fd589
|
Work in progress.
|
2014-03-12 23:13:44 +00:00 |
|
Ulrich Germann
|
9025ac065f
|
Added utilities:
- mam2symal converts memory-mapped word alignments to symal format
- mam_verify performs a sanity check on memory-mapped word alignments
|
2014-03-12 08:06:55 +00:00 |
|
Ulrich Germann
|
c02fbf7664
|
Completely rewritten. Now multi-threaded.
|
2014-03-11 13:57:42 +00:00 |
|
Ulrich Germann
|
fdc504d47a
|
Changes on main branch files while I was working on dynamic phrase tables.
|
2014-03-10 14:08:00 +00:00 |
|
Ulrich Germann
|
aa8ba7d9a7
|
Put alignment functionality into a separate class. Not working yet --- work in progress!
|
2014-03-10 12:03:27 +00:00 |
|
Ulrich Germann
|
ff4ce426e7
|
Made scorer in PScoreLex public for development purposes. Reset default number of workers to 20.
|
2014-03-10 12:02:05 +00:00 |
|
Ulrich Germann
|
f7ee316e12
|
Added initialization of wlex21 and COOCraw during loading.
|
2014-03-10 11:59:58 +00:00 |
|
Ulrich Germann
|
aad5d67947
|
Added option to also count raw cooccurrences.
|
2014-03-10 11:58:46 +00:00 |
|
Ulrich Germann
|
9cf86f6191
|
Added class Alignment as a friend and wlex21 and COOCraw for development purposes while working on word alignment issues.
|
2014-03-10 11:57:40 +00:00 |
|
Ulrich Germann
|
9159729ad0
|
Made internal table COOC public for development purposes.
|
2014-03-10 11:56:22 +00:00 |
|
Ulrich Germann
|
81ed9937e1
|
Routine check-in.
|
2014-03-05 11:53:05 +00:00 |
|
Ulrich Germann
|
2b19b71095
|
Routine check-in.
|
2014-03-04 15:51:59 +00:00 |
|
Ulrich Germann
|
6c37b8d252
|
Routine check-in.
|
2014-03-03 12:13:41 +00:00 |
|
Ulrich Germann
|
2b181ee691
|
Fixed Mmsapt constructor.
|
2014-02-25 03:10:16 +00:00 |
|
Ulrich Germann
|
4c003edb0d
|
Fixed #include-s.
|
2014-02-25 03:09:19 +00:00 |
|
Ulrich Germann
|
a8d66cd68d
|
Removed Mmsapt constructor with both descriptor and config line.
|
2014-02-22 00:27:07 +00:00 |
|
Ulrich Germann
|
817e3695e0
|
Fixed some include paths.
|
2014-02-22 00:25:58 +00:00 |
|
Ulrich Germann
|
1252700c44
|
Removed constructor with both description and config line.
|
2014-02-22 00:25:02 +00:00 |
|
Ulrich Germann
|
4b95c3a906
|
Merge branch 'dynamic-phrase-tables' of ssh://thor//home/germann/git/mosesdecoder into dynamic-phrase-tables
due to resetting the location of the remote repository.
|
2014-02-21 01:09:38 +00:00 |
|
Ulrich Germann
|
ac238ef2d7
|
Changed construction from a given token sequence to allow partial matches.
|
2014-02-20 23:56:11 +00:00 |
|
Ulrich Germann
|
8afe62145b
|
Minor fix to make the compiler stop complaining about an unused typedef.
|
2014-02-20 23:54:15 +00:00 |
|
Ulrich Germann
|
e1d07e7475
|
Added pid2str conversion method to convert from phrase ids to the string.
|
2014-02-20 23:53:15 +00:00 |
|
Ulrich Germann
|
9536cf49e9
|
Phrase look-up now also gathers phrase orientation info (work in progress).
|
2014-02-20 23:51:17 +00:00 |
|
Ulrich Germann
|
6c66b9c631
|
Added Jamfile to produce try-align.
|
2014-02-20 23:50:07 +00:00 |
|
Ulrich Germann
|
683635ce25
|
Minor fix to make the compiler stop complaining about unused variables.
|
2014-02-20 23:48:56 +00:00 |
|
Ulrich Germann
|
061b861639
|
Small test program for phrase-based alignment via mmsapt.
|
2014-02-20 23:29:37 +00:00 |
|
Ulrich Germann
|
c259e10b23
|
Various changes.
|
2014-02-20 23:28:01 +00:00 |
|
Ulrich Germann
|
9bcc315644
|
Added phrase-based word alignment to mmsapt (work in progress!).
|
2014-02-20 23:25:36 +00:00 |
|
Hieu Hoang
|
50cadc754f
|
use boost::unordered_map for CacheColl. Marginally faster
|
2014-02-11 03:43:58 +00:00 |
|
Ulrich Germann
|
a6ce081e15
|
Minor changes.
|
2014-02-08 18:25:46 +00:00 |
|
Ulrich Germann
|
594272ce05
|
Changed function count_tokens so that it can be run without passing a filter explicitly.
|
2014-02-08 18:06:11 +00:00 |
|
Ulrich Germann
|
9899364c46
|
Added implicit add-1 smoothing.
|
2014-02-08 18:03:18 +00:00 |
|
Ulrich Germann
|
40fbe226e4
|
Added private members numSent and numWords.
|
2014-02-08 18:02:03 +00:00 |
|
Ulrich Germann
|
66822b279b
|
Added append function to grow imTtracks dynamically in a thread-safe fashion.
|
2014-02-08 18:00:27 +00:00 |
|
Ulrich Germann
|
9f317f4849
|
Minor fix.
|
2014-02-08 17:58:05 +00:00 |
|
Ulrich Germann
|
5f8ae20d01
|
Added dynamically updatable corpus; updated or added query functions.
|
2014-02-08 17:56:48 +00:00 |
|
Ulrich Germann
|
784654c831
|
Initial check-in.
|
2014-02-08 17:50:26 +00:00 |
|
Ulrich Germann
|
584626a767
|
Added a few programs.
|
2014-02-08 17:49:28 +00:00 |
|
Ulrich Germann
|
5c131f196c
|
Minor changes.
|
2014-02-08 17:22:57 +00:00 |
|
Ulrich Germann
|
4fb00ea6fd
|
Minor fixes.
|
2014-02-08 16:55:05 +00:00 |
|