1190 Commits

Author SHA1 Message Date
hieuhoang1972
93593b891d make naming of hypo stacks classes consistent
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1282 1f5c12ca-751b-0410-a591-d2e778427230
2007-03-07 15:37:38 +00:00
hieuhoang1972
025f2f3e03 gcc compile error
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1276 1f5c12ca-751b-0410-a591-d2e778427230
2007-03-06 15:57:30 +00:00
hieuhoang1972
86e6d99d76 add comments and Reset()
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1275 1f5c12ca-751b-0410-a591-d2e778427230
2007-03-06 12:15:51 +00:00
hieuhoang1972
075681d6fc visual studio proj file
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1267 1f5c12ca-751b-0410-a591-d2e778427230
2007-03-04 09:01:50 +00:00
hieuhoang1972
5f22fb13d3 make output of decimal places consistent by not formatting anywhere but in Main.cpp
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1266 1f5c12ca-751b-0410-a591-d2e778427230
2007-03-04 00:43:45 +00:00
hieuhoang1972
2c9f1a13fe make output of decimal places consistent by not formatting anywhere but in Main.cpp
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1265 1f5c12ca-751b-0410-a591-d2e778427230
2007-03-04 00:17:54 +00:00
hieuhoang1972
c40d07d0a1 code cleanup
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1257 1f5c12ca-751b-0410-a591-d2e778427230
2007-03-02 13:37:12 +00:00
jdschroeder
19129713a2 added filter-and-binarize training script to released-files
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1256 1f5c12ca-751b-0410-a591-d2e778427230
2007-03-01 14:44:46 +00:00
maurocettolo
5439a7796d Fixed a minor bug in mert-moses.pl regarding sanity checks for specified lambda triples
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1254 1f5c12ca-751b-0410-a591-d2e778427230
2007-03-01 13:02:22 +00:00
hieuhoang1972
e6b7866f4a get rid of warning message in srilm class
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1251 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-28 11:59:32 +00:00
phkoehn
f1d2bd0eb5 added option -include-alignment-in-n-best to include the word alignment for each sentence in the n-best list file
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1246 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-26 20:59:41 +00:00
hieuhoang1972
3413bf7046 visual studio output paths
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1245 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-26 16:49:32 +00:00
jorcisai
ad28cee802 distortion filename was incorrectly written into moses.ini file in step 9
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1243 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-26 14:36:53 +00:00
phkoehn
a89acb34ae minor bug fix to recaser training
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1242 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-26 12:19:06 +00:00
hieuhoang1972
62b4741de0 move calling InitializeBeforeSentenceProcessing() & CleanUpAfterSentenceProcessing() in Manager class
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1241 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-24 20:32:32 +00:00
jorcisai
b361817067 In old_sge mode: sync script name is now prefixed by ${jobscript} to be able to run several moses_parallel.pl in parallel. Also a new function check_translation_old_sge was added, this function is derived from the former check_translation function
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1239 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-23 18:08:47 +00:00
jorcisai
d5b4565f23 language model parser for --lm option is now again able to parse $type, but it is backward compatible
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1238 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-23 17:32:51 +00:00
jorcisai
c69bd4079b reordering model was left in the local directory instead of model directory
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1236 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-23 12:27:50 +00:00
jorcisai
872f2d3612 Trying to parse $type in --lm option, but not available. So we just need to parse three tokens.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1235 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-23 11:49:54 +00:00
hieuhoang1972
bef38f4006 code cleanup
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1234 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-23 00:00:09 +00:00
hieuhoang1972
a1072b9a7a more verbose=0
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1233 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-22 23:54:59 +00:00
hieuhoang1972
c58393a4b4 verbose=0 nothing goes to stderr except for real, aborting errors
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1232 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-22 23:44:38 +00:00
phkoehn
6c5cb3a6ec changes to fit with edinburgh setup, added switch -generation-type: "single" only produces one probability, not both
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1231 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-22 19:37:11 +00:00
hieuhoang1972
8048aefeb0 fixed mem leak
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1230 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-22 12:04:53 +00:00
hieuhoang1972
6b611279d5 minor gcc compile error.
also, no longer use IRSTLM as a substitute for SRILM, and vice versa. They don't give identical results, so this avoids confusion.

git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1229 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-21 20:05:27 +00:00
hieuhoang1972
b62dda41ed change unknown word processing to be closer to the way pharaoh does it - create unknown word whenever single word is not in translation table but penalise hypothesis for using it.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1228 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-21 19:51:17 +00:00
hieuhoang1972
7ecb0ce66e change unknown word processing to be closer to the way pharaoh does it - create unknown word whenever single word is not in translation table but penalise hypothesis for using it.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1227 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-21 19:48:53 +00:00
jdschroeder
9576345394 added recaser scripts to released-files
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1226 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-21 15:47:13 +00:00
hieuhoang1972
7e0261b901 hack to fix hypo collection where all hypo scores are -inf. need to rethink pruning or creation of trans opt for unknown word
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1225 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-20 11:08:55 +00:00
phkoehn
9f227aa26b minor bug fix with config file
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1224 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-19 18:35:51 +00:00
hieuhoang1972
53578eda97 minor gcc compile error
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1219 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-16 18:15:06 +00:00
hieuhoang1972
f3cbacba3e code cleanup - make FactorCollection and StaticData totally accessible only globally
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1218 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-16 18:08:37 +00:00
lexi_birch
59c4ba9f4d Merging from branch.
From now on, can have multiple decoder step lists to accommodate backoff
Specify this as an extra parameter in the [mapping] option in the ini file
This is backwards compatible.
Before (and still accepted):
[mapping]
T 0

Now you can have:
[mapping]
0 T 0
1 T 1
1 G 0

Imagine for instance the translation table 0 is words - words, 
and the table 1 is stems - stems, and the generation table 0
is stems - words. This will allow us to backoff to stems if
words are not found.

It is not really backoff because all the options from both
decoder step lists get included into the translation option collection,
which is then used to create the hypotheses.
The different paths must have their weights carefully balanced.
MERT might not be enough to discover the best weights for all the
combined parameters. 




git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1217 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-16 15:56:44 +00:00
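The [mapping] format described in commit 59c4ba9f4d can be illustrated with a minimal Python sketch. It is not the decoder's actual parser: the function name parse_mapping, the choice to place old two-token lines in step list 0, and the reading of T and G as translation and generation steps are assumptions made for illustration.

```python
from collections import defaultdict


def parse_mapping(lines):
    """Group [mapping] entries by decoder step list index (hypothetical helper).

    Old format: "T 0"    -> two tokens, assumed here to belong to list 0.
    New format: "0 T 0"  -> three tokens, the first is the step list index.
    """
    step_lists = defaultdict(list)
    for raw in lines:
        tokens = raw.split()
        if not tokens:
            continue  # skip blank lines
        if len(tokens) == 2:
            list_index = "0"  # assumption: old-style lines go to list 0
            step_type, table_index = tokens
        elif len(tokens) == 3:
            list_index, step_type, table_index = tokens
        else:
            raise ValueError(f"unexpected [mapping] line: {raw!r}")
        if step_type not in ("T", "G"):
            raise ValueError(f"unknown step type: {step_type!r}")
        step_lists[int(list_index)].append((step_type, int(table_index)))
    return dict(step_lists)


old_style = parse_mapping(["T 0"])
new_style = parse_mapping(["0 T 0", "1 T 1", "1 G 0"])
```

For the example in the commit message, the three-token lines yield one step list that uses translation table 0 and a second list that chains translation table 1 with generation table 0.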
bojar
2f4c70b4ae Die if aclocal, autoconf or automake fail
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1214 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-15 01:50:26 +00:00
bojar
6eacf476f0 Die if aclocal, autoconf or automake fail.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1213 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-15 01:49:37 +00:00
hieuhoang1972
4237cba9c3 check in eclipse proj to make bin table
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1212 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-14 20:32:57 +00:00
phkoehn
14839768c8 a large number of changes. besides little tweaks:
* training script now has proper default behaviour for single-factor models, 
* mert script has better handling of default lambda parameters that now
  works with lexicalized reordering models, and also with multiple 
  models files (e.g. multiple language models)
* parallel mert script is more robust when single jobs fail: detects it
  and resubmits the crashed (or killed) jobs
* recaser added that builds on moses
* filtering script added that also binarizes filtered model files
  (this will be eventually replaced when the lexicalized reordering
  model also uses the binary format)


git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1210 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-13 19:22:35 +00:00
hieuhoang1972
e247f1da6f fixed regression test failing. Number of features for generation models MUST be specified in ini file, no backward compatibility hack
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1209 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-13 19:15:34 +00:00
hieuhoang1972
6b4dfc4db2 added #def to use hypo pool
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1206 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-12 11:05:13 +00:00
hieuhoang1972
4a30043757 remove irstlm vs project
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1204 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-12 10:21:35 +00:00
hieuhoang1972
ced1a06fff minor fn name change
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1203 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-10 19:40:36 +00:00
hieuhoang1972
2b9fc4b5cc minor fn name change
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1202 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-10 19:31:43 +00:00
hieuhoang1972
79b01784af minor tweak
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1201 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-10 19:22:40 +00:00
hieuhoang1972
1b2f95ad6a create eclipse project for processing bin phrase table
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1200 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-09 19:32:53 +00:00
hieuhoang1972
006e2724e0 take out irstlm on VS build
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1195 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-09 17:36:07 +00:00
maurocettolo
7c7ee97f14 Minor revisions on consistency checks of IRSTLM package
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1190 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-09 13:44:46 +00:00
maurocettolo
075c88cd14 - update of irstlm library: files larger than 4Gb can be handled in
mmap (see irstlm/src/*cpp and irstlm/src/*h)
- fixed a bug in querying IRST LMs with OOVs (LanguageModelIRST.cpp)
- some more checks on config file: if specified, existence of generation
  and distortion files is checked (Parameter.cpp)
- 0 valued entries in binary phrase tables are loaded as 1.0e-38
  (PhraseDictionaryTree.cpp)



git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1189 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-08 13:48:43 +00:00
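The last item of commit 075c88cd14 (zero-valued entries loaded as 1.0e-38) can be sketched as follows. This is an illustration only, not the code in PhraseDictionaryTree.cpp: the names FLOOR and load_score are hypothetical, and the remark about logarithms is one plausible motivation that the commit message itself does not state.

```python
import math

FLOOR = 1.0e-38  # replacement value named in the commit message


def load_score(value):
    """Return the stored value, replacing an exact zero with FLOOR (hypothetical helper)."""
    return FLOOR if value == 0.0 else value


# With the floor in place, a later logarithm stays finite instead of failing on zero.
log_score = math.log(load_score(0.0))
```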
phkoehn
de9a5e96dd look for gzipped generation file if basefile does not exist;
this should be done for all model files (lm, phrase table, reordering table)


git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1183 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-07 19:02:42 +00:00
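The fallback described in commit de9a5e96dd might look like the sketch below. It assumes the gzipped file is named by appending ".gz" to the base file name, which the commit message does not spell out; the function name resolve_model_file is likewise hypothetical.

```python
import os


def resolve_model_file(base_path):
    """Return base_path if it exists, otherwise its gzipped sibling (hypothetical helper)."""
    if os.path.exists(base_path):
        return base_path
    gz_path = base_path + ".gz"  # assumption: gzipped copy uses a ".gz" suffix
    if os.path.exists(gz_path):
        return gz_path
    raise FileNotFoundError(f"neither {base_path} nor {gz_path} exists")
```

The same lookup could be reused for the other model files the message lists (lm, phrase table, reordering table).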
hieuhoang1972
3d7da64118 delete old kdevelop templates folder
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1182 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-06 22:39:36 +00:00
hieuhoang1972
6d217c5dda tweaked function to add hypo to stack
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1180 1f5c12ca-751b-0410-a591-d2e778427230
2007-02-06 22:20:41 +00:00