Commit Graph

133 Commits

Author SHA1 Message Date
Rico Sennrich
7138056b8f flexibility scores 2013-09-07 23:04:01 +02:00
Hieu Hoang
ac50d9f349 Merge /home/hieu/workspace/github/mosesdecoder into hieu_opt_input 2013-08-01 16:55:46 +01:00
Hieu Hoang
f234aa203f number recognizer treats each word as atomic, replace all of the word or nothing at all. Recognizer is designed to be run after the text has been tokenized, not before 2013-08-01 16:55:11 +01:00
Hieu Hoang
4bc7ce99ed add script to strip xml 2013-07-29 11:27:13 +01:00
Hieu Hoang
a3e3289b08 In corpus mode, replace number with number symbol 2013-07-25 15:54:47 +01:00
Hieu Hoang
78381d0213 @NUM@ --> @num@. In case using recaser 2013-07-25 15:16:15 +01:00
Hieu Hoang
6fc21a32fc Merge branch 'master' of github.com:moses-smt/mosesdecoder 2013-07-24 19:01:57 +01:00
Hieu Hoang
c104dee3b2 merge glue grammars, rather than writing them all to the same file. Required by Phil Williams & others when doing syntax extraction 2013-07-24 19:01:46 +01:00
Achim Ruopp
1813f9784b Additional factoring to allow more NE recognizers; bug fixes 2013-07-24 12:44:53 -04:00
unknown
54eb50523b Converted into modulino; added support for French numbers 2013-07-19 14:41:01 -04:00
Hieu Hoang
4e4cf1e313 script to replace numbers with placeholder. /Achim Ruopp 2013-07-01 23:00:59 +01:00
Wilker Aziz
f3cd72537c Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2013-06-24 15:39:18 +01:00
Wilker Aziz
b49e6a162f Wrapper to lmplz 2013-06-24 12:20:20 +01:00
Hieu Hoang
a85f819a53 superceded 2013-06-24 11:33:11 +01:00
Ondrej Bojar
93433cf015 support --translation-details OUTFILE in moses-parallel 2013-04-03 18:10:44 +02:00
Hieu Hoang
1b83b85f44 debug info from sort command 2013-03-18 16:48:40 +00:00
phikoehn
3a7f4f776a minor 2013-03-13 17:54:29 +00:00
phikoehn
d5cf38cab2 Merge branch 'master' of git://github.com/moses-smt/mosesdecoder 2013-01-14 19:23:02 +00:00
phikoehn
344b150372 bug fixes with escaping / truecasing interactions 2013-01-14 19:22:29 +00:00
Barry Haddow
459acf87b1 Add support for instance weights file 2013-01-04 14:55:24 +00:00
phikoehn
b275c94dbf allow for inclusion of extract from previous run 2012-12-12 07:02:59 +00:00
Hieu Hoang
487822ed14 don't write to stdout 2012-11-22 15:08:00 +00:00
phikoehn
5cd614ecd8 adjust to irstlm changes 2012-11-20 17:18:57 +00:00
Hieu Hoang
f3ec76ac56 minor change to calling irst training 2012-10-14 19:51:46 +01:00
phikoehn
04544f8bfc better error message when reference file not found 2012-10-04 23:22:19 +01:00
Lane Schwartz
7b042edc6c Send stderr to /dev/null when looking for pawd.
This cleans up the logs a bit for those of us who don't have pawd.
Otherwise, messages like the following show up in the logs:

/usr/bin/which: no pawd in ...
2012-09-28 14:55:09 -04:00
Lane Schwartz
a323c8daf7 Send stderr to /dev/null when looking for pawd.
This cleans up the logs a bit for those of us who don't have pawd.
Otherwise, messages like the following show up in the logs:

/usr/bin/which: no pawd in ...

bash: pawd: command not found
2012-09-28 14:37:53 -04:00
phikoehn
a84fbcb80a bug fix for using domain feature in multi-process extract 2012-09-24 11:14:36 +01:00
Hieu Hoang
30e5b0575b merge conflict 2012-09-03 19:12:00 +01:00
phikoehn
4a1a995878 a lot of changes 2012-08-18 23:48:26 +01:00
phikoehn
366ab93f8a a lot of changes 2012-08-18 23:47:05 +01:00
Hieu Hoang
a8117c0afd redirect more to stderr. Split input extract files every 50m lines, instead of 10m lines 2012-08-08 14:15:31 +01:00
Hieu Hoang
efdb983e0c redirect parallel scoring to standard err, appending to existing 2012-08-07 15:38:42 +01:00
Hieu Hoang
a1ab8e354a cleanup of variables. Need to delete temporary files 2012-07-31 02:21:48 +01:00
Hieu Hoang
23bbf12752 redirect stderr from extract program to stderr of script that runs it, for easier debugging 2012-07-18 22:53:47 +01:00
Hieu Hoang
f0a3e67717 redirect stderr from extract program to stderr of script that runs it, for easier debugging. Recursive create of output directory, in case output directory doesn't exist 2012-07-18 22:33:49 +01:00
Hieu Hoang
b609473645 under changes to make scripts handle directories with spaces. Karel gave up, and it messing up cruise control 2012-07-18 11:02:24 +01:00
Karel Bílek
bc64284b31 Merge remote-tracking branch 'upstream/master' 2012-07-17 04:37:16 +02:00
Karel Bílek
ad257b27d9 spaces in filepath does not wreak havoc anymore 2012-07-17 04:36:27 +02:00
Hieu Hoang
8180181a89 Merge https://github.com/moses-smt/mosesdecoder 2012-07-10 09:19:31 +01:00
Hieu Hoang
7d982bcd8e comments 2012-07-10 09:18:37 +01:00
phikoehn
51589a25e7 fix for leading, trailing, and multiple spaces, suggested by Yang Gao 2012-07-06 05:12:50 +01:00
Hieu Hoang
debe090426 Change Bin to RealBin. Thanks to Tom Hoar 2012-06-26 11:57:23 -04:00
phikoehn
7cd8356752 minor bug fixes 2012-06-25 23:38:46 +01:00
Barry Haddow
25f5def387 Fix for case where no ttable is to be created 2012-06-03 22:03:41 +01:00
phikoehn
6d1165654c script updates and added ems config help 2012-05-28 20:15:58 +01:00
Hieu Hoang
4c90c88733 make phrase-table.half naming consistent. Requested by Phil Williams 2012-05-26 12:21:32 +01:00
Hieu Hoang
e42f8d36ae parallel scoring 2012-05-24 15:20:23 +01:00
Hieu Hoang
d979b4454d parallel scoring 2012-05-24 15:05:23 +01:00
Hieu Hoang
f91b19fa41 Works with phrase-table scoring 2012-05-23 19:26:37 +01:00