Commit Graph

27 Commits

Author SHA1 Message Date
phikoehn
344b150372 bug fixes with escaping / truecasing interactions 2013-01-14 19:22:29 +00:00
Mark Fishel
01eb79cf6f fixed berkeley parser wrapper's broken output in case of in-text parentheses 2012-11-07 11:18:36 +01:00
Phil Williams
b2b9751227 parse-de-bitpar.perl: fix special char handling
Unescape special characters in input to BitPar and then re-escape
in output.
2012-10-06 16:27:33 +01:00
Hieu Hoang
debe090426 Change Bin to RealBin. Thanks to Tom Hoar 2012-06-26 11:57:23 -04:00
Hieu Hoang
0e621fe2d3 gzip grammar files 2012-06-08 00:43:47 +01:00
Hieu Hoang
82ca9fba69 delete example suffix array files 2012-06-07 19:40:10 +01:00
Hieu Hoang
03ce774d0e suffix array creation script also creates glue grammar 2012-06-07 19:37:11 +01:00
Hieu Hoang
0b59bee21c sa extraction works 2012-06-06 19:17:21 +01:00
Hieu Hoang
f0f4893a3d start integrating adam's suffix array 2012-06-01 16:18:34 +01:00
Hieu Hoang
86666c2a1f start integrating adam's suffix array 2012-06-01 16:06:37 +01:00
phikoehn
2e370ed11b more escaping in tokenizer; wrapper for berkeley parser (german) 2012-05-30 00:58:18 +01:00
Hieu Hoang
1008b7c27b Begin integrating Adam's suffix arrays into EMS 2012-05-14 19:44:53 +01:00
Hieu Hoang
93d0cc8313 Begin integrating Adam's suffix arrays into EMS 2012-05-14 18:56:10 +01:00
phikoehn
366d427ce6 minor fixes 2012-04-12 00:25:57 +01:00
phikoehn
791b5a7676 lotsa minor changes: mostly bug fixes, tokenizer now esacapes special Moses characters (|<>&) 2012-03-20 04:57:37 +00:00
Phil Williams
e70cafa1d7 The current version of BitPar's German grammar is encoded using UTF-8, so
disable the conversion to and from ISO-8859-1 in the parse-de-bitpar.perl
wrapper script.  For compatibility with older versions of BitPar, the old
behaviour is available using the --old-bitpar command-line option.
2011-11-08 15:05:02 +00:00
hieuhoang1972
57e3a92836 rollback. argument not supported by all iconv
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3712 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-15 12:50:11 +00:00
hieuhoang1972
ff339e56e3 don't drop unknown char. replace it with improbable string. avoid misalignment
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3709 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-14 20:50:15 +00:00
hieuhoang1972
f7904a871c add scripts to exclude unparseable sentences
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3704 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-12 14:43:52 +00:00
hieuhoang1972
687cf9bf29 add scripts to exclude unparseable sentences
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3702 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-12 14:20:11 +00:00
hieuhoang1972
a79a6bbaec add scripts to exclude unparseable sentences
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3700 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-11 18:04:16 +00:00
hieuhoang1972
f1f04daa0a add empty line if input is empty line
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3699 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-10 12:11:55 +00:00
phkoehn
85a5a13e4c improvements to web analysis, fixes to syntax wrappers
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3633 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-21 09:49:27 +00:00
hieuhoang1972
eedef63277 keep perl scripts with Unix line endings
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3612 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-11 11:32:27 +00:00
phkoehn
b271862d7c various updates, mostly related to experiment.perl
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3262 1f5c12ca-751b-0410-a591-d2e778427230
2010-05-18 17:39:16 +00:00
phkoehn
2ed6804f12 official release of experiment.perl
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3224 1f5c12ca-751b-0410-a591-d2e778427230
2010-05-04 23:04:10 +00:00
pjwilliams
2edfc16912 Merge remaining script support for tree-based models from mt3_chart.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3137 1f5c12ca-751b-0410-a591-d2e778427230
2010-04-16 09:45:51 +00:00