Commit Graph

1613 Commits

Author SHA1 Message Date
Ulrich Germann
fbb4b59084 Added option to disable output buffering to split-sentences.perl. 2014-04-16 02:40:23 +01:00
Ulrich Germann
21f141ce0d Merge branch 'master' into dynamic-phrase-tables
Conflicts:
	moses/InputPath.cpp
	moses/Word.cpp
2014-04-08 14:58:29 +01:00
Ulrich Germann
872facd171 Avoid errors in truecaser if input isn't factored and contains vertical bars. 2014-04-05 15:39:00 +01:00
Ulrich Germann
b1b9f6cfa2 Additions / changes to Moses server so that it can be used for Moses tuning. 2014-04-03 20:38:14 +01:00
Ulrich Germann
467b5d93e8 Additions / changes to mosesserver.cpp so that it can be used in parameter tuning. 2014-04-03 20:35:26 +01:00
Rico Sennrich
c8682e9420 target-syntax: use SoftMatchingFeature to assign non-terminal to unknown words 2014-03-24 14:57:24 +00:00
Phil Williams
cea86d6750 Transliteration support for syntax models. 2014-03-21 22:13:38 +00:00
Hieu Hoang
1c6061e781 bug when adding reordering model to ini file /jian zhang 2014-03-19 00:27:59 +00:00
Hieu Hoang
c501e5fab6 accidental error in perl script 2014-03-14 09:04:49 +00:00
Nadir Durrani
054a648713 Transliteration Script - Modifications 2014-03-13 13:10:38 +00:00
Ulrich Germann
a7c85780ee Merge branch 'master' into dynamic-phrase-tables
Conflicts:
	phrase-extract/score-main.cpp
2014-03-10 14:25:45 +00:00
Rico Sennrich
01bc3c111e swap position of alignment and scores in phrase table halves (before consolidate step).
ensures that multiple hierarchical rules with same source/target phrase, but different alignment, are sorted correctly
2014-03-02 16:55:42 +00:00
Ulrich Germann
3f678fa689 Updated pretokenizer for French. 2014-02-25 03:10:58 +00:00
Ulrich Germann
e70766a6de Preprocessor for tokenization that fixes known errors in the raw input. 2014-02-22 00:28:33 +00:00
Ulrich Germann
d3d14ccf3d - added option -no-escape to skip escaping of special characters
- fixed bug in aggressive tokenization
  (est-a-dire was split as est @-@ a-dire
              instead of   est @-@ a @-@ dire)
2014-02-21 14:14:03 +00:00
Ulrich Germann
ef2ef881a4 Merge branch 'dynamic-phrase-tables' of file:///home/germann/git/mosesdecoder into dynamic-phrase-tables 2014-02-21 01:04:02 +00:00
Hieu Hoang
641145a5c6 Merge pull request #59 from akholy/master
Update train-model.perl
2014-02-18 16:52:26 +00:00
Rico Sennrich
2ac7694f7e copy weights of features with 'tuneable=false' to final config (now also works with -return-best-dev) 2014-02-13 14:19:40 +00:00
phikoehn
049be8b71c Merge branch 'master' of ssh://github.com/moses-smt/mosesdecoder 2014-02-12 21:01:09 +00:00
phikoehn
d6b62db5b1 fix bug if interpolated lm on different factors 2014-02-12 21:00:55 +00:00
Ulrich Germann
a74d7e6095 Bug fix? 2014-02-08 17:54:48 +00:00
Rico Sennrich
ee06a0f652 don't complain if input contains non-escaped '<' or '>', but is not XML 2014-02-08 15:43:00 +00:00
Matthias Huck
65811a0325 tree fragments: tiny issues with the extraction pipeline 2014-02-03 18:13:10 +00:00
Rico Sennrich
d26fe4cc4d fix truecaser with XML input (didn't do anything depending on formatting/whitespace) 2014-01-29 23:01:53 +00:00
Hieu Hoang
b6d47733da remove duplicate spaces caused when XML are stripped 2014-01-24 18:09:55 +00:00
Hieu Hoang
dc3d5b8d38 source labelling for test set. 2014-01-24 16:33:30 +00:00
Hieu Hoang
878e7ab899 source labelling for tuning set. More debugging message in filtering script 2014-01-24 16:21:47 +00:00
Ahmed El Kholy
1ec14d1c6e Update train-model.perl
When I was working with factored models, my experiments failed. After some invistigation, I discovered that the delimiter is hardcoded in several places to "|" so I changed it to $___FACTOR_DELIMITER
2014-01-23 22:38:44 -05:00
Hieu Hoang
6a10f8ce71 corrected phrase-table name / type mixup when creating filtering script 2014-01-23 17:09:56 +00:00
Hieu Hoang
05de672bd8 need to 'label' target side too 2014-01-21 19:21:24 +00:00
Hieu Hoang
27152ccce4 add source labeller to EMS 2014-01-20 23:26:06 +00:00
Hieu Hoang
4c009e31e8 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder into hieu 2014-01-20 17:08:02 +00:00
Rico Sennrich
5d2b4d8efc sparse feature weights can now be set in moses.ini; overrides weights in weight-file. 2014-01-20 15:54:17 +00:00
Hieu Hoang
ade7400fc7 minor error 2014-01-20 15:42:17 +00:00
Rico Sennrich
043444b293 copy weights of untuneable features from original config in mert 2014-01-20 10:06:42 +00:00
Hieu Hoang
1632194a9e syntax error 2014-01-17 17:08:34 +00:00
Hieu Hoang
0ff626a716 -extract-command for train-model.perl to override the extract program to be used 2014-01-17 15:43:51 +00:00
phikoehn
4e75911331 changed biconcor location in EMS example config files 2014-01-16 13:58:45 +00:00
Hieu Hoang
ebc724b3de Merge branch 'master' of github.com:moses-smt/mosesdecoder 2014-01-12 13:51:04 +00:00
Hieu Hoang
a975e3d32d Add Exception as a keyword for detecting error in EMS step 2014-01-12 13:50:01 +00:00
Hieu Hoang
29d9afa767 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2014-01-10 21:18:01 +00:00
Hieu Hoang
bb649a185e factor wrapper scripts from Philipp Koehn 2014-01-10 21:17:06 +00:00
phikoehn
25553079d9 bug fix with sparse feature handling depending on word alignment in compact phrase table 2014-01-10 18:34:47 +00:00
amittai
c82c1a4522 Delete mteval-v13a2.pl
Accidentally committed a symlink.
2014-01-08 03:58:57 -08:00
amittai
717c211ffa Merge pull request #58 from amittai/amittai
new flag: --no-retokenization, overrides --international-tokenization (o...
2014-01-08 03:51:00 -08:00
amittai
9250c789d4 new flag: --no-retokenization, overrides --international-tokenization (or lack thereof) 2014-01-08 03:38:50 -08:00
phikoehn
9ea0f5dd0e reporting on init (pretty slow for binary phrase table!) and bug fix in experiment.perl with setting filter settings 2014-01-05 22:39:47 +00:00
Hieu Hoang
eb5112f09e Merge branch 'master' of github.com:moses-smt/mosesdecoder 2014-01-05 14:12:53 +00:00
Hieu Hoang
a8cb0f7f13 Tamil tokenization /P.Arththika 2014-01-05 14:07:28 +00:00
phikoehn
c8b5cc4f0e avoid warning; 2013-12-31 19:21:28 +00:00