Ulrich Germann
fbb4b59084
Added option to disable output buffering to split-sentences.perl.
2014-04-16 02:40:23 +01:00
Ulrich Germann
21f141ce0d
Merge branch 'master' into dynamic-phrase-tables
...
Conflicts:
moses/InputPath.cpp
moses/Word.cpp
2014-04-08 14:58:29 +01:00
Ulrich Germann
872facd171
Avoid errors in truecaser if input isn't factored and contains vertical bars.
2014-04-05 15:39:00 +01:00
Ulrich Germann
b1b9f6cfa2
Additions / changes to Moses server so that it can be used for Moses tuning.
2014-04-03 20:38:14 +01:00
Ulrich Germann
467b5d93e8
Additions / changes to mosesserver.cpp so that it can be used in parameter tuning.
2014-04-03 20:35:26 +01:00
Rico Sennrich
c8682e9420
target-syntax: use SoftMatchingFeature to assign non-terminal to unknown words
2014-03-24 14:57:24 +00:00
Phil Williams
cea86d6750
Transliteration support for syntax models.
2014-03-21 22:13:38 +00:00
Hieu Hoang
1c6061e781
bug when adding reordering model to ini file /jian zhang
2014-03-19 00:27:59 +00:00
Hieu Hoang
c501e5fab6
accidental error in perl script
2014-03-14 09:04:49 +00:00
Nadir Durrani
054a648713
Transliteration Script - Modifications
2014-03-13 13:10:38 +00:00
Ulrich Germann
a7c85780ee
Merge branch 'master' into dynamic-phrase-tables
...
Conflicts:
phrase-extract/score-main.cpp
2014-03-10 14:25:45 +00:00
Rico Sennrich
01bc3c111e
swap position of alignment and scores in phrase table halves (before consolidate step).
...
ensures that multiple hierarchical rules with same source/target phrase, but different alignment, are sorted correctly
2014-03-02 16:55:42 +00:00
Ulrich Germann
3f678fa689
Updated pretokenizer for French.
2014-02-25 03:10:58 +00:00
Ulrich Germann
e70766a6de
Preprocessor for tokenization that fixes known errors in the raw input.
2014-02-22 00:28:33 +00:00
Ulrich Germann
d3d14ccf3d
- added option -no-escape to skip escaping of special characters
...
- fixed bug in aggressive tokenization
(est-a-dire was split as est @-@ a-dire
instead of est @-@ a @-@ dire)
2014-02-21 14:14:03 +00:00
Ulrich Germann
ef2ef881a4
Merge branch 'dynamic-phrase-tables' of file:///home/germann/git/mosesdecoder into dynamic-phrase-tables
2014-02-21 01:04:02 +00:00
Hieu Hoang
641145a5c6
Merge pull request #59 from akholy/master
...
Update train-model.perl
2014-02-18 16:52:26 +00:00
Rico Sennrich
2ac7694f7e
copy weights of features with 'tuneable=false' to final config (now also works with -return-best-dev)
2014-02-13 14:19:40 +00:00
phikoehn
049be8b71c
Merge branch 'master' of ssh://github.com/moses-smt/mosesdecoder
2014-02-12 21:01:09 +00:00
phikoehn
d6b62db5b1
fix bug if interpolated lm on different factors
2014-02-12 21:00:55 +00:00
Ulrich Germann
a74d7e6095
Bug fix?
2014-02-08 17:54:48 +00:00
Rico Sennrich
ee06a0f652
don't complain if input contains non-escaped '<' or '>', but is not XML
2014-02-08 15:43:00 +00:00
Matthias Huck
65811a0325
tree fragments: tiny issues with the extraction pipeline
2014-02-03 18:13:10 +00:00
Rico Sennrich
d26fe4cc4d
fix truecaser with XML input (didn't do anything depending on formatting/whitespace)
2014-01-29 23:01:53 +00:00
Hieu Hoang
b6d47733da
remove duplicate spaces caused when XML are stripped
2014-01-24 18:09:55 +00:00
Hieu Hoang
dc3d5b8d38
source labelling for test set.
2014-01-24 16:33:30 +00:00
Hieu Hoang
878e7ab899
source labelling for tuning set. More debugging message in filtering script
2014-01-24 16:21:47 +00:00
Ahmed El Kholy
1ec14d1c6e
Update train-model.perl
...
When I was working with factored models, my experiments failed. After some invistigation, I discovered that the delimiter is hardcoded in several places to "|" so I changed it to $___FACTOR_DELIMITER
2014-01-23 22:38:44 -05:00
Hieu Hoang
6a10f8ce71
corrected phrase-table name / type mixup when creating filtering script
2014-01-23 17:09:56 +00:00
Hieu Hoang
05de672bd8
need to 'label' target side too
2014-01-21 19:21:24 +00:00
Hieu Hoang
27152ccce4
add source labeller to EMS
2014-01-20 23:26:06 +00:00
Hieu Hoang
4c009e31e8
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder into hieu
2014-01-20 17:08:02 +00:00
Rico Sennrich
5d2b4d8efc
sparse feature weights can now be set in moses.ini; overrides weights in weight-file.
2014-01-20 15:54:17 +00:00
Hieu Hoang
ade7400fc7
minor error
2014-01-20 15:42:17 +00:00
Rico Sennrich
043444b293
copy weights of untuneable features from original config in mert
2014-01-20 10:06:42 +00:00
Hieu Hoang
1632194a9e
syntax error
2014-01-17 17:08:34 +00:00
Hieu Hoang
0ff626a716
-extract-command for train-model.perl to override the extract program to be used
2014-01-17 15:43:51 +00:00
phikoehn
4e75911331
changed biconcor location in EMS example config files
2014-01-16 13:58:45 +00:00
Hieu Hoang
ebc724b3de
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2014-01-12 13:51:04 +00:00
Hieu Hoang
a975e3d32d
Add Exception as a keyword for detecting error in EMS step
2014-01-12 13:50:01 +00:00
Hieu Hoang
29d9afa767
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2014-01-10 21:18:01 +00:00
Hieu Hoang
bb649a185e
factor wrapper scripts from Philipp Koehn
2014-01-10 21:17:06 +00:00
phikoehn
25553079d9
bug fix with sparse feature handling depending on word alignment in compact phrase table
2014-01-10 18:34:47 +00:00
amittai
c82c1a4522
Delete mteval-v13a2.pl
...
Accidentally committed a symlink.
2014-01-08 03:58:57 -08:00
amittai
717c211ffa
Merge pull request #58 from amittai/amittai
...
new flag: --no-retokenization, overrides --international-tokenization (o...
2014-01-08 03:51:00 -08:00
amittai
9250c789d4
new flag: --no-retokenization, overrides --international-tokenization (or lack thereof)
2014-01-08 03:38:50 -08:00
phikoehn
9ea0f5dd0e
reporting on init (pretty slow for binary phrase table!) and bug fix in experiment.perl with setting filter settings
2014-01-05 22:39:47 +00:00
Hieu Hoang
eb5112f09e
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2014-01-05 14:12:53 +00:00
Hieu Hoang
a8cb0f7f13
Tamil tokenization /P.Arththika
2014-01-05 14:07:28 +00:00
phikoehn
c8b5cc4f0e
avoid warning;
2013-12-31 19:21:28 +00:00