Commit Graph

443 Commits

Author SHA1 Message Date
Philipp Koehn
dd9a59499f progress on deleting steps and runs 2014-05-21 11:16:40 -04:00
Philipp Koehn
aac51cec89 ems: delete a run. may work. 2014-05-16 16:57:34 -04:00
Your Name
93d2d19c3e delete crashed steps 2014-05-08 16:42:11 -04:00
Ulrich Germann
3d4ab5a0d9 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder into dynamic-phrase-tables 2014-04-28 10:22:12 +01:00
Ulrich Germann
7c145d045b Merge branch 'master' into dynamic-phrase-tables
Conflicts:
	contrib/server/Jamfile
	contrib/server/mosesserver.cpp
2014-04-28 10:00:07 +01:00
Philipp Koehn
fa85d15e31 relative links to javascript sub directory in ems web interface 2014-04-23 16:56:10 -04:00
phikoehn
4ee4e07c1b minor ems fixes 2014-04-23 13:50:08 +01:00
Nadir Durrani
5e3e50d4ec In-Decoding Transliteration Module 2014-04-16 17:28:49 +01:00
Ulrich Germann
fbb4b59084 Added option to disable output buffering to split-sentences.perl. 2014-04-16 02:40:23 +01:00
Rico Sennrich
c8682e9420 target-syntax: use SoftMatchingFeature to assign non-terminal to unknown words 2014-03-24 14:57:24 +00:00
Phil Williams
cea86d6750 Transliteration support for syntax models. 2014-03-21 22:13:38 +00:00
Hieu Hoang
c501e5fab6 accidental error in perl script 2014-03-14 09:04:49 +00:00
Nadir Durrani
054a648713 Transliteration Script - Modifications 2014-03-13 13:10:38 +00:00
phikoehn
049be8b71c Merge branch 'master' of ssh://github.com/moses-smt/mosesdecoder 2014-02-12 21:01:09 +00:00
phikoehn
d6b62db5b1 fix bug if interpolated lm on different factors 2014-02-12 21:00:55 +00:00
Matthias Huck
65811a0325 tree fragments: tiny issues with the extraction pipeline 2014-02-03 18:13:10 +00:00
Hieu Hoang
dc3d5b8d38 source labelling for test set. 2014-01-24 16:33:30 +00:00
Hieu Hoang
878e7ab899 source labelling for tuning set. More debugging message in filtering script 2014-01-24 16:21:47 +00:00
Hieu Hoang
6a10f8ce71 corrected phrase-table name / type mixup when creating filtering script 2014-01-23 17:09:56 +00:00
Hieu Hoang
05de672bd8 need to 'label' target side too 2014-01-21 19:21:24 +00:00
Hieu Hoang
27152ccce4 add source labeller to EMS 2014-01-20 23:26:06 +00:00
phikoehn
4e75911331 changed biconcor location in EMS example config files 2014-01-16 13:58:45 +00:00
Hieu Hoang
ebc724b3de Merge branch 'master' of github.com:moses-smt/mosesdecoder 2014-01-12 13:51:04 +00:00
Hieu Hoang
a975e3d32d Add Exception as a keyword for detecting error in EMS step 2014-01-12 13:50:01 +00:00
phikoehn
25553079d9 bug fix with sparse feature handling depending on word alignment in compact phrase table 2014-01-10 18:34:47 +00:00
phikoehn
9ea0f5dd0e reporting on init (pretty slow for binary phrase table!) and bug fix in experiment.perl with setting filter settings 2014-01-05 22:39:47 +00:00
phikoehn
c8b5cc4f0e avoid warning; 2013-12-31 19:21:28 +00:00
Nadir Durrani
7f75018349 Post-decoding Transliteration Script 2013-12-18 16:10:57 +00:00
Nadir Durrani
c291f859a0 Transliteration Mining 2013-12-16 18:19:44 +00:00
phikoehn
dab6a301fa make reference-from-sgm.perl more robust 2013-11-30 02:00:04 +00:00
Hieu Hoang
f85d26ec60 delete reuse-weights.perl 2013-11-12 12:19:44 +00:00
Hieu Hoang
df3f3d130f reuse-weights.perl --> substitute-weights.perl 2013-11-12 12:07:06 +00:00
JIDFmaster@JIDF.org
f68a92e9c1 correcting the reuse-weights.perl for a new format 2013-11-12 00:38:03 +01:00
Barry Haddow
97695164dd Basic support for WADE analysis
Partial support for running WADE (http://www.umiacs.umd.edu/~hal/damt/)
analysis from ems. You still need to create the input-reference alignments
somehow - for example by running training with the test set concatenated
to the training set.

To use WADE, (i) add 'wade = /path/to/wade.py' to the EVALUATION section and
(ii) add 'alignment = /path/to/alignments' to the approriate stanza
for each test set.
2013-11-01 16:56:55 +00:00
phikoehn
29d2c015a3 removed spurious $input-extension 2013-10-11 02:02:01 +01:00
Hieu Hoang
e8951c9243 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2013-10-10 21:30:17 +00:00
Hieu Hoang
9cbfa50102 add actual config file and run command used to train and decode this example 2013-10-10 21:30:07 +00:00
phikoehn
75e007d0f3 minor fixes 2013-10-10 10:12:56 +01:00
phikoehn
1e702c46b2 updated web interface for experiment.perl 2013-09-25 23:16:53 +01:00
Barry Haddow
ef43d6e038 Need phrase penalty weight 2013-09-11 10:59:48 +01:00
Barry Haddow
03997dfc3a Change number of weights in example 2013-09-11 10:41:17 +01:00
phikoehn
9f40416ee4 changed default for hierarchical phrase table binarization 2013-09-08 18:30:48 +01:00
Barry Haddow
867c6efe6c Merge branch 'master' of github.com:moses-smt/mosesdecoder 2013-09-06 13:29:29 +01:00
Barry Haddow
7425036c3a scoring correction 2013-09-06 13:29:20 +01:00
Nadir Durrani
4156c7acb6 Config files 2013-08-27 13:47:09 +01:00
Nadir Durrani
696c0eff61 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2013-08-26 13:22:57 +01:00
nadir
fb35e1f3c9 Training Scripts for Factored OSM 2013-08-26 13:21:04 +01:00
phikoehn
79a2c98ff7 better ems support for different binarizers and reordering models 2013-08-25 20:30:37 +01:00
phikoehn
b368085609 xml constraint 2013-08-15 11:46:45 +01:00
Hieu Hoang
02c7af3fb8 Mira changes. Manually applied Eva's patch 2013-08-12 13:03:26 +01:00
Hieu Hoang
b05a443f36 correct arguments to substitute-filtered-tables-and-weights.perl 2013-07-30 11:14:17 +01:00
Barry Haddow
46ee1ca42d More lattice fixes squashed by merge 2013-07-24 16:09:32 +01:00
Barry Haddow
d5e40a5b08 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2013-07-24 11:38:23 +01:00
Nadir Durrani
30544ae17e Sample Config File 2013-07-23 12:29:23 +01:00
Nadir Durrani
61e56ecdcd Sample Config File 2013-07-23 12:18:57 +01:00
Barry Haddow
ecc6c7177c Reinstate lattice fixes squashed by merge 2013-07-22 17:25:01 +01:00
Nadir Durrani
deae3ac7b9 OSM entries in train-model.perl, experiment.* 2013-07-07 13:05:09 +01:00
Nadir Durrani
d2bc6a2584 In EMS 2013-07-04 19:58:19 +01:00
Nadir Durrani
fbdb07a94c EMS 2013-07-03 10:54:38 +01:00
Wilker Aziz
2c19238c24 Patching up the suffix array wrappers 2013-06-24 15:38:10 +01:00
phikoehn
2e8fbe77a2 corrected example files 2013-06-08 14:45:55 +01:00
phikoehn
730da7edec sparse feature specification bug fix 2013-06-08 13:39:15 +01:00
Graham Neubig
33d5aac6af Fixed crash in interpolation for small lambdas
The EMS crashed when interpolating language models when the ideal lambdas included numbers so small that they required scientific notation (eg: 1.332e-07). This patch adds "e" and "-" to the acceptable numbers to fix this problem
2013-06-01 12:37:24 +09:00
phikoehn
68501f5a36 bug fix with weight substitution 2013-05-31 12:27:35 +01:00
phikoehn
8944ea541a fast align parameter 2013-05-25 23:20:27 +01:00
phikoehn
542cd72c63 moved config creation back into train-model.perl 2013-05-19 03:28:02 +01:00
Hieu Hoang
0596ba4245 carry [weight-file] from tuned ini 2013-05-17 18:23:55 +01:00
Hieu Hoang
11632e298e add substitute-filtered-tables-and-weights.perl for applying filter for evaluation step 2013-05-17 16:13:24 +01:00
Hieu Hoang
42c292765a add substitute-filtered-tables-and-weights.perl for applying filter for evaluation step 2013-05-17 13:28:21 +01:00
phikoehn
4cdffc8a89 fixes for sparse feature handling 2013-05-17 08:37:29 +01:00
phikoehn
13991fc88f added specification to example config files for fast align 2013-05-17 06:42:54 +01:00
Barry Haddow
585786d26b can specify location of create-ini 2013-05-16 19:34:56 +01:00
Hieu Hoang
8dd84d7a40 change integration of sparse features with EMS to account for new weights format 2013-05-16 15:38:05 +01:00
phikoehn
41da5b2760 Merge branch 'master' of git://github.com/moses-smt/mosesdecoder 2013-05-12 08:16:22 +01:00
Hieu Hoang
a8f4e2c8fe changes for cruise control 2013-05-10 15:43:49 +01:00
Hieu Hoang
e2f2aff94a merged. Mostly by discarding new changes 2013-05-03 14:36:39 +01:00
Barry Haddow
8a965cd62e Fixes to binarize-all 2013-05-03 10:15:37 +01:00
Barry Haddow
8993339df4 Make sure tuning uses filtered config when available. 2013-05-02 18:50:21 +01:00
Barry Haddow
5eebb9538e Enable skipping of filtering in EMS
Use 'binarize-all = path-to-binarize-model.perl
2013-05-02 15:15:52 +01:00
phikoehn
d19a28ae21 Merge branch 'master' of git://github.com/moses-smt/mosesdecoder 2013-05-01 19:22:00 +01:00
phikoehn
cd8915647b support for Chris Dyer's fast-align; bug fix with sparse word translations feature; threshold pruning in filter 2013-05-01 19:20:05 +01:00
Hieu Hoang
3ed17bbedd merge 2013-05-01 11:50:29 +01:00
Barry Haddow
5638aa6a32 don't rebuild tables when a TRAINING:config is specified 2013-05-01 11:25:32 +01:00
Hieu Hoang
b1da4dbe0e merged 2013-04-19 15:03:34 +01:00
Barry Haddow
9d42c7f6f7 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2013-04-12 16:07:26 +01:00
Hieu Hoang
71a2b49a47 Merge github.com:moses-smt/mosesdecoder into weight-new 2013-04-01 16:43:32 +01:00
phikoehn
0a978e9f01 bug fixes 2013-04-01 14:31:32 +01:00
Hieu Hoang
fd4e954322 merge 2013-03-24 09:57:36 +00:00
Barry Haddow
8efeb59228 don't lowercase reference if there's a recaser 2013-03-18 21:29:17 +00:00
Hieu Hoang
1f1a0297db runtime error in creating ini file 2013-03-16 15:05:14 +00:00
Hieu Hoang
3271710dd5 tell create-ini about input factors 2013-03-11 19:31:53 +00:00
Hieu Hoang
7fb363d922 tell create-ini what factors are being used 2013-03-11 13:42:26 +00:00
Hieu Hoang
96d76ff642 correctly call substitute-weights.perl in apply-weights 2013-03-08 12:53:18 +00:00
Hieu Hoang
0869921b80 remove misc/reuse-weights.cpp 2013-03-07 16:53:50 +00:00
Hieu Hoang
0f2b2acd78 added substitute-weights.perl 2013-03-06 19:34:48 +00:00
Hieu Hoang
3e60705ec2 change format of pt. Allow overriding of feature name 2013-03-06 12:39:41 +00:00
Hieu Hoang
f3e3bc7636 Merge github.com:moses-smt/mosesdecoder into weight-new 2013-02-27 23:32:50 +01:00
amittai
5cdf65ba33 Revert "Revert "let's be consistently case-insensitive with respect to the xml tags""
This reverts commit 8b6e98c633.
2013-02-26 11:32:29 -08:00
amittai
8b6e98c633 Revert "let's be consistently case-insensitive with respect to the xml tags"
This reverts commit 2eb0c5e11d.
2013-02-24 18:10:19 -08:00
amittai
2eb0c5e11d let's be consistently case-insensitive with respect to the xml tags 2013-02-24 18:07:11 -08:00
Barry Haddow
51ab9aa19d Merge remote branch 'origin/master' into phrase-weighting 2013-02-21 17:34:59 +00:00
Hieu Hoang
b9b74c53ae new ini file with hiero models 2013-02-18 16:51:44 +00:00
Hieu Hoang
9fe9b0008b new ini file with hiero models 2013-02-18 12:05:10 +00:00
Hieu Hoang
30850bf45f new substitute-filtered-tables.perl 2013-02-16 11:01:23 +00:00
Hieu Hoang
ae52a15c4d use new create-ini program to create ini file, rather than step 9 of train-model.perl 2013-02-14 12:07:48 +00:00
Hieu Hoang
825edd282b use new create-ini program to create ini file, rather than step 9 of train-model.perl 2013-02-14 11:19:40 +00:00
Hieu Hoang
aadefc6df9 Merge branch 'master' into weight-new 2013-01-17 18:05:02 +00:00
phikoehn
124c36a837 bug fix with MML settings 2013-01-14 19:39:26 +00:00
phikoehn
d5cf38cab2 Merge branch 'master' of git://github.com/moses-smt/mosesdecoder 2013-01-14 19:23:02 +00:00
phikoehn
344b150372 bug fixes with escaping / truecasing interactions 2013-01-14 19:22:29 +00:00
Hieu Hoang
fa60724391 merge 2013-01-14 10:39:41 +00:00
Kenneth Heafield
c9687e3b50 Fix longstanding bug in sentence splitter spacing.
"Foo Bar.  Baz Quux." is two sentences even though there are two spaces instead of one.
2013-01-11 13:32:24 +00:00
Graham Neubig
c55a1474df Updated experiment.meta 2013-01-10 16:16:23 +09:00
Hieu Hoang
bc615bdac8 Merge branch 'master' into weight-new 2012-12-18 12:46:00 +00:00
Hieu Hoang
aa00aebee6 rewrite reuse-weight 2012-12-18 10:23:00 +00:00
Hieu Hoang
667b8a495f decoder -show-weight to output consistent new format 2012-12-17 17:17:44 +00:00
phikoehn
b275c94dbf allow for inclusion of extract from previous run 2012-12-12 07:02:59 +00:00
phikoehn
24e1df7520 support for use of baseline alignment model 2012-12-12 03:59:14 +00:00
phikoehn
438dcb1a34 bug fix in experiment.perl wrt. get-corpus-script 2012-12-10 23:50:14 +00:00
Barry Haddow
16ea68f55f Fix bug in mml scoring
Line length calculation was out of step with LM scoring.
2012-12-10 15:54:24 +00:00
phikoehn
ed2d191821 allow specification of end point for experiment.perl 2012-12-10 05:56:51 +00:00
phikoehn
ccf9e13d8e bug fix with multicore parallelizer 2012-12-09 22:27:02 +00:00
phikoehn
466b502ae0 minor bug fixes with MML 2012-12-09 20:31:20 +00:00
Hieu Hoang
55e5af4785 add my workstation to ems list 2012-12-07 19:24:58 +00:00
phikoehn
ab2effb6fe train MML in-/out-of-domain language models with same vocabulary 2012-12-01 13:46:59 +00:00
Hieu Hoang
5fd9cbb529 delete reference to numpy. Doesn't need it 2012-11-30 10:28:51 +00:00
phikoehn
338b7656a6 ooops 2012-11-30 07:36:59 +00:00
phikoehn
84cb04c05a fixes and extensions to modified Moore-Lewis filtering, now works with domain features 2012-11-30 07:28:31 +00:00
phikoehn
1f7ee0e6c5 change of settings for sigtest filtering 2012-11-29 23:44:10 +00:00
Barry Haddow
f0e12912e7 mml-score.py. Support for combining with domain features. 2012-11-27 15:58:55 +00:00
phikoehn
b5d08745a5 extensions to modified moore-lewis filtering, bug fixes 2012-11-24 20:13:14 +00:00
phikoehn
ea610a0558 added modified-moore-lewis from Barry Haddow into EMS 2012-11-24 12:43:13 +00:00
phikoehn
d4cebb008a added ems support for sigtest-filter 2012-11-23 17:35:13 +00:00
phikoehn
c2a96fcc33 adjust to irstlm changes 2012-11-20 17:19:17 +00:00
phikoehn
5cd614ecd8 adjust to irstlm changes 2012-11-20 17:18:57 +00:00
Barry Haddow
a90e1861c0 Alignments on by default for phrase-based 2012-11-15 12:35:43 +00:00
Barry Haddow
5e3726eb90 Remove -use-alignment-info 2012-11-15 09:42:58 +00:00
Barry Haddow
c316fe68de Strip segmentation before scoring. 2012-11-14 13:28:40 +00:00
Barry Haddow
095b307cfc Make sure alignment info is not on for hiero 2012-11-08 18:16:53 +00:00
Barry Haddow
12786dd58f Alignments on by default.
Use TRAINING:include-word-alignment-in-rules = no
to turn them off.
2012-11-08 17:52:10 +00:00
Barry Haddow
01c4de24b7 don't delete moses ini specified in config 2012-11-06 10:59:15 +00:00
Barry Haddow
f9d0721145 correct format for word trans features 2012-10-19 09:07:18 +01:00
phikoehn
0dda804c46 sparse feature fixes 2012-10-18 03:09:49 +01:00
phikoehn
7d66a4f8d5 superminortiny fix 2012-10-18 02:52:30 +01:00
phikoehn
98dafc0301 Merge branch 'master' of git://github.com/moses-smt/mosesdecoder 2012-10-18 02:20:45 +01:00
phikoehn
f19d53dac9 minor fixes 2012-10-18 02:20:38 +01:00
Barry Haddow
61ae24aa5d Merge remote-tracking branch 'origin/master' into miramerge
Conflicts:
	moses/src/PhraseDictionary.cpp
	moses/src/TargetPhrase.cpp
	moses/src/TargetPhrase.h
2012-10-14 14:18:03 +01:00
Lane Schwartz
c541c77b2f Merge branch 'master' of www:/repos/git/Decoders/mosesdecoder 2012-10-11 10:15:00 -04:00
Lane Schwartz
11679849db In verbose mode, experiment.perl should print the full qsub command
prior to actually running it.
2012-10-10 13:25:58 -04:00
Lane Schwartz
0904531749 Make experiment.perl use qsub-settings from GENERAL section
if no qsub-settings are defined for the specific section being run.
2012-10-10 13:25:09 -04:00