Commit Graph

175 Commits

Author SHA1 Message Date
Jeroen Vermeulen
ef028446f3 Add license notices to scripts.
This is not pleasant to read (and much, much less pleasant to write!) but
sort of necessary in an open project.  Right now it's quite hard to figure
out what is licensed how, which doesn't matter much to most people but can
suddenly become very important when people want to know what they're being
allowed to do.

I kept the notices as short as I could.  As far as I could see, everything
without a clear license notice is LGPL v2.1 or later.
2015-05-29 18:30:26 +07:00
Hieu Hoang
582a845524 don't use zcat 2015-05-24 20:04:01 +04:00
Hieu Hoang
2f0ee5502e delete debugging info 2015-05-18 16:26:26 +04:00
Jeroen Vermeulen
a25193cc5d Fix a lot of lint, mostly trailing whitespace.
This is lint reported by the new lint-checking functionality in beautify.py.
(We can change to a different lint checker if we have a better one, but it
would probably still flag these same problems.)

Lint checking can help a lot, but only if we get the lint under control.
2015-05-17 20:04:04 +07:00
Jeroen Vermeulen
9bdcb5f7c1 Fix more Python lint.
This is about the last that isn't in contrib or generated files.  At this
point we can start doing regular lint checks, at least on the Python files,
without being completely inundated with warnings.
2015-05-16 18:03:54 +07:00
Jeroen Vermeulen
61162dd242 Fix more Python lint.
Most of the complaints fixed here were from Pocketlint, but many were also
from Syntastic the vim plugin.
2015-05-16 17:26:56 +07:00
Nicola Bertoldi
5700fbaabf Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-04-22 07:50:07 +02:00
Hieu Hoang
c15f3ef068 duplicated functionality with ems/support/lmplz-wrapper.perl 2015-04-21 17:54:34 +04:00
Hieu Hoang
6162223690 add use warnings to all perl scripts 2015-04-13 20:42:33 +04:00
Hieu Hoang
54e55f2dcb better detection of pigz, sort, split. In case they are not in the default directory 2015-04-06 11:31:44 +04:00
Hieu Hoang
2d1da3219d consistently use 'env perl' command for environments where the 1st perl in PATH isn't the default perl. Which is kinda stupid 2015-04-02 17:38:56 +04:00
Matthias Huck
06e87d851e GHKM: extract POS phrase property (from preterminals in the syntactic parse tree) 2015-03-04 21:40:56 +00:00
Nicola Bertoldi
a1539505c8 minor change to make extract-parallel.perl compliant with MacOSX split command 2015-02-04 09:02:51 +01:00
Matthias Huck
9987beb453 SoftSourceSyntacticConstraintsFeature: Now for both non-terminals (as before) _and_ terminals.
Also added score components based on relative frequency.
(TODO: logprobs right now; are plain probabilities better?)
2015-01-23 18:41:18 +00:00
Hieu Hoang
90d4b2d713 use pigz rather than gzip if it exists 2015-01-13 15:16:22 +00:00
mjdenkowski
40e8f2eca0 Hypergraph output 2014-11-03 09:16:12 -05:00
mjdenkowski
a1f561ac31 Only update dynamic models 2014-10-10 15:09:53 -04:00
Ondrej Bojar
14449b3601 towards a simple line-oriented dump of FSA 2014-09-11 14:50:59 +02:00
Michael Denkowski
b8c9ae2c55 Update models named "Dynamic..." 2014-08-29 14:40:20 -04:00
Michael Denkowski
9098f3a8b4 Support simulated post-editing with MultiModel 2014-08-19 16:20:35 -04:00
Michael Denkowski
300de5d041 Text size limits jobs 2014-08-13 16:51:20 -04:00
Michael Denkowski
057066ea0e Minor fixes for simulated post-editing with mert-moses.pl 2014-08-13 15:58:51 -04:00
Hieu Hoang
94c44c03d5 merge 2014-08-13 18:03:05 +01:00
Matthias Huck
c27cbf55ea source labels: integration into EMS 2014-08-07 21:02:51 +01:00
Michael Denkowski
9ad59e2d69 Header and some instructions 2014-08-05 15:11:35 -04:00
Michael Denkowski
e7c36ee804 Simulated post-editing merge: XML update, parallel SPE script, MERT 2014-08-05 14:20:00 -04:00
Matthias Huck
3a5dee12e8 implementation of phrase orientation in GHKM extraction
(...but a corresponding feature function for the chart-based decoder has not been written yet)
2014-07-28 18:27:12 +01:00
Hieu Hoang
f95a1bb75b more shards 2014-06-23 09:36:13 -04:00
phikoehn
ac7670c5e7 minor bugs with factors 2014-06-06 14:14:35 +01:00
phikoehn
ceadacd3af Merge branch 'master' of ssh://github.com/moses-smt/mosesdecoder 2014-06-05 21:33:35 +01:00
Philipp Koehn
15288213be allow < > in factors 2014-06-05 21:31:09 +01:00
Ulrich Germann
9c27dc08e9 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
Conflicts:
	moses/FF/ConstrainedDecoding.cpp
2014-06-03 18:55:53 +01:00
Ulrich Germann
0b6a0e42d6 scripts/generic/bsbleu.py: Bug fix. 2014-05-30 13:51:22 +01:00
Ulrich Germann
d3b058ca4b scripts/generic/bsbleu.py: Confidence interval can now be set via command line argument. 2014-05-30 13:50:10 +01:00
Ulrich Germann
67be805211 Added scripts/generic/bsbleu.py: bleu score computation with confidence intervals via bootstrap resampling. 2014-05-30 13:12:07 +01:00
Hieu Hoang
42a1989f86 generic parallelization that takes stdin and outputs to stdout. For parsing 2014-05-23 01:03:29 +01:00
Hieu Hoang
403a473222 utf8 encoding 2014-05-22 21:23:31 +01:00
Hieu Hoang
3df23ea6ad utf8 encoding 2014-05-22 19:00:36 +01:00
Hieu Hoang
7ec0c53e92 generic parallelization that takes stdin and outputs to stdout. For parsing 2014-05-22 15:22:55 +01:00
Hieu Hoang
d97be10a27 generic parallelization that takes stdin and outputs to stdout. For parsing 2014-05-22 14:38:27 +01:00
Hieu Hoang
b6d47733da remove duplicate spaces caused when XML are stripped 2014-01-24 18:09:55 +00:00
Hieu Hoang
19aa8c1056 changes to moses-parallel.pl to make it work. /Thomas Meyer 2013-12-05 12:23:40 +00:00
Rico Sennrich
7138056b8f flexibility scores 2013-09-07 23:04:01 +02:00
Hieu Hoang
ac50d9f349 Merge /home/hieu/workspace/github/mosesdecoder into hieu_opt_input 2013-08-01 16:55:46 +01:00
Hieu Hoang
f234aa203f number recognizer treats each word as atomic, replace all of the word or nothing at all. Recognizer is designed to be run after the text has been tokenized, not before 2013-08-01 16:55:11 +01:00
Hieu Hoang
4bc7ce99ed add script to strip xml 2013-07-29 11:27:13 +01:00
Hieu Hoang
a3e3289b08 In corpus mode, replace number with number symbol 2013-07-25 15:54:47 +01:00
Hieu Hoang
78381d0213 @NUM@ --> @num@. In case using recaser 2013-07-25 15:16:15 +01:00
Hieu Hoang
6fc21a32fc Merge branch 'master' of github.com:moses-smt/mosesdecoder 2013-07-24 19:01:57 +01:00
Hieu Hoang
c104dee3b2 merge glue grammars, rather than writing them all to the same file. Required by Phil Williams & others when doing syntax extraction 2013-07-24 19:01:46 +01:00