Jeroen Vermeulen
ef028446f3
Add license notices to scripts.
...
This is not pleasant to read (and much, much less pleasant to write!) but
sort of necessary in an open project. Right now it's quite hard to figure
out what is licensed how, which doesn't matter much to most people but can
suddenly become very important when people want to know what they're being
allowed to do.
I kept the notices as short as I could. As far as I could see, everything
without a clear license notice is LGPL v2.1 or later.
2015-05-29 18:30:26 +07:00
Hieu Hoang
582a845524
don't use zcat
2015-05-24 20:04:01 +04:00
Hieu Hoang
2f0ee5502e
delete debugging info
2015-05-18 16:26:26 +04:00
Jeroen Vermeulen
a25193cc5d
Fix a lot of lint, mostly trailing whitespace.
...
This is lint reported by the new lint-checking functionality in beautify.py.
(We can change to a different lint checker if we have a better one, but it
would probably still flag these same problems.)
Lint checking can help a lot, but only if we get the lint under control.
2015-05-17 20:04:04 +07:00
Jeroen Vermeulen
9bdcb5f7c1
Fix more Python lint.
...
This is about the last that isn't in contrib or generated files. At this
point we can start doing regular lint checks, at least on the Python files,
without being completely inundated with warnings.
2015-05-16 18:03:54 +07:00
Jeroen Vermeulen
61162dd242
Fix more Python lint.
...
Most of the complaints fixed here were from Pocketlint, but many were also
from Syntastic the vim plugin.
2015-05-16 17:26:56 +07:00
Nicola Bertoldi
5700fbaabf
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-04-22 07:50:07 +02:00
Hieu Hoang
c15f3ef068
duplicated functionality with ems/support/lmplz-wrapper.perl
2015-04-21 17:54:34 +04:00
Hieu Hoang
6162223690
add use warnings to all perl scripts
2015-04-13 20:42:33 +04:00
Hieu Hoang
54e55f2dcb
better detection of pigz, sort, split. In case they are not in the default directory
2015-04-06 11:31:44 +04:00
Hieu Hoang
2d1da3219d
consistently use 'env perl' command for environments where the 1st perl in PATH isn't the default perl. Which is kinda stupid
2015-04-02 17:38:56 +04:00
Matthias Huck
06e87d851e
GHKM: extract POS phrase property (from preterminals in the syntactic parse tree)
2015-03-04 21:40:56 +00:00
Nicola Bertoldi
a1539505c8
minor change to make extract-parallel.perl compliant with MacOSX split command
2015-02-04 09:02:51 +01:00
Matthias Huck
9987beb453
SoftSourceSyntacticConstraintsFeature: Now for both non-terminals (as before) _and_ terminals.
...
Also added score components based on relative frequency.
(TODO: logprobs right now; are plain probabilities better?)
2015-01-23 18:41:18 +00:00
Hieu Hoang
90d4b2d713
use pigz rather than gzip if it exists
2015-01-13 15:16:22 +00:00
mjdenkowski
40e8f2eca0
Hypergraph output
2014-11-03 09:16:12 -05:00
mjdenkowski
a1f561ac31
Only update dynamic models
2014-10-10 15:09:53 -04:00
Ondrej Bojar
14449b3601
towards a simple line-oriented dump of FSA
2014-09-11 14:50:59 +02:00
Michael Denkowski
b8c9ae2c55
Update models named "Dynamic..."
2014-08-29 14:40:20 -04:00
Michael Denkowski
9098f3a8b4
Support simulated post-editing with MultiModel
2014-08-19 16:20:35 -04:00
Michael Denkowski
300de5d041
Text size limits jobs
2014-08-13 16:51:20 -04:00
Michael Denkowski
057066ea0e
Minor fixes for simulated post-editing with mert-moses.pl
2014-08-13 15:58:51 -04:00
Hieu Hoang
94c44c03d5
merge
2014-08-13 18:03:05 +01:00
Matthias Huck
c27cbf55ea
source labels: integration into EMS
2014-08-07 21:02:51 +01:00
Michael Denkowski
9ad59e2d69
Header and some instructions
2014-08-05 15:11:35 -04:00
Michael Denkowski
e7c36ee804
Simulated post-editing merge: XML update, parallel SPE script, MERT
2014-08-05 14:20:00 -04:00
Matthias Huck
3a5dee12e8
implementation of phrase orientation in GHKM extraction
...
(...but a corresponding feature function for the chart-based decoder has not been written yet)
2014-07-28 18:27:12 +01:00
Hieu Hoang
f95a1bb75b
more shards
2014-06-23 09:36:13 -04:00
phikoehn
ac7670c5e7
minor bugs with factors
2014-06-06 14:14:35 +01:00
phikoehn
ceadacd3af
Merge branch 'master' of ssh://github.com/moses-smt/mosesdecoder
2014-06-05 21:33:35 +01:00
Philipp Koehn
15288213be
allow < > in factors
2014-06-05 21:31:09 +01:00
Ulrich Germann
9c27dc08e9
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
...
Conflicts:
moses/FF/ConstrainedDecoding.cpp
2014-06-03 18:55:53 +01:00
Ulrich Germann
0b6a0e42d6
scripts/generic/bsbleu.py: Bug fix.
2014-05-30 13:51:22 +01:00
Ulrich Germann
d3b058ca4b
scripts/generic/bsbleu.py: Confidence interval can now be set via command line argument.
2014-05-30 13:50:10 +01:00
Ulrich Germann
67be805211
Added scripts/generic/bsbleu.py: bleu score computation with confidence intervals via boostrap resampling.
2014-05-30 13:12:07 +01:00
Hieu Hoang
42a1989f86
generic parallization that take stdin and output to stdout. For parsing
2014-05-23 01:03:29 +01:00
Hieu Hoang
403a473222
utf8 encoding
2014-05-22 21:23:31 +01:00
Hieu Hoang
3df23ea6ad
utf8 encoding
2014-05-22 19:00:36 +01:00
Hieu Hoang
7ec0c53e92
generic parallization that take stdin and output to stdout. For parsing
2014-05-22 15:22:55 +01:00
Hieu Hoang
d97be10a27
generic parallization that take stdin and output to stdout. For parsing
2014-05-22 14:38:27 +01:00
Hieu Hoang
b6d47733da
remove duplicate spaces caused when XML are stripped
2014-01-24 18:09:55 +00:00
Hieu Hoang
19aa8c1056
changes to moses-parallel.pl to make it work. /Thomas Meyer
2013-12-05 12:23:40 +00:00
Rico Sennrich
7138056b8f
flexibility scores
2013-09-07 23:04:01 +02:00
Hieu Hoang
ac50d9f349
Merge /home/hieu/workspace/github/mosesdecoder into hieu_opt_input
2013-08-01 16:55:46 +01:00
Hieu Hoang
f234aa203f
number recognizer treats each word as atomic, replace all of the word or nothing at all. Recognizer is designed to be run after the text has been tokenized, not before
2013-08-01 16:55:11 +01:00
Hieu Hoang
4bc7ce99ed
add script to strip xml
2013-07-29 11:27:13 +01:00
Hieu Hoang
a3e3289b08
In corpus mode, replace number with number symbol
2013-07-25 15:54:47 +01:00
Hieu Hoang
78381d0213
@NUM@ --> @num@. In case using recaser
2013-07-25 15:16:15 +01:00
Hieu Hoang
6fc21a32fc
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2013-07-24 19:01:57 +01:00
Hieu Hoang
c104dee3b2
merge glue grammars, rather than writing them all to the same file. Required by Phil Williams & others when doing syntax extraction
2013-07-24 19:01:46 +01:00