Commit Graph

46 Commits

Author SHA1 Message Date
Jeroen Vermeulen
ef028446f3 Add license notices to scripts.
This is not pleasant to read (and much, much less pleasant to write!) but
sort of necessary in an open project.  Right now it's quite hard to figure
out what is licensed how, which doesn't matter much to most people but can
suddenly become very important when people want to know what they're being
allowed to do.

I kept the notices as short as I could.  As far as I could see, everything
without a clear license notice is LGPL v2.1 or later.
2015-05-29 18:30:26 +07:00
Hieu Hoang
2f0ee5502e delete debugging info 2015-05-18 16:26:26 +04:00
Jeroen Vermeulen
a25193cc5d Fix a lot of lint, mostly trailing whitespace.
This is lint reported by the new lint-checking functionality in beautify.py.
(We can change to a different lint checker if we have a better one, but it
would probably still flag these same problems.)

Lint checking can help a lot, but only if we get the lint under control.
2015-05-17 20:04:04 +07:00
Nicola Bertoldi
5700fbaabf Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-04-22 07:50:07 +02:00
Hieu Hoang
6162223690 add use warnings to all perl scripts 2015-04-13 20:42:33 +04:00
Hieu Hoang
54e55f2dcb better detection of pigz, sort, split. In case they are not in the default directory 2015-04-06 11:31:44 +04:00
Hieu Hoang
2d1da3219d consistently use 'env perl' command for environments where the 1st perl in PATH isn't the default perl. Which is kinda stupid 2015-04-02 17:38:56 +04:00
Nicola Bertoldi
a1539505c8 minor change to make extract-parallel.perl compliant with MacOSX split command 2015-02-04 09:02:51 +01:00
Hieu Hoang
90d4b2d713 use pigz rather than gzip if it exists 2015-01-13 15:16:22 +00:00
Matthias Huck
c27cbf55ea source labels: integration into EMS 2014-08-07 21:02:51 +01:00
Matthias Huck
3a5dee12e8 implementation of phrase orientation in GHKM extraction
(...but a corresponding feature function for the chart-based decoder has not been written yet)
2014-07-28 18:27:12 +01:00
Hieu Hoang
f95a1bb75b more shards 2014-06-23 09:36:13 -04:00
Rico Sennrich
7138056b8f flexibility scores 2013-09-07 23:04:01 +02:00
Hieu Hoang
c104dee3b2 merge glue grammars, rather than writing them all to the same file. Required by Phil Williams & others when doing syntax extraction 2013-07-24 19:01:46 +01:00
Hieu Hoang
1b83b85f44 debug info from sort command 2013-03-18 16:48:40 +00:00
phikoehn
d5cf38cab2 Merge branch 'master' of git://github.com/moses-smt/mosesdecoder 2013-01-14 19:23:02 +00:00
phikoehn
344b150372 bug fixes with escaping / truecasing interactions 2013-01-14 19:22:29 +00:00
Barry Haddow
459acf87b1 Add support for instance weights file 2013-01-04 14:55:24 +00:00
phikoehn
b275c94dbf allow for inclusion of extract from previous run 2012-12-12 07:02:59 +00:00
phikoehn
a84fbcb80a bug fix for using domain feature in multi-process extract 2012-09-24 11:14:36 +01:00
Hieu Hoang
30e5b0575b merge conflict 2012-09-03 19:12:00 +01:00
phikoehn
4a1a995878 a lot of changes 2012-08-18 23:48:26 +01:00
phikoehn
366ab93f8a a lot of changes 2012-08-18 23:47:05 +01:00
Hieu Hoang
23bbf12752 redirect stderr from extract program to stderr of script that runs it, for easier debugging 2012-07-18 22:53:47 +01:00
Hieu Hoang
f0a3e67717 redirect stderr from extract program to stderr of script that runs it, for easier debugging. Recursive create of output directory, in case output directory doesn't exist 2012-07-18 22:33:49 +01:00
Hieu Hoang
b609473645 undo changes to make scripts handle directories with spaces. Karel gave up, and it is messing up cruise control 2012-07-18 11:02:24 +01:00
Karel Bílek
ad257b27d9 spaces in filepath do not wreak havoc anymore 2012-07-17 04:36:27 +02:00
Barry Haddow
25f5def387 Fix for case where no ttable is to be created 2012-06-03 22:03:41 +01:00
Hieu Hoang
f91b19fa41 Works with phrase-table scoring 2012-05-23 19:26:37 +01:00
Hieu Hoang
761ac4f0fc parallel extract. Works with reordering 2012-05-23 19:02:36 +01:00
Hieu Hoang
e74f67abc5 use new gzipped extract file. Always save extract files already sorted 2012-05-13 10:23:59 +01:00
Hieu Hoang
066c209fcb use new gzipped extract file. Always save extract files already sorted 2012-05-12 15:41:07 +01:00
Hieu Hoang
ef390a705f 1. use new gsort, rather than sort, if it is available
2. check return value of cat command in extract-parallel in case it runs out of disk space etc
2012-03-11 15:50:46 +07:00
Hieu Hoang
de9eeab7e9 mac osx compatible split & sort 2012-02-23 13:26:19 +00:00
Hieu Hoang
7073f7d891 bug fix by Guchun Zhang 2012-02-16 10:28:09 +07:00
Hieu Hoang
7f6f8a99f9 bug fix by Guchun Zhang 2012-02-16 10:26:48 +07:00
Hieu Hoang
53b41f7c45 parallel extract 2012-02-09 18:24:49 +07:00
hieuhoang1972
6a27dc4f17 example of how to run
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4084 1f5c12ca-751b-0410-a591-d2e778427230
2011-07-22 08:32:09 +00:00
hieuhoang1972
debca7632b change order of arguments. Arguments for extract-parallel are simple extension of normal extract
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4027 1f5c12ca-751b-0410-a591-d2e778427230
2011-06-23 02:25:49 +00:00
hieuhoang1972
2cdc39f63f parallelize extract using perl fork.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4025 1f5c12ca-751b-0410-a591-d2e778427230
2011-06-21 21:48:56 +00:00
hieuhoang1972
4b5c8aaf10 parallelize extract using perl fork.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4024 1f5c12ca-751b-0410-a591-d2e778427230
2011-06-21 21:34:18 +00:00
hieuhoang1972
4689d33d0f parallelize extract using perl fork.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4023 1f5c12ca-751b-0410-a591-d2e778427230
2011-06-21 21:31:47 +00:00
hieuhoang1972
9eb51e31fb parallelize extract using perl fork.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4022 1f5c12ca-751b-0410-a591-d2e778427230
2011-06-21 16:47:22 +00:00
hieuhoang1972
62ddd6eb53 parallelize extract using perl fork.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4021 1f5c12ca-751b-0410-a591-d2e778427230
2011-06-21 16:45:42 +00:00
hieuhoang1972
bd64e748ff parallelize extract using perl fork.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4020 1f5c12ca-751b-0410-a591-d2e778427230
2011-06-21 16:38:15 +00:00
hieuhoang1972
56fc94c2a7 parallelize extract using perl fork. Not quite ready for prime time
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4019 1f5c12ca-751b-0410-a591-d2e778427230
2011-06-21 16:20:22 +00:00