Commit Graph

13771 Commits

Author SHA1 Message Date
Dingyuan Wang
aea07b0a19 Fix some problems in EMS:
* remove absolute links
* fix coverage bar highlighting
* change Base64 library to support UTF-8
2015-04-03 23:47:25 +08:00
Rico Sennrich
8d8097632b re-apply commit 1fb51dc (use gunzip -c instead of zcat)
plus be more tolerant about xml input
2015-04-03 15:00:45 +01:00
Jeroen Vermeulen
c52d7be025 Make an include path explicit.
Better to see clearly where that header is supposed to come from, since
it lives halfway across the codebase.

I think this may also make the "classifier" dance in moses/Jamfile
unnecessary, which would be a nice simplification, but I'm not
confident enough to just make the change.
2015-04-03 17:47:48 +07:00
Hieu Hoang
0c58e19491 uncomment out lines that clang choked on. Now works 2015-04-02 22:39:44 +04:00
Hieu Hoang
00065fee15 Merge pull request #105 from aminorex/master
compile with clang 3.3 x86_x6, no warnings
2015-04-02 22:29:42 +04:00
akimbal1
ad70c9a35d resolve conflicts 2015-04-02 14:15:15 -04:00
akimbal1
b4e24a2fb8 compile with clang 3.3 x86_x6, no warnings 2015-04-02 14:07:23 -04:00
Hieu Hoang
d71e516176 make it compile on osx/clang 2015-04-02 21:30:47 +04:00
Hieu Hoang
b2f9ba2b64 revert last commit to add MASTER_PATH. Not needed 2015-04-02 19:29:42 +04:00
Hieu Hoang
27b36e0c96 pass in PATH variable from master node. When you're running of a grid but really just qsubbing everything to 1 slave node 2015-04-02 19:15:21 +04:00
Hieu Hoang
2d1da3219d consistently use 'env perl' command for environments where the 1st perl in PATH isn't the default perl. Which is kinda stupid 2015-04-02 17:38:56 +04:00
Hieu Hoang
035c806059 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-04-02 14:10:42 +04:00
Hieu Hoang
d2137487f5 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-04-02 14:08:19 +04:00
Hieu Hoang
e76247e19b Conditional import of Thread package for perl installations that don't support threads 2015-04-02 14:07:57 +04:00
Hieu Hoang
2dc75945e5 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-04-02 09:54:38 +04:00
Hieu Hoang
f0e4f8b557 Merge pull request #103 from aminorex/master
tokenizer improvements, faster better sentence splitter
2015-04-02 09:49:41 +04:00
Jeroen Vermeulen
ebc0930500 Replace use of tmpnam with boost::filesystem.
Silences a few annoying warnings from gcc: "tmpnam is dangerous" (and
the suggestion to use mkstemp instead).
2015-04-02 10:42:06 +07:00
akimbal1
8cea968067 handle asian stock tickers better 2015-04-01 18:49:51 -04:00
akimbal1
6771bf9142 Merge remote-tracking branch 'upstream/master' 2015-04-01 18:30:44 -04:00
akimbal1
d4ef9ce106 make -a work more like the perl tokenizer 2015-04-01 18:26:19 -04:00
akimbal1
2e39e829bf splitter and tokenizer tweaks, multithreading tokenizer 2015-04-01 15:49:32 -04:00
Hieu Hoang
5310bfad64 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-04-01 23:27:46 +04:00
Hieu Hoang
e22d275c32 don't ignore lowercasing of factored LM. Must be consistent with pt 2015-04-01 23:25:57 +04:00
XapaJIaMnu
29a729c99b Remove old obsolete probingPT tests 2015-04-01 16:58:21 +01:00
akimbal1
fd596b1972 splitter tweaks 2015-04-01 02:21:03 -04:00
akimbal1
3db8c87c7c add -B option 2015-03-31 22:03:32 -04:00
akimbal1
9aa73eed4f add splitter 2015-03-31 21:53:14 -04:00
Kenneth Heafield
6626d59cfc Less error-like complaint when substituting fallback discounts / Matthias Huck 2015-03-31 21:51:38 -04:00
Hieu Hoang
1a61e1405f delete External FF. FF framework changes too fast to be able to keep this up-to-date 2015-03-31 19:45:59 +04:00
Matthias Huck
633e7be8f0 integer overflows in Good-Turing discounting 2015-03-30 17:42:55 +01:00
Phil Williams
6ce3060dd8 lmplz-wrapper.perl: use Getopt::Long's "pass_through" option
This avoids the need to duplicate all of lmplz's options in the wrapper and
it prevents --prune 0 0 1 from being truncated to --prune 0 if the user forgets
to quote the arguments.
2015-03-30 10:18:51 +01:00
Jeroen Vermeulen
c634f6ee5b Remove some unused variables.
This silences a few more compiler warnings.
2015-03-30 10:26:39 +07:00
Jeroen Vermeulen
b124d99330 Use boost::filesystem for "rm -rf".
Replaces a system() call (which was a portability problem) and fixes,
en passant, a warning about its return value being ignored.
2015-03-29 18:33:58 +07:00
Jeroen Vermeulen
789a2e2bc3 Fix some compile warnings (gcc 4.9.2).
Mostly signed/unsigned comparisons and reordered member
initializations; also a few unused variables.

There are more, but if I chip away at them for a while, who knows, it
may catch on and warnings may eventually become socially stigmatizing.
:)
2015-03-29 18:10:51 +07:00
Jeroen Vermeulen
6fa57a1dac Modernize "C" includes in biconcor.
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on.  In this
branch the #include names are updated for the biconcor/ subdirectory.

C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 21:00:30 +07:00
Jeroen Vermeulen
4c59b350df Modernize "C" includes in contrib.
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on.  In this
branch the #include names are updated for the contrib/ subdirectory.

C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 20:58:46 +07:00
Jeroen Vermeulen
b3c0a29044 Modernize "C" includes in lm.
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on.  In this
branch the #include names are updated for the lm/ subdirectory.

C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 20:28:45 +07:00
Jeroen Vermeulen
536c6e375f Modernize "C" includes in mert.
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on.  In this
branch the #include names are updated for the mert/ subdirectory; more
branches to follow.

C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 20:20:58 +07:00
Jeroen Vermeulen
98e09b14cb Modernize "C" includes in misc.
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on.  In this
branch the #include names are updated for the misc/ subdirectory; more
branches to follow.

C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 20:18:39 +07:00
Jeroen Vermeulen
a9c8f44896 Modernize "C" includes in moses.
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on.  In this
branch the #include names are updated for the moses/ subdirectory; more
branches to follow.

C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 20:09:03 +07:00
Jeroen Vermeulen
9852a0c2ff Modernize "C" includes in phrase-extract.
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on.  In this
branch the #include names are updated for the phrase-extract/
subdirectory; more branches to follow.

C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 19:56:20 +07:00
Jeroen Vermeulen
dea76752e9 Modernize "C" includes in search.
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on.  In this
branch the #include names are updated for the search/ subdirectory; more
branches to follow.

C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 19:48:20 +07:00
Jeroen Vermeulen
88e90957a1 Modernize "C" includes in util.
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on.  In this
branch the #include names are updated for the util/ subdirectory; more
branches to follow.

C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 19:37:48 +07:00
Kenneth Heafield
206d0c9698 Make NPLM wrapper compile again. Fixes #102. 2015-03-26 11:57:41 -04:00
Kenneth Heafield
8b323abbca KenLM 240ea65a021574261a38d45eb68143f26ad177e5 2015-03-25 10:40:21 -04:00
Hieu Hoang
1064aaacbe delete typedefs for UINT32 and UINT64. MSVC now has uint32_t and uint64_t /Ken 2015-03-25 00:55:39 +00:00
Jeroen Vermeulen
cdc5e0fb33 Check for MinGW using __MINGW32__, not MINGW.
This code used the wrong preprocessor macro to check whether it was
compiling in a MinGW environment, as far as I can see.
2015-03-23 22:35:03 +07:00
Jeroen Vermeulen
9ae4e2b2fb Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-03-23 22:12:34 +07:00
Jeroen Vermeulen
01d3db1629 Adjust for MinGW's mkdir().
MinGW has mkdir(), but it only accepts one argument: the directory's
path.  It does not accept a file mode.
2015-03-23 22:00:01 +07:00
Matthias Huck
506427368f filter-model-given-input.pl: drop "-encoding None" from phrase table binaization with processPhraseTableMin. Recommended by Marcin. 2015-03-23 14:38:24 +00:00