Commit Graph

1126 Commits

Author SHA1 Message Date
XapaJIaMnu
070e0a83e1 LD_PRELOAD syntax was wrong 2015-07-02 16:57:17 +01:00
Hieu Hoang
930dce10bf prune multiple files at once. Make up for failure in ems to give the full path of the gen table 2015-06-25 13:02:29 +04:00
Hieu Hoang
c80df1212e prune multiple files at once. Make up for failure in ems to give the full path of the gen table 2015-06-25 10:48:35 +04:00
Hieu Hoang
555f91eb7e codelite 2015-06-24 18:31:05 +04:00
Hieu Hoang
0f943dd9c1 clang compile errors 2015-06-21 21:16:12 +04:00
Hieu Hoang
57f9e0f6af Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-06-14 20:32:31 +04:00
Hieu Hoang
bf1dad4fe9 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-06-04 14:52:00 +04:00
Hieu Hoang
3ea5faead8 codelite 2015-06-02 21:44:58 +04:00
Hieu Hoang
d624fc903e Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-06-02 21:04:51 +04:00
Jeroen Vermeulen
1d7ed728ee Rename ‘aux’ to ‘auxiliary’ for Windows' sake.
Any filename like “aux” or “aux.*” is special in Windows, and can't be
opened, dir'ed, and so on.  This was causing some people problems.
2015-06-03 00:00:57 +07:00
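Note on 1d7ed728ee: the sketch below illustrates the naming rule this commit message describes ("aux" or "aux.*" is special on Windows). The helper name is_aux_device_name is hypothetical and is not part of the repository. It is a simplified check that covers only the AUX case named in the message and assumes the input is a bare filename with no directory part.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical helper (not from the repository): returns true when a bare
// filename is "aux" or "aux.<anything>", compared case-insensitively.
bool is_aux_device_name(const std::string &filename) {
  // Everything before the first dot; the whole name if there is no dot.
  std::string stem = filename.substr(0, filename.find('.'));
  std::transform(stem.begin(), stem.end(), stem.begin(),
                 [](unsigned char c) { return static_cast<char>(std::tolower(c)); });
  return stem == "aux";
}
```

With this check, "aux" and "AUX.txt" are flagged, while the renamed "auxiliary" is not.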
Hieu Hoang
efdb8566b1 delete ChangeSource(). Not used 2015-06-02 21:00:32 +04:00
Hieu Hoang
fa57de7b52 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-06-02 11:46:47 +04:00
Hieu Hoang
df5aff2d82 eclipse 2015-05-24 15:12:05 +01:00
Hieu Hoang
ed8cf103f8 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-05-24 17:34:52 +04:00
Hieu Hoang
502e72ce91 eclipse 2015-05-24 17:08:57 +04:00
Hieu Hoang
097afb93df eclipse 2015-05-21 14:04:42 +04:00
Hieu Hoang
a922245864 default to using lmplz for convenience and because SRILM uses tonnes of memory 2015-05-12 11:44:05 +04:00
Hieu Hoang
a6b441dab9 codelite 2015-05-11 18:51:34 +04:00
XapaJIaMnu
8e6eb067bc Add profiling option to the testsuite. Untested. 2015-05-05 17:36:19 +01:00
XapaJIaMnu
dab63e8076 Initial work to add support for profiling mode in the testsuite 2015-04-30 23:46:50 +01:00
Hieu Hoang
15e4b16f49 delete unused var 2015-04-30 14:01:03 +04:00
Jeroen Vermeulen
eca5824100 Remove trailing whitespace in C++ files. 2015-04-30 12:05:11 +07:00
Hieu Hoang
e6951aa9f2 codelite 2015-04-27 23:01:14 +04:00
Hieu Hoang
0ee4feecb5 add extra parser argument to ChartCellCollection to support FF::EvaluateGivenAllOtherTransOpts() 2015-04-27 22:41:47 +04:00
Hieu Hoang
41529227b2 boost unique lock 2015-04-26 18:11:11 +04:00
Ulrich Germann
0d72cdd72c Merge branch 'master' of https://github.com/moses-smt/mosesdecoder into mmt-dev
Conflicts:
	moses/Syntax/F2S/Manager-inl.h
	moses/TranslationModel/UG/mmsapt.cpp
2015-04-26 02:12:16 +01:00
Jeroen Vermeulen
10bd942127 Seed the randomizer once, not every time.
This bit of mira code used to re-seed the randomizer on every call, instead
of just once on startup.  The result of time(NULL) was used as a seed, meaning
that every such call to the randomizer within the same second would return the
same value.
2015-04-24 20:00:07 +07:00
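Note on 10bd942127: a minimal sketch of the problem this commit message describes, not the actual mira code. All function names are hypothetical. An explicit seed parameter stands in for time(NULL), which returns the same value for every call made within one second.

```cpp
#include <cstdlib>

// Anti-pattern: re-seed on every call. With an unchanged seed (as with
// time(NULL) inside a single second) every call yields the same value.
int reseeding_rand(unsigned seed) {
  std::srand(seed);
  return std::rand();
}

// Fix: seed once at startup...
void seed_at_startup(unsigned seed) {
  std::srand(seed);
}

// ...and afterwards only draw from the already-seeded generator.
int draw() {
  return std::rand();
}
```

Calling reseeding_rand twice with the same seed returns the same number both times, which is the behaviour the commit removes.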
Matthias Huck
bbcc8bf23b Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-04-23 18:14:40 +01:00
Matthias Huck
f24f31f965 n-best list creation in phrase-based decoding: improved efficiency with sparse features 2015-04-23 18:13:02 +01:00
Jeroen Vermeulen
38d790cac0 Add cross-platform randomizer module.
The code uses two mechanisms for generating random numbers: srand()/rand(),
which is not thread-safe, and srandom()/random(), which is POSIX-specific.

Here I add a util/random.cc module that centralizes these calls, and unifies
some common usage patterns.  If the implementation is not good enough, we can
now change it in a single place.

To keep things simple, this uses the portable srand()/rand() but protects them
with a lock to avoid concurrency problems.

The hard part was to keep the regression tests passing: they rely on fixed
sequences of random numbers, so a small code change could break them very
thoroughly.  Util::rand(), for wide types like size_t, calls std::rand() not
once but twice.  This behaviour was generalized into utils::wide_rand() and
friends.
2015-04-23 23:46:04 +07:00
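Note on 38d790cac0: a rough sketch of the approach described in this commit message, namely srand()/rand() behind a lock plus a "wide" variant that calls rand() twice. This is not the contents of util/random.cc. The namespace, the function names, the use of std::mutex, and the way the two draws are combined are all assumptions made for illustration.

```cpp
#include <cstdlib>
#include <mutex>

namespace sketch {  // hypothetical namespace, not the repository's

std::mutex rand_lock;  // guards the non-thread-safe srand()/rand() pair

// Seed the shared generator under the lock.
void rand_init(unsigned seed) {
  std::lock_guard<std::mutex> guard(rand_lock);
  std::srand(seed);
}

// Draw one value in [0, RAND_MAX] under the lock.
int locked_rand() {
  std::lock_guard<std::mutex> guard(rand_lock);
  return std::rand();
}

// Wide variant: two draws combined into one larger value. The combination
// used here (high * (RAND_MAX + 1) + low) is an assumption.
unsigned long long wide_rand() {
  const unsigned long long high = static_cast<unsigned long long>(locked_rand());
  const unsigned long long low = static_cast<unsigned long long>(locked_rand());
  return high * (static_cast<unsigned long long>(RAND_MAX) + 1ULL) + low;
}

}  // namespace sketch
```

Because every call goes through one module, the same seed still reproduces the same sequence, which is the property the regression tests mentioned in the message rely on.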
Hieu Hoang
40933b4a78 hack to allow target side of tokenized parallel corpus to be used for LM 2015-04-22 19:01:12 +04:00
Jeroen Vermeulen
6a4943ca41 Replace deprecated bcopy() with memcpy().
The bcopy() function is POSIX-specific and deprecated.  The recommended
replacement (at least for non-overlapping source and destination ranges)
is memcpy(), which is in the standard C library.

Note that the source and destination parameters are in a different order
between these two functions.
2015-04-16 19:19:34 +07:00
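Note on 6a4943ca41: a small sketch of the argument-order difference this commit message warns about. The wrapper name copy_bytes is hypothetical. As the message says, memcpy() is only a suitable replacement when the source and destination ranges do not overlap.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical wrapper showing the replacement.
// Old call:  bcopy(source, dest, count);   // source comes first
// New call:  memcpy(dest, source, count);  // destination comes first
void copy_bytes(char *dest, const char *source, std::size_t count) {
  std::memcpy(dest, source, count);
}
```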
Jeroen Vermeulen
21a93421dc Replace deprecated bzero() with memset().
The bzero() function is POSIX-specific and deprecated.  The recommended
replacement is memset(), which is in the standard C library.
2015-04-16 19:03:57 +07:00
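Note on 21a93421dc: the equivalent replacement for bzero(), sketched with a hypothetical wrapper named clear_bytes. The fill value is the second argument of memset() and the byte count is the third.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical wrapper showing the replacement.
// Old call:  bzero(buffer, count);
// New call:  memset(buffer, 0, count);  // fill value second, count third
void clear_bytes(void *buffer, std::size_t count) {
  std::memset(buffer, 0, count);
}
```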
Hieu Hoang
044968bb4b Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-04-14 11:30:33 +04:00
Hieu Hoang
7af653ac80 misc script to parallelize madamira on grid engine 2015-04-14 11:29:56 +04:00
Hieu Hoang
2f7c328db9 codelite 2015-04-11 20:21:50 +04:00
Ulrich Germann
3e2f878576 Merge branch 'master' into mmt-dev
Conflicts:
	Jamroot
	moses/TranslationModel/UG/mmsapt.h
2015-04-05 15:51:50 +01:00
Ulrich Germann
71005445d1 Unused variable. 2015-04-03 21:11:45 +01:00
Hieu Hoang
0c58e19491 uncomment out lines that clang choked on. Now works 2015-04-02 22:39:44 +04:00
akimbal1
ad70c9a35d resolve conflicts 2015-04-02 14:15:15 -04:00
akimbal1
b4e24a2fb8 compile with clang 3.3 x86_x6, no warnings 2015-04-02 14:07:23 -04:00
Hieu Hoang
d71e516176 make it compile on osx/clang 2015-04-02 21:30:47 +04:00
Hieu Hoang
f0e4f8b557 Merge pull request #103 from aminorex/master
tokenizer improvements, faster better sentence splitter
2015-04-02 09:49:41 +04:00
Jeroen Vermeulen
ebc0930500 Replace use of tmpnam with boost::filesystem.
Silences a few annoying warnings from gcc: "tmpnam is dangerous" (and
the suggestion to use mkstemp instead).
2015-04-02 10:42:06 +07:00
akimbal1
8cea968067 handle asian stock tickers better 2015-04-01 18:49:51 -04:00
akimbal1
6771bf9142 Merge remote-tracking branch 'upstream/master' 2015-04-01 18:30:44 -04:00
akimbal1
d4ef9ce106 make -a work more like the perl tokenizer 2015-04-01 18:26:19 -04:00
akimbal1
2e39e829bf splitter and tokenizer tweaks, multithreading tokenizer 2015-04-01 15:49:32 -04:00
akimbal1
fd596b1972 splitter tweaks 2015-04-01 02:21:03 -04:00
akimbal1
3db8c87c7c add -B option 2015-03-31 22:03:32 -04:00