Commit Graph

262 Commits

Author SHA1 Message Date
Marcin Junczys-Dowmunt
5b868db6db make static tests optional 2018-04-18 13:23:16 -07:00
Marcin Junczys-Dowmunt
5a9bb9792b continue linking against shared zlib 2018-04-18 12:07:22 -07:00
Marcin Junczys-Dowmunt
3f1667726e get rid of obsolete python depdencies 2018-04-18 10:17:11 -07:00
Marcin Junczys-Dowmunt
d49a711474 Prefer static linking wherever possible 2018-04-18 08:26:46 -07:00
Marcin Junczys-Dowmunt
54a533d45e disable type checking for tensor::set 2018-04-14 20:59:57 -07:00
Marcin Junczys-Dowmunt
f0b18f240f add CMake build profiles 2018-04-13 11:30:14 -07:00
Marcin Junczys-Dowmunt
07e5105075 update test 2018-04-11 23:18:00 -07:00
Marcin Junczys-Dowmunt
78a9947374 working memoization 2018-04-11 16:50:45 -07:00
Marcin Junczys-Dowmunt
9928625001 add gemm from sharp models, dull hardware 2018-04-08 22:13:30 -07:00
CatarinaSilva
ade15ca0a2 Fix CMakeLists.txt 2018-04-05 15:09:28 +01:00
Marcin Junczys-Dowmunt
83e9e8927a add missing flag to CUDA_NVCC_FLAGS 2018-03-04 17:50:21 -08:00
Marcin Junczys-Dowmunt
8ec6eef9d5 more coda separation 2018-03-02 18:06:26 -08:00
Roman Grundkiewicz
059a45e661 Add CMake option: COMPILE_CPU 2018-03-01 12:36:27 +00:00
Marcin Junczys-Dowmunt
36e8077a7d fix issue #183 by using boost regex, version prior to g++-4.9 fail 2018-02-26 10:38:27 -08:00
Marcin Junczys-Dowmunt
b241011336 make search for MPI optional 2018-02-25 22:08:25 -08:00
Marcin Junczys-Dowmunt
cead1b4feb Restrict to OpenBLAS 2018-02-24 21:03:06 -08:00
Marcin Junczys-Dowmunt
dbba0f220d add cudnn back 2018-02-24 20:11:02 -08:00
Marcin Junczys-Dowmunt
845063b342 re-enable mpi 2018-02-24 17:56:33 -08:00
Marcin Junczys-Dowmunt
ccbbbb95d3 fix FindMKL 2018-02-24 16:11:53 -08:00
Marcin Junczys-Dowmunt
73c38cf82c clean up cpu version 2018-02-22 17:51:52 -08:00
Marcin Junczys-Dowmunt
8d3f70599f batched sgemm 2018-02-21 22:08:36 -08:00
Marcin Junczys-Dowmunt
f224db8257 add FinMKL 2018-02-21 21:39:29 -08:00
Marcin Junczys-Dowmunt
d9d66f416b prototype cpu version 2018-02-21 17:44:04 -08:00
Marcin Junczys-Dowmunt
4834267989 clean up allocators and devices 2018-02-15 15:23:45 -08:00
Roman Grundkiewicz
ae827f8bbe Merge branch 'master' of https://github.com/tneck/marian-dev into tneck-master 2018-01-31 11:23:05 +00:00
Marcin Junczys-Dowmunt
bfe6839880 sqlite3-based corpus source 2018-01-23 18:35:55 -08:00
r0452425
0a1cdbc9d5 Add logic and compile config for multi-node 2018-01-21 07:06:51 +00:00
Marcin Junczys-Dowmunt
97ddb30948 disable server if no openssl found 2018-01-13 11:24:03 -08:00
Roman Grundkiewicz
d9063529ea Add COMPILE_SERVER option to CMake 2018-01-09 14:18:57 +00:00
Tomasz Dwojak
1fbd30674a Merge branch 'master' of https://github.com/marian-nmt/marian-dev into charS2S 2017-11-27 13:26:01 +00:00
Hieu Hoang
fe7ca63a66 compilable with cuda 8 2017-11-24 17:13:19 +00:00
Hieu Hoang
a1b13e8cf0 gpu arch consistent with amun 2017-11-24 16:27:56 +00:00
Marcin Junczys-Dowmunt
39e15a6d3e remove tcmaloc_minimal 2017-11-24 17:06:25 +01:00
Tomasz Dwojak
537fccc3e8 Merge branch 'master' of https://github.com/marian-nmt/marian-dev into charS2S 2017-11-23 11:26:38 +00:00
Marcin Junczys-Dowmunt
e378a764ce update changelog 2017-11-21 09:45:46 +00:00
Tomasz Dwojak
8d33f14785 Set cudnn version at least 7 2017-11-15 23:30:41 +00:00
Marcin Junczys-Dowmunt
c8c7f2f62a remove lib 2017-11-02 23:25:54 +01:00
Marcin Junczys-Dowmunt
7a80d2a390 Merge branch 'master' into functional 2017-11-02 21:06:42 +01:00
Marcin Junczys-Dowmunt
ca64c429e4 new functional templates 2017-11-02 20:37:43 +01:00
Roman Grundkiewicz
a3b72c2ffe Add TCMalloc 2017-11-02 10:33:47 +00:00
Marcin Junczys-Dowmunt
0d20bb57c1 clean up permutation 2017-11-01 22:33:47 +01:00
Marcin Junczys-Dowmunt
c7d236762a more gpu-specific TMP 2017-11-01 20:09:42 +01:00
Marcin Junczys-Dowmunt
4f14c9175a unroll loop for variable GPU shape length 2017-11-01 10:28:14 +01:00
Marcin Junczys-Dowmunt
0daa611030 simplify shape 2017-11-01 01:07:02 +01:00
Roman Grundkiewicz
1f9fef5836 Show commit SHA with the project version 2017-08-30 13:35:47 +00:00
Roman Grundkiewicz
41f99c5136 Clean Boost components 2017-08-19 13:49:55 +01:00
Roman Grundkiewicz
2214a90181 Make OpenSSL library optional 2017-08-19 12:55:01 +01:00
Roman Grundkiewicz
95fe229d26 Add Simple-WebSocket-Server library 2017-08-19 08:01:31 +01:00
Roman Grundkiewicz
6bf7a14f6d Add python module returning version 2017-08-18 18:04:10 +01:00
Roman Grundkiewicz
7aaa3ba985 Switch to manual project versioning 2017-07-13 13:02:43 +01:00
Marcin Junczys-Dowmunt
0f4d113148 revert spdlog submodule for now 2017-07-11 16:27:45 +00:00
Ulrich Germann
0b0d2a4b18 Automatic submodule initialization should work now. 2017-07-10 14:12:08 +01:00
Ulrich Germann
e46aa14ae8 Made spdlog a submodule that will be updated automatically when cmake is run. 2017-07-09 00:40:56 +01:00
Roman Grundkiewicz
dfcb4ab1ba Add project versioning 2017-07-04 14:06:26 +01:00
Marcin Junczys-Dowmunt
b42141d613 merge with master 2017-06-29 21:43:25 +02:00
Roman Grundkiewicz
51618fec22 Add USE_CUDNN options to CMake; fix tests and examples compilation 2017-06-27 15:20:32 +01:00
Marcin Junczys-Dowmunt
7e79bd5d73 temporarily disable CUDNN, add missing ifdef 2017-06-26 19:15:07 +02:00
Tomasz Dwojak
ad468d55e8 Add CUDNN to CMakeLists 2017-06-21 11:10:51 +00:00
Marcin Junczys-Dowmunt
84c3121885 some more refactoring 2017-06-17 21:22:27 +02:00
Roman Grundkiewicz
3f51d65bd6 Add a framework for unit tests 2017-06-14 14:06:20 +02:00
Marcin Junczys-Dowmunt
77dfe53ed1 timing and compilation options 2017-06-02 12:43:36 +02:00
Marcin Junczys-Dowmunt
eae71b42e4 remove cudaHostRegister - +10% speed for multi-gpu 2017-06-01 00:03:55 +02:00
Roman Grundkiewicz
e7bcc2ea57 Add Iris example 2017-05-29 12:50:10 +02:00
Marcin Junczys-Dowmunt
7300b52727 more ensembles 2017-05-13 19:11:40 +02:00
Marcin Junczys-Dowmunt
e466abce2f scorers and ensembling 2017-05-10 19:43:00 +02:00
Marcin Junczys-Dowmunt
a2b10c9ecd fix scripts 2017-04-23 23:09:03 +02:00
Marcin Junczys-Dowmunt
98c84dd8df fix missing embeddings CDI transducer 2017-04-23 21:00:07 +02:00
Marcin Junczys-Dowmunt
e75d0dc748 lexprobs 2017-04-19 17:08:44 +02:00
Marcin Junczys-Dowmunt
63ed92d7ce translation-based validation 2017-04-13 14:15:14 +02:00
Ulrich Germann
8498134cf0 set(CMAKE_LIBRARY_PATH /usr/local/cuda-8.0/lib64) 2017-03-29 23:51:50 +01:00
Marcin Junczys-Dowmunt
ba91eda85d fix cmake 2017-03-23 20:30:21 +01:00
Ulrich Germann
d80502cd53 Allow Marian being built as library-only (no executables) within Amunmt.
Building executables also may be possible, but will require fiddling with the cmake setup.
2017-03-22 01:53:45 +00:00
Marcin Junczys-Dowmunt
af1177eab9 fixes #42 and #41 2017-03-20 16:46:16 +01:00
Marcin Junczys-Dowmunt
cdd4bd087c handle curand better in cmake 2017-03-02 16:34:05 +01:00
Marcin Junczys-Dowmunt
fcd99c49f7 scaling dropout across rnn time steps and layers 2017-02-26 20:24:17 +00:00
Marcin Junczys-Dowmunt
61db9defe3 beam search 2017-02-20 00:24:12 +00:00
Marcin Junczys-Dowmunt
a0daeff6d3 towards beam search 2017-02-18 23:04:29 +00:00
Marcin Junczys-Dowmunt
07bcb1d9d2 restore compile flags 2017-02-18 17:53:25 +00:00
Marcin Junczys-Dowmunt
e84c6d63fe greedy search 2017-02-18 17:52:45 +00:00
Tomasz Dwojak
d1e90650dc Small changes in dropout tests 2017-02-13 09:56:03 +00:00
Tomasz Dwojak
bc73140de3 Work on dropout 2017-02-06 14:06:05 +00:00
Marcin Junczys-Dowmunt
5f6889183c proper model loading for graph groups, some more refactoring, fixed tensor test 2017-02-06 00:22:08 +00:00
Marcin Junczys-Dowmunt
2ad81a1de0 enabled stream per thread 2017-02-04 00:26:34 +00:00
Marcin Junczys-Dowmunt
516531f9d0 preparations for multi-gpu 2017-02-01 01:07:33 +00:00
Marcin Junczys-Dowmunt
7646a3c01e works on different gpus 2017-01-28 23:47:59 +00:00
Marcin Junczys-Dowmunt
3ea3a46847 fixed order for cuda includes in CMake 2017-01-21 15:11:40 +00:00
Marcin Junczys-Dowmunt
3d7912e3fd added architecture for GPU 2017-01-21 14:07:58 +00:00
Marcin Junczys-Dowmunt
9bcbef8cc9 gradient clipping and fast reduce 2017-01-21 12:31:48 +00:00
Marcin Junczys-Dowmunt
8a5bfbf28f simplify code, remove clutter, prepare for grouped processing 2017-01-12 01:44:53 +01:00
Marcin Junczys-Dowmunt
6410d2a937 simple command line interface 2017-01-11 19:32:28 +01:00
Hieu Hoang
ec47eef77d compile error 2017-01-07 17:26:47 +00:00
Marcin Junczys-Dowmunt
566be8ebc1 towards working batching for parallel corpora 2017-01-05 20:42:49 +01:00
Marcin Junczys-Dowmunt
619fff5902 refactored a bit, added saving to nematus format, trai first models on three sentences 2017-01-04 23:14:44 +01:00
Marcin Junczys-Dowmunt
7fc232e3d5 a number of optimizations and bug fixes 2016-12-28 15:30:16 +00:00
Marcin Junczys-Dowmunt
07134aef36 improved reshaping for multi-dimensional tensors when doing 2D dot product 2016-12-26 21:53:25 +00:00
Marcin Junczys-Dowmunt
cbf57e13c5 more work on nematus reimplementation, temporary fix for mnist 2016-12-02 16:26:11 +01:00
Marcin Junczys-Dowmunt
59b434e479 towards finished decoder 2016-11-29 18:01:24 +01:00
Marcin Junczys-Dowmunt
47859e4ecd fast forward for GRU 2016-11-16 02:26:06 +01:00
Marcin Junczys-Dowmunt
7b3bd14893 working memory allocation 2016-11-13 00:11:52 +01:00
Marcin Junczys-Dowmunt
c74e749698 partially fixed memory problem 2016-11-12 02:12:35 +01:00
Marcin Junczys-Dowmunt
d7d6062dbd restored speed 2016-11-11 01:12:07 +01:00
Marcin Junczys-Dowmunt
349bca4416 fixed incorrect broadcasting 2016-11-10 19:45:03 +01:00
Marcin Junczys-Dowmunt
8056b07c7e debugging null pointers in virtual table 2016-10-19 18:18:46 +00:00
Marcin Junczys-Dowmunt
56e241695c removed dependencies on extended lambdas, fixes #15 2016-09-22 15:16:03 +02:00
Marcin Junczys-Dowmunt
a057ff1776 working dropout node 2016-09-21 00:39:27 +02:00
Marcin Junczys-Dowmunt
4c8b6bb171 a number of comments and test 2016-09-20 13:27:08 +02:00
Lane Schwartz
016c777d42 Don't make doc when make all 2016-09-16 19:21:15 +02:00
Lane Schwartz
5e9cad3169 Use CMake to invoke Doxygen 2016-09-16 18:28:00 +02:00
Marcin Junczys-Dowmunt
94914d56e8 working separated graphs, added params 2016-09-16 00:23:47 +02:00
Marcin Junczys-Dowmunt
cef8a37fc7 some clean-up 2016-08-28 00:24:41 +02:00
Marcin Junczys-Dowmunt
3db0f30312 function keywords 2016-05-08 18:42:19 +02:00
Marcin Junczys-Dowmunt
7fd950fbda working basic training 2016-05-07 22:56:23 +02:00