Marcin Junczys-Dowmunt
|
5b868db6db
|
make static tests optional
|
2018-04-18 13:23:16 -07:00 |
|
Marcin Junczys-Dowmunt
|
5a9bb9792b
|
continue linking against shared zlib
|
2018-04-18 12:07:22 -07:00 |
|
Marcin Junczys-Dowmunt
|
3f1667726e
|
get rid of obsolete python dependencies
|
2018-04-18 10:17:11 -07:00 |
|
Marcin Junczys-Dowmunt
|
d49a711474
|
Prefer static linking wherever possible
|
2018-04-18 08:26:46 -07:00 |
|
Marcin Junczys-Dowmunt
|
54a533d45e
|
disable type checking for tensor::set
|
2018-04-14 20:59:57 -07:00 |
|
Marcin Junczys-Dowmunt
|
f0b18f240f
|
add CMake build profiles
|
2018-04-13 11:30:14 -07:00 |
|
Marcin Junczys-Dowmunt
|
07e5105075
|
update test
|
2018-04-11 23:18:00 -07:00 |
|
Marcin Junczys-Dowmunt
|
78a9947374
|
working memoization
|
2018-04-11 16:50:45 -07:00 |
|
Marcin Junczys-Dowmunt
|
9928625001
|
add gemm from sharp models, dull hardware
|
2018-04-08 22:13:30 -07:00 |
|
CatarinaSilva
|
ade15ca0a2
|
Fix CMakeLists.txt
|
2018-04-05 15:09:28 +01:00 |
|
Marcin Junczys-Dowmunt
|
83e9e8927a
|
add missing flag to CUDA_NVCC_FLAGS
|
2018-03-04 17:50:21 -08:00 |
|
Marcin Junczys-Dowmunt
|
8ec6eef9d5
|
more coda separation
|
2018-03-02 18:06:26 -08:00 |
|
Roman Grundkiewicz
|
059a45e661
|
Add CMake option: COMPILE_CPU
|
2018-03-01 12:36:27 +00:00 |
|
Marcin Junczys-Dowmunt
|
36e8077a7d
|
fix issue #183 by using boost regex; versions prior to g++-4.9 fail
|
2018-02-26 10:38:27 -08:00 |
|
Marcin Junczys-Dowmunt
|
b241011336
|
make search for MPI optional
|
2018-02-25 22:08:25 -08:00 |
|
Marcin Junczys-Dowmunt
|
cead1b4feb
|
Restrict to OpenBLAS
|
2018-02-24 21:03:06 -08:00 |
|
Marcin Junczys-Dowmunt
|
dbba0f220d
|
add cudnn back
|
2018-02-24 20:11:02 -08:00 |
|
Marcin Junczys-Dowmunt
|
845063b342
|
re-enable mpi
|
2018-02-24 17:56:33 -08:00 |
|
Marcin Junczys-Dowmunt
|
ccbbbb95d3
|
fix FindMKL
|
2018-02-24 16:11:53 -08:00 |
|
Marcin Junczys-Dowmunt
|
73c38cf82c
|
clean up cpu version
|
2018-02-22 17:51:52 -08:00 |
|
Marcin Junczys-Dowmunt
|
8d3f70599f
|
batched sgemm
|
2018-02-21 22:08:36 -08:00 |
|
Marcin Junczys-Dowmunt
|
f224db8257
|
add FindMKL
|
2018-02-21 21:39:29 -08:00 |
|
Marcin Junczys-Dowmunt
|
d9d66f416b
|
prototype cpu version
|
2018-02-21 17:44:04 -08:00 |
|
Marcin Junczys-Dowmunt
|
4834267989
|
clean up allocators and devices
|
2018-02-15 15:23:45 -08:00 |
|
Roman Grundkiewicz
|
ae827f8bbe
|
Merge branch 'master' of https://github.com/tneck/marian-dev into tneck-master
|
2018-01-31 11:23:05 +00:00 |
|
Marcin Junczys-Dowmunt
|
bfe6839880
|
sqlite3-based corpus source
|
2018-01-23 18:35:55 -08:00 |
|
r0452425
|
0a1cdbc9d5
|
Add logic and compile config for multi-node
|
2018-01-21 07:06:51 +00:00 |
|
Marcin Junczys-Dowmunt
|
97ddb30948
|
disable server if no openssl found
|
2018-01-13 11:24:03 -08:00 |
|
Roman Grundkiewicz
|
d9063529ea
|
Add COMPILE_SERVER option to CMake
|
2018-01-09 14:18:57 +00:00 |
|
Tomasz Dwojak
|
1fbd30674a
|
Merge branch 'master' of https://github.com/marian-nmt/marian-dev into charS2S
|
2017-11-27 13:26:01 +00:00 |
|
Hieu Hoang
|
fe7ca63a66
|
compilable with cuda 8
|
2017-11-24 17:13:19 +00:00 |
|
Hieu Hoang
|
a1b13e8cf0
|
gpu arch consistent with amun
|
2017-11-24 16:27:56 +00:00 |
|
Marcin Junczys-Dowmunt
|
39e15a6d3e
|
remove tcmalloc_minimal
|
2017-11-24 17:06:25 +01:00 |
|
Tomasz Dwojak
|
537fccc3e8
|
Merge branch 'master' of https://github.com/marian-nmt/marian-dev into charS2S
|
2017-11-23 11:26:38 +00:00 |
|
Marcin Junczys-Dowmunt
|
e378a764ce
|
update changelog
|
2017-11-21 09:45:46 +00:00 |
|
Tomasz Dwojak
|
8d33f14785
|
Set cudnn version at least 7
|
2017-11-15 23:30:41 +00:00 |
|
Marcin Junczys-Dowmunt
|
c8c7f2f62a
|
remove lib
|
2017-11-02 23:25:54 +01:00 |
|
Marcin Junczys-Dowmunt
|
7a80d2a390
|
Merge branch 'master' into functional
|
2017-11-02 21:06:42 +01:00 |
|
Marcin Junczys-Dowmunt
|
ca64c429e4
|
new functional templates
|
2017-11-02 20:37:43 +01:00 |
|
Roman Grundkiewicz
|
a3b72c2ffe
|
Add TCMalloc
|
2017-11-02 10:33:47 +00:00 |
|
Marcin Junczys-Dowmunt
|
0d20bb57c1
|
clean up permutation
|
2017-11-01 22:33:47 +01:00 |
|
Marcin Junczys-Dowmunt
|
c7d236762a
|
more gpu-specific TMP
|
2017-11-01 20:09:42 +01:00 |
|
Marcin Junczys-Dowmunt
|
4f14c9175a
|
unroll loop for variable GPU shape length
|
2017-11-01 10:28:14 +01:00 |
|
Marcin Junczys-Dowmunt
|
0daa611030
|
simplify shape
|
2017-11-01 01:07:02 +01:00 |
|
Roman Grundkiewicz
|
1f9fef5836
|
Show commit SHA with the project version
|
2017-08-30 13:35:47 +00:00 |
|
Roman Grundkiewicz
|
41f99c5136
|
Clean Boost components
|
2017-08-19 13:49:55 +01:00 |
|
Roman Grundkiewicz
|
2214a90181
|
Make OpenSSL library optional
|
2017-08-19 12:55:01 +01:00 |
|
Roman Grundkiewicz
|
95fe229d26
|
Add Simple-WebSocket-Server library
|
2017-08-19 08:01:31 +01:00 |
|
Roman Grundkiewicz
|
6bf7a14f6d
|
Add python module returning version
|
2017-08-18 18:04:10 +01:00 |
|
Roman Grundkiewicz
|
7aaa3ba985
|
Switch to manual project versioning
|
2017-07-13 13:02:43 +01:00 |
|
Marcin Junczys-Dowmunt
|
0f4d113148
|
revert spdlog submodule for now
|
2017-07-11 16:27:45 +00:00 |
|
Ulrich Germann
|
0b0d2a4b18
|
Automatic submodule initialization should work now.
|
2017-07-10 14:12:08 +01:00 |
|
Ulrich Germann
|
e46aa14ae8
|
Made spdlog a submodule that will be updated automatically when cmake is run.
|
2017-07-09 00:40:56 +01:00 |
|
Roman Grundkiewicz
|
dfcb4ab1ba
|
Add project versioning
|
2017-07-04 14:06:26 +01:00 |
|
Marcin Junczys-Dowmunt
|
b42141d613
|
merge with master
|
2017-06-29 21:43:25 +02:00 |
|
Roman Grundkiewicz
|
51618fec22
|
Add USE_CUDNN options to CMake; fix tests and examples compilation
|
2017-06-27 15:20:32 +01:00 |
|
Marcin Junczys-Dowmunt
|
7e79bd5d73
|
temporarily disable CUDNN, add missing ifdef
|
2017-06-26 19:15:07 +02:00 |
|
Tomasz Dwojak
|
ad468d55e8
|
Add CUDNN to CMakeLists
|
2017-06-21 11:10:51 +00:00 |
|
Marcin Junczys-Dowmunt
|
84c3121885
|
some more refactoring
|
2017-06-17 21:22:27 +02:00 |
|
Roman Grundkiewicz
|
3f51d65bd6
|
Add a framework for unit tests
|
2017-06-14 14:06:20 +02:00 |
|
Marcin Junczys-Dowmunt
|
77dfe53ed1
|
timing and compilation options
|
2017-06-02 12:43:36 +02:00 |
|
Marcin Junczys-Dowmunt
|
eae71b42e4
|
remove cudaHostRegister - +10% speed for multi-gpu
|
2017-06-01 00:03:55 +02:00 |
|
Roman Grundkiewicz
|
e7bcc2ea57
|
Add Iris example
|
2017-05-29 12:50:10 +02:00 |
|
Marcin Junczys-Dowmunt
|
7300b52727
|
more ensembles
|
2017-05-13 19:11:40 +02:00 |
|
Marcin Junczys-Dowmunt
|
e466abce2f
|
scorers and ensembling
|
2017-05-10 19:43:00 +02:00 |
|
Marcin Junczys-Dowmunt
|
a2b10c9ecd
|
fix scripts
|
2017-04-23 23:09:03 +02:00 |
|
Marcin Junczys-Dowmunt
|
98c84dd8df
|
fix missing embeddings CDI transducer
|
2017-04-23 21:00:07 +02:00 |
|
Marcin Junczys-Dowmunt
|
e75d0dc748
|
lexprobs
|
2017-04-19 17:08:44 +02:00 |
|
Marcin Junczys-Dowmunt
|
63ed92d7ce
|
translation-based validation
|
2017-04-13 14:15:14 +02:00 |
|
Ulrich Germann
|
8498134cf0
|
set(CMAKE_LIBRARY_PATH /usr/local/cuda-8.0/lib64)
|
2017-03-29 23:51:50 +01:00 |
|
Marcin Junczys-Dowmunt
|
ba91eda85d
|
fix cmake
|
2017-03-23 20:30:21 +01:00 |
|
Ulrich Germann
|
d80502cd53
|
Allow Marian being built as library-only (no executables) within Amunmt.
Building executables also may be possible, but will require fiddling with the cmake setup.
|
2017-03-22 01:53:45 +00:00 |
|
Marcin Junczys-Dowmunt
|
af1177eab9
|
fixes #42 and #41
|
2017-03-20 16:46:16 +01:00 |
|
Marcin Junczys-Dowmunt
|
cdd4bd087c
|
handle curand better in cmake
|
2017-03-02 16:34:05 +01:00 |
|
Marcin Junczys-Dowmunt
|
fcd99c49f7
|
scaling dropout across rnn time steps and layers
|
2017-02-26 20:24:17 +00:00 |
|
Marcin Junczys-Dowmunt
|
61db9defe3
|
beam search
|
2017-02-20 00:24:12 +00:00 |
|
Marcin Junczys-Dowmunt
|
a0daeff6d3
|
towards beam search
|
2017-02-18 23:04:29 +00:00 |
|
Marcin Junczys-Dowmunt
|
07bcb1d9d2
|
restore compile flags
|
2017-02-18 17:53:25 +00:00 |
|
Marcin Junczys-Dowmunt
|
e84c6d63fe
|
greedy search
|
2017-02-18 17:52:45 +00:00 |
|
Tomasz Dwojak
|
d1e90650dc
|
Small changes in dropout tests
|
2017-02-13 09:56:03 +00:00 |
|
Tomasz Dwojak
|
bc73140de3
|
Work on dropout
|
2017-02-06 14:06:05 +00:00 |
|
Marcin Junczys-Dowmunt
|
5f6889183c
|
proper model loading for graph groups, some more refactoring, fixed tensor test
|
2017-02-06 00:22:08 +00:00 |
|
Marcin Junczys-Dowmunt
|
2ad81a1de0
|
enabled stream per thread
|
2017-02-04 00:26:34 +00:00 |
|
Marcin Junczys-Dowmunt
|
516531f9d0
|
preparations for multi-gpu
|
2017-02-01 01:07:33 +00:00 |
|
Marcin Junczys-Dowmunt
|
7646a3c01e
|
works on different gpus
|
2017-01-28 23:47:59 +00:00 |
|
Marcin Junczys-Dowmunt
|
3ea3a46847
|
fixed order for cuda includes in CMake
|
2017-01-21 15:11:40 +00:00 |
|
Marcin Junczys-Dowmunt
|
3d7912e3fd
|
added architecture for GPU
|
2017-01-21 14:07:58 +00:00 |
|
Marcin Junczys-Dowmunt
|
9bcbef8cc9
|
gradient clipping and fast reduce
|
2017-01-21 12:31:48 +00:00 |
|
Marcin Junczys-Dowmunt
|
8a5bfbf28f
|
simplify code, remove clutter, prepare for grouped processing
|
2017-01-12 01:44:53 +01:00 |
|
Marcin Junczys-Dowmunt
|
6410d2a937
|
simple command line interface
|
2017-01-11 19:32:28 +01:00 |
|
Hieu Hoang
|
ec47eef77d
|
compile error
|
2017-01-07 17:26:47 +00:00 |
|
Marcin Junczys-Dowmunt
|
566be8ebc1
|
towards working batching for parallel corpora
|
2017-01-05 20:42:49 +01:00 |
|
Marcin Junczys-Dowmunt
|
619fff5902
|
refactored a bit, added saving to nematus format, trained first models on three sentences
|
2017-01-04 23:14:44 +01:00 |
|
Marcin Junczys-Dowmunt
|
7fc232e3d5
|
a number of optimizations and bug fixes
|
2016-12-28 15:30:16 +00:00 |
|
Marcin Junczys-Dowmunt
|
07134aef36
|
improved reshaping for multi-dimensional tensors when doing 2D dot product
|
2016-12-26 21:53:25 +00:00 |
|
Marcin Junczys-Dowmunt
|
cbf57e13c5
|
more work on nematus reimplementation, temporary fix for mnist
|
2016-12-02 16:26:11 +01:00 |
|
Marcin Junczys-Dowmunt
|
59b434e479
|
towards finished decoder
|
2016-11-29 18:01:24 +01:00 |
|
Marcin Junczys-Dowmunt
|
47859e4ecd
|
fast forward for GRU
|
2016-11-16 02:26:06 +01:00 |
|
Marcin Junczys-Dowmunt
|
7b3bd14893
|
working memory allocation
|
2016-11-13 00:11:52 +01:00 |
|
Marcin Junczys-Dowmunt
|
c74e749698
|
partially fixed memory problem
|
2016-11-12 02:12:35 +01:00 |
|
Marcin Junczys-Dowmunt
|
d7d6062dbd
|
restored speed
|
2016-11-11 01:12:07 +01:00 |
|
Marcin Junczys-Dowmunt
|
349bca4416
|
fixed incorrect broadcasting
|
2016-11-10 19:45:03 +01:00 |
|
Marcin Junczys-Dowmunt
|
8056b07c7e
|
debugging null pointers in virtual table
|
2016-10-19 18:18:46 +00:00 |
|
Marcin Junczys-Dowmunt
|
56e241695c
|
removed dependencies on extended lambdas, fixes #15
|
2016-09-22 15:16:03 +02:00 |
|
Marcin Junczys-Dowmunt
|
a057ff1776
|
working dropout node
|
2016-09-21 00:39:27 +02:00 |
|
Marcin Junczys-Dowmunt
|
4c8b6bb171
|
a number of comments and test
|
2016-09-20 13:27:08 +02:00 |
|
Lane Schwartz
|
016c777d42
|
Don't make doc when make all
|
2016-09-16 19:21:15 +02:00 |
|
Lane Schwartz
|
5e9cad3169
|
Use CMake to invoke Doxygen
|
2016-09-16 18:28:00 +02:00 |
|
Marcin Junczys-Dowmunt
|
94914d56e8
|
working separated graphs, added params
|
2016-09-16 00:23:47 +02:00 |
|
Marcin Junczys-Dowmunt
|
cef8a37fc7
|
some clean-up
|
2016-08-28 00:24:41 +02:00 |
|
Marcin Junczys-Dowmunt
|
3db0f30312
|
function keywords
|
2016-05-08 18:42:19 +02:00 |
|
Marcin Junczys-Dowmunt
|
7fd950fbda
|
working basic training
|
2016-05-07 22:56:23 +02:00 |
|