mirror of https://github.com/marian-nmt/marian.git synced 2024-11-30 21:39:52 +03:00

Fast Neural Machine Translation in C++

Go to file

Marcin Junczys-Dowmunt 7333366551 reorganized files, first attempt at logging		2016-04-20 12:44:59 +02:00
scripts	Shebang	2016-04-18 14:55:25 +00:00
src	reorganized files, first attempt at logging	2016-04-20 12:44:59 +02:00
.gitignore	Add python's pyc files to ignore	2016-04-15 16:27:48 +02:00
CMakeLists.txt	reorganized files, first attempt at logging	2016-04-20 12:44:59 +02:00
LICENSE	Initial commit	2016-04-14 11:27:41 +01:00
README.md	Update README.md	2016-04-18 22:05:53 +01:00

amuNN

A C++ decoder for Neural Machine Translation (NMT) models trained with Theano-based scripts from DL4MT (https://github.com/nyu-dl/dl4mt-tutorial)

Requirements:

The project is a standard Cmake out-of-source build:

mkdir build
cd build
cmake .. -DKENLM=path/to/kenlm
make -j

On Ubuntu 16.04, you need g++4.9 and cuda-7.5 and a boost version compiled with g++4.9

CUDA_BIN_PATH=/usr/local/cuda-7.5 BOOST_ROOT=/home/marcin/myboost cmake .. \
-DCMAKE_CXX_COMPILER=g++-4.9 -DCUDA_HOST_COMPILER=/usr/bin/g++-4.9

Vocabularies (*.pkl extension) need to be converted to text with the scripts in the scripts folder.

python scripts/vocab2txt.py vocab.en.pkl > vocab.en