mirror of https://github.com/marian-nmt/marian.git synced 2024-11-27 10:33:14 +03:00

Fast Neural Machine Translation in C++

cuda fast gpu neural-machine-translation starred-marian-nmt-repo starred-repo

Go to file

Marcin Junczys-Dowmunt 7e76b61f88 Towards YAML configurations		2016-04-28 20:00:43 +02:00
cmake	Towards YAML configurations	2016-04-28 20:00:43 +02:00
scripts	fixed tuning	2016-04-27 12:48:50 +02:00
src	Towards YAML configurations	2016-04-28 20:00:43 +02:00
.gitignore	Add python's pyc files to ignore	2016-04-15 16:27:48 +02:00
CMakeLists.txt	Towards YAML configurations	2016-04-28 20:00:43 +02:00
LICENSE	Initial commit	2016-04-14 11:27:41 +01:00
README.md	Update README.md	2016-04-21 00:07:59 +01:00

README.md

amuNN

A C++ decoder for Neural Machine Translation (NMT) models trained with Theano-based scripts from DL4MT (https://github.com/nyu-dl/dl4mt-tutorial)

Requirements:

CMake 3.5.1 (due to CUDA related bugs in earlier versions)
Boost 1.5
CUDA 7.5
KenLM (https://github.com/kpu/kenlm, current master)

Compilation

The project is a standard Cmake out-of-source build:

mkdir build
cd build
cmake .. -DKENLM=path/to/kenlm
make -j

On Ubuntu 16.04, you need g++4.9 and cuda-7.5 and a boost version compiled with g++4.9

CUDA_BIN_PATH=/usr/local/cuda-7.5 BOOST_ROOT=/path/to/custom/boost cmake .. \
-DCMAKE_CXX_COMPILER=g++-4.9 -DCUDA_HOST_COMPILER=/usr/bin/g++-4.9 -DKENLM=path/to/kenlm

Vocabularies (*.pkl extension) need to be converted to text with the scripts in the scripts folder.

python scripts/vocab2txt.py vocab.en.pkl > vocab.en