Cross platform C++ library focusing on optimized machine translation on the consumer-grade device.
Go to file
Abhishek Aggarwal b86f8a7dc2 Improved README
- Clears up the spaghetti of model packaging
 - Usage instructions
 - Formatting changes
2021-02-17 14:21:51 +01:00
.github/workflows CI scripts: master -> main 2021-01-23 14:39:08 +00:00
3rd_party Updated marian submodule 2021-02-15 16:36:26 +01:00
app Changed translate() API from non-blocking to blocking 2021-02-10 11:15:16 +01:00
doc Unified api draft (#1) 2020-10-29 09:17:32 +01:00
docker Add reproducible docker-based builds + let test page use these by default 2021-02-15 11:27:47 +02:00
src Improved cmake to use wasm compilation flags across project 2021-02-12 11:36:33 +01:00
wasm Updated test page to use the model structure from bergamot-models repo 2021-02-16 17:00:45 +02:00
.gitignore Add reproducible docker-based builds + let test page use these by default 2021-02-15 11:27:47 +02:00
.gitmodules Updated ssplit submodule to a different repository 2021-02-10 10:33:01 +01:00
CMakeLists.txt Re-enable simd shuffle pattern for intgemm compilation 2021-02-15 17:18:59 +01:00
LICENSE Initial commit 2020-10-19 13:49:38 +02:00
README.md Improved README 2021-02-17 14:21:51 +01:00

Bergamot Translator

Bergamot translator provides a unified API for (Marian NMT framework based) neural machine translation functionality in accordance with the Bergamot project that focuses on improving client-side machine translation in a web browser.

Build Instructions

Build Natively

git clone  --recursive https://github.com/browsermt/bergamot-translator
cd bergamot-translator
mkdir build
cd build
cmake ../
make -j

Build WASM

Compiling for the first time

  1. Download and Install Emscripten using following instructions

    • Get the latest sdk: git clone https://github.com/emscripten-core/emsdk.git
    • Enter the cloned directory: cd emsdk
    • Install the lastest sdk tools: ./emsdk install latest
    • Activate the latest sdk tools: ./emsdk activate latest
    • Activate path variables: source ./emsdk_env.sh
  2. Clone the repository and checkout the appropriate branch using these instructions:

    git clone https://github.com/browsermt/bergamot-translator
    cd bergamot-translator
    git checkout -b wasm-integration origin/wasm-integration
    git submodule update --init --recursive
    
  3. Download models (only required if you want to package files in wasm binary)

    This step is only required if you want to package files (e.g. models, vocabularies etc.) into wasm binary. If you don't then just skip this step.

    The build preloads the files in Emscriptens virtual file system.

    If you want to use bergamot models, please follow these instructions:

    mkdir models
    git clone https://github.com/mozilla-applied-ml/bergamot-models
    cp -rf bergamot-models/* models
    gunzip models/*/*
    
  4. Compile

    1. Create a folder where you want to build all the artefacts (build-wasm in this case)

      mkdir build-wasm
      cd build-wasm
      
    2. Compile the artefacts

      • If you want to package files into wasm binary then execute following commands (Replace FILES_TO_PACKAGE with the absolute path of the directory containing the files to be packaged in wasm binary)

        emcmake cmake -DCOMPILE_WASM=on -DPACKAGE_DIR=FILES_TO_PACKAGE ../
        emmake make -j
        
      • If you don't want to package any file into wasm binary then execute following commands:

        emcmake cmake -DCOMPILE_WASM=on ../
        emmake make -j
        

    The artefacts (.js and .wasm files) will be available in wasm folder of build directory ("build-wasm" in this case).

Recompiling

As long as you don't update any submodule, just follow steps in 4.ii to recompile.
If you update a submodule, execute following command before executing steps in 4.ii to recompile.

git submodule update --init --recursive

How to use

Using Native version

The builds generate library that can be integrated to any project. All the public header files are specified in src folder.
A short example of how to use the APIs is provided in app/main.cpp file.

Using WASM version

Please follow the README inside the wasm folder of this repository that demonstrates how to use the translator in JavaScript.