marian/scripts
Martin Junczys-Dowmunt c3fb60cbcd Merged PR 13476: Add LASER reimplementation and code for embeddings sentences
This reimplements the LASER encoder from:
```
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
Mikel Artetxe, Holger Schwenk
https://arxiv.org/abs/1812.10464
```

and adds functionality to embed sentences with any Marian encoder, also different from LASER. Some early attempts to train a transformer model with Encoder-Decoder bottle-neck. This is quite early code, so some code-duplication is to be expected. Nevertheless, it's functional and I would like to have it in master as we will slowly put that into production in various places. I will make the code "nicer" as we go along.
2020-06-24 01:54:27 +00:00
..
bert add back execution rights for python/perl scripts 2019-04-30 12:52:13 -07:00
checkpoints python3 shebang from #620 (#621) 2020-04-16 11:15:42 +01:00
contrib python3 shebang from #620 (#621) 2020-04-16 11:15:42 +01:00
embeddings python3 shebang from #620 (#621) 2020-04-16 11:15:42 +01:00
laser Merged PR 13476: Add LASER reimplementation and code for embeddings sentences 2020-06-24 01:54:27 +00:00
onnx Merged PR 13081: minor regression fix in ONNX expoeter 2020-05-27 06:07:28 +00:00
server Update Simple-WebSocket-Server and move it to submodules (#639) 2020-04-27 10:34:10 +01:00
shortlist add back execution rights for python/perl scripts 2019-04-30 12:52:13 -07:00