mirror of https://github.com/facebookresearch/fairseq.git synced 2024-10-04 04:37:58 +03:00

History

Myle Ott b41c74dc5b Add code for "Pay Less Attention with Lightweight and Dynamic Convolutions" (#473 ) Summary: Changelog: - `e330f56`: Add code for the "Pay Less Attention with Lightweight and Dynamic Convolutions" paper - `5e3b98c`: Add scripts for computing tokenized BLEU with compound splitting and sacrebleu - update READMEs - misc fixes Pull Request resolved: https://github.com/pytorch/fairseq/pull/473 Differential Revision: D13819717 Pulled By: myleott fbshipit-source-id: f2dc12ea89a436b950cafec3593ed1b04af808e9	2019-01-25 15:40:26 -08:00
..
README.md	Add code for "Pay Less Attention with Lightweight and Dynamic Convolutions" (#473 )	2019-01-25 15:40:26 -08:00

Myle Ott b41c74dc5b Add code for "Pay Less Attention with Lightweight and Dynamic Convolutions" (#473 )

Summary:
Changelog:
- `e330f56`: Add code for the "Pay Less Attention with Lightweight and Dynamic Convolutions" paper
- `5e3b98c`: Add scripts for computing tokenized BLEU with compound splitting and sacrebleu
- update READMEs
- misc fixes
Pull Request resolved: https://github.com/pytorch/fairseq/pull/473

Differential Revision: D13819717

Pulled By: myleott

fbshipit-source-id: f2dc12ea89a436b950cafec3593ed1b04af808e9

2019-01-25 15:40:26 -08:00

README.md

Add code for "Pay Less Attention with Lightweight and Dynamic Convolutions" (#473 )

2019-01-25 15:40:26 -08:00

README.md

Convolutional Sequence to Sequence Learning (Gehring et al., 2017)

Pre-trained models

Description	Dataset	Model	Test set(s)
Convolutional (Gehring et al., 2017)	WMT14 English-French	download (.tar.bz2)	newstest2014: download (.tar.bz2) newstest2012/2013: download (.tar.bz2)
Convolutional (Gehring et al., 2017)	WMT14 English-German	download (.tar.bz2)	newstest2014: download (.tar.bz2)
Convolutional (Gehring et al., 2017)	WMT17 English-German	download (.tar.bz2)	newstest2014: download (.tar.bz2)

Example usage

See the translation README for instructions on reproducing results for WMT'14 En-De and WMT'14 En-Fr using the fconv_wmt_en_de and fconv_wmt_en_fr model architectures.

Citation

@inproceedings{gehring2017convs2s,
  title = {Convolutional Sequence to Sequence Learning},
  author = {Gehring, Jonas, and Auli, Michael and Grangier, David and Yarats, Denis and Dauphin, Yann N},
  booktitle = {Proc. of ICML},
  year = 2017,
}