fairseq

mirror of https://github.com/facebookresearch/fairseq.git synced 2024-09-11 17:25:31 +03:00


Myle Ott 61aad8f9cd Force certain optimizers to set --fp16-no-flatten-grads (#1010)	2020-01-28 08:02:30 -08:00

Summary: When training with `--fp16` we usually flatten the grads since it's faster. But flat grads are not semantically equivalent for certain optimizers (e.g., Adafactor, LAMB), thus the user needed to be aware of this and set `--fp16-no-flatten-grads`. Let's raise a RuntimeError in this case instead.

Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/1010
Differential Revision: D19575773
Pulled By: myleott
fbshipit-source-id: bac99c3026f9870e6127e0fa55f70e8a3e4507dc
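
The check described in the commit summary can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the `Args` dataclass, the `Optimizer` and `Adafactor` stand-in classes, the `supports_flat_params` attribute, the `check_flatten_grads` helper, and the error text are all hypothetical names chosen for this example. Only the two flags and the use of a RuntimeError come from the summary.

```python
from dataclasses import dataclass


@dataclass
class Args:
    """Hypothetical stand-in for the parsed --fp16 / --fp16-no-flatten-grads flags."""
    fp16: bool = False
    fp16_no_flatten_grads: bool = False


class Optimizer:
    """Hypothetical base class: flat gradients are assumed safe by default."""
    supports_flat_params = True


class Adafactor(Optimizer):
    """Stand-in only. Assumption: optimizers for which flat grads are not
    semantically equivalent opt out through this attribute."""
    supports_flat_params = False


def check_flatten_grads(args, optimizer):
    """Return True if grads would be flattened; raise if that is unsafe."""
    flatten = args.fp16 and not args.fp16_no_flatten_grads
    if flatten and not optimizer.supports_flat_params:
        # Fail loudly rather than train with semantically different updates.
        raise RuntimeError(
            "this optimizer does not support flat grads; "
            "please set --fp16-no-flatten-grads"
        )
    return flatten
```

Under these assumptions, `check_flatten_grads(Args(fp16=True), Adafactor())` raises, while adding `fp16_no_flatten_grads=True` lets the same call return `False` and training would proceed with unflattened grads.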
speech_recognition	fix Windows build (#1007)	2020-01-24 10:32:20 -08:00
__init__.py	fairseq-py goes distributed (#106)	2018-02-27 17:09:42 -05:00
test_average_checkpoints.py	Small fixes	2019-08-19 15:08:25 -07:00
test_backtranslation_dataset.py	Add a diverse beam search variant to sequence_generator.py (#953)	2020-01-06 08:24:02 -08:00
test_binaries.py	Force certain optimizers to set --fp16-no-flatten-grads (#1010)	2020-01-28 08:02:30 -08:00
test_bmuf.py	Create build.yml	2019-12-17 20:45:11 -08:00
test_character_token_embedder.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_concat_dataset.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_convtbc.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_dictionary.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_export.py	Cleanup new incremental state API (#1005)	2020-01-27 10:25:33 -08:00
test_file_io.py	Added unit test for PathManager file io (with or without fvcore).	2019-12-09 14:19:51 -08:00
test_iterators.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_label_smoothing.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_memory_efficient_fp16.py	Clean up tests	2020-01-22 11:29:20 -08:00
test_metrics.py	Fix logging of training sets (fixes #1632) (#1634)	2020-01-20 16:34:33 -08:00
test_multi_corpus_sampled_dataset.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_multihead_attention.py	Fixing key padding mask during transformer generation	2019-11-05 06:50:53 -08:00
test_noising.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_reproducibility.py	Script MultiheadAttention (#1002)	2020-01-21 18:35:28 -08:00
test_resampling_dataset.py	Add dataset class for weighted sampling with replacement. (#861)	2019-09-19 10:36:00 -07:00
test_sequence_generator.py	Add a diverse beam search variant to sequence_generator.py (#953)	2020-01-06 08:24:02 -08:00
test_sequence_scorer.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_sparse_multihead_attention.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_token_block_dataset.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_train.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
test_utils.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00
utils.py	Relicense fairseq under MIT license (#786)	2019-07-30 07:48:23 -07:00