fairseq/tests
Myle Ott 61aad8f9cd Force certain optimizers to set --fp16-no-flatten-grads (#1010)
Summary:
When training with `--fp16` we usually flatten the grads since it's faster. But flat grads are not semantically equivalent for certain optimizers (e.g., Adafactor, LAMB), so users previously had to know about this and set `--fp16-no-flatten-grads` themselves. Raise a RuntimeError in this case instead.
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/1010

Differential Revision: D19575773

Pulled By: myleott

fbshipit-source-id: bac99c3026f9870e6127e0fa55f70e8a3e4507dc
2020-01-28 08:02:30 -08:00
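
The check described in the commit summary above can be sketched roughly as follows. This is a minimal, self-contained illustration, not the code from the pull request: `ToyOptimizer`, its `supports_flat_params` flag, and `check_fp16_flattening` are hypothetical names chosen for this example, and only the `--fp16-no-flatten-grads` flag and the use of `RuntimeError` come from the summary.

```python
from dataclasses import dataclass


@dataclass
class ToyOptimizer:
    """Hypothetical stand-in for an optimizer wrapper (not a fairseq class)."""
    name: str
    # Assumed capability flag: False for optimizers such as Adafactor or LAMB,
    # whose updates depend on per-parameter shapes or norms.
    supports_flat_params: bool


def check_fp16_flattening(optimizer: ToyOptimizer, fp16_no_flatten_grads: bool) -> bool:
    """Return True if grads may be flattened; raise if the combination is invalid."""
    flatten = not fp16_no_flatten_grads
    if flatten and not optimizer.supports_flat_params:
        # Fail early instead of silently training with non-equivalent updates.
        raise RuntimeError(
            f"{optimizer.name} does not support flat grads, "
            "please set --fp16-no-flatten-grads"
        )
    return flatten


if __name__ == "__main__":
    adam = ToyOptimizer("adam", supports_flat_params=True)
    adafactor = ToyOptimizer("adafactor", supports_flat_params=False)

    print(check_fp16_flattening(adam, fp16_no_flatten_grads=False))
    print(check_fp16_flattening(adafactor, fp16_no_flatten_grads=True))
    try:
        check_fp16_flattening(adafactor, fp16_no_flatten_grads=False)
    except RuntimeError as err:
        print(f"RuntimeError: {err}")
```

Raising at setup time makes the misconfiguration visible before any training step runs, rather than relying on the user to remember the flag.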
speech_recognition fix Windows build (#1007) 2020-01-24 10:32:20 -08:00
__init__.py fairseq-py goes distributed (#106) 2018-02-27 17:09:42 -05:00
test_average_checkpoints.py Small fixes 2019-08-19 15:08:25 -07:00
test_backtranslation_dataset.py Add a diverse beam search variant to sequence_generator.py (#953) 2020-01-06 08:24:02 -08:00
test_binaries.py Force certain optimizers to set --fp16-no-flatten-grads (#1010) 2020-01-28 08:02:30 -08:00
test_bmuf.py Create build.yml 2019-12-17 20:45:11 -08:00
test_character_token_embedder.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_concat_dataset.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_convtbc.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_dictionary.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_export.py Cleanup new incremental state API (#1005) 2020-01-27 10:25:33 -08:00
test_file_io.py Added unit test for PathManager file io (with or without fvcore). 2019-12-09 14:19:51 -08:00
test_iterators.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_label_smoothing.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_memory_efficient_fp16.py Clean up tests 2020-01-22 11:29:20 -08:00
test_metrics.py Fix logging of training sets (fixes #1632) (#1634) 2020-01-20 16:34:33 -08:00
test_multi_corpus_sampled_dataset.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_multihead_attention.py Fixing key padding mask during transformer generation 2019-11-05 06:50:53 -08:00
test_noising.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_reproducibility.py Script MultiheadAttention (#1002) 2020-01-21 18:35:28 -08:00
test_resampling_dataset.py Add dataset class for weighted sampling with replacement. (#861) 2019-09-19 10:36:00 -07:00
test_sequence_generator.py Add a diverse beam search variant to sequence_generator.py (#953) 2020-01-06 08:24:02 -08:00
test_sequence_scorer.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_sparse_multihead_attention.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_token_block_dataset.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_train.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
test_utils.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
utils.py Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00