fairseq/scripts
Sam Shleifer be1d186fa5 FSDP uses new optimizer gathering to save optimizer state (#1744)
Summary:
- Full unflattened optimizer state dict is in `checkpoints/shard_0.pt`, other checkpoint files do not have the `last_optimizer_state` key.
- requires master version of fairscale (eventually fairscale>=0.3.3)

Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/1744

Reviewed By: myleott

Differential Revision: D27342305

Pulled By: sshleifer

fbshipit-source-id: 7442b8c6ed01599d8ab0050213e84051f4e98acd
2021-03-26 07:18:59 -07:00
..
constraints Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
__init__.py remediation of S205607 2020-07-17 17:21:51 -07:00
average_checkpoints.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
build_sym_alignment.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
compare_namespaces.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
compound_split_bleu.sh fix parameter error (#1596) 2020-01-09 10:41:06 -08:00
convert_dictionary.lua Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
convert_model.lua Relicense fairseq under MIT license (#786) 2019-07-30 07:48:23 -07:00
count_docs.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
read_binarized.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
rm_pt.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
sacrebleu.sh Small fixes (#1215) 2020-07-14 14:17:13 -07:00
shard_docs.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
split_train_valid_docs.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
spm_decode.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
spm_encode.py Apply black+isort (#1357) 2020-10-18 18:14:51 -07:00
spm_train.py remove shlex.quote in scripts/spm_train.py (#972) 2019-08-19 07:34:46 -07:00
test_fsdp.sh FSDP uses new optimizer gathering to save optimizer state (#1744) 2021-03-26 07:18:59 -07:00