fairseq

mirror of https://github.com/facebookresearch/fairseq.git synced 2024-08-17 04:20:36 +03:00

Author	SHA1	Message	Date
Piyush Kansal	e29f53bfea	initial revision (#5328)	2023-09-15 15:01:49 -04:00
alexeib	995c204337	Data2vec prelim (#2929) Summary: Preliminaries for data2vec release, include some minor improvements and bug fixes Most important change is that we now default to raising an exception when fields in config do not have a corresponding field in the model dataclass Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/2929 Reviewed By: wnhsu Differential Revision: D33649708 Pulled By: alexeib fbshipit-source-id: 629bdb4c361550740b451c570c2005bb956c6fcb	2022-01-20 00:02:16 -08:00
Vimal Manohar	cf8ff8c3c5	Add unittests for jitting EMA model Summary: As title Reviewed By: nayansinghal Differential Revision: D32005717 fbshipit-source-id: ebdf1ed0e4a2b9fccffd841d0fa7be0b50ec6b79	2022-01-13 01:53:42 -08:00
dianaml0	0dfd6b6240	Add linting with black (#2678) Summary: # Before submitting - [ ] Was this discussed/approved via a Github issue? (no need for typos, doc improvements) - [ ] Did you read the [contributor guideline](https://github.com/pytorch/fairseq/blob/main/CONTRIBUTING.md)? - [ ] Did you make sure to update the docs? - [ ] Did you write any new necessary tests? ## What does this PR do? Fixes # (issue). ## PR review Anyone in the community is free to review the PR once the tests have passed. If we didn't discuss your PR in Github issues there's a high chance it will not be merged. ## Did you have fun? Make sure you had fun coding Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/2678 Reviewed By: Mortimerp9 Differential Revision: D32653381 Pulled By: dianaml0 fbshipit-source-id: 2810d14867cd7d64f4d340740e2b590b82de47fe	2021-11-29 12:32:59 -08:00
Myle Ott	6d23cc7e7c	Move checkpoint state_dict creation into Trainer (#1666) Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/1666 Context: the checkpoint saving call stack has become a bit convoluted: ``` train.py + checkpoint_utils.save_checkpoint + trainer.save_checkpoint + checkpoint_utils.save_state + checkpoint_utils.torch_persistent_save ``` This diff slightly simplifies the checkpoint saving logic by exposing a `state_dict` method inside the Trainer. This simplifies the call stack to: ``` train.py + checkpoint_utils.save_checkpoint + trainer.save_checkpoint + checkpoint_utils.torch_persistent_save ``` This new structure is important for the FullyShardedDataParallel diff (next diff in the stack), since it enables the Trainer to save multiple checkpoints for the different optimizer state shards. Test Plan: - unit tests - trained WMT En-De models; confirmed checkpoints save/load properly, resuming from a checkpoint gives identical results - `buck test fblearner/flow/projects/langtech/translation:tests` (2 failures are in trunk too): https://www.internalfb.com/intern/testinfra/testconsole/testrun/2533274840914654/ Reviewed By: zhengwy888 Differential Revision: D26771146 Pulled By: myleott fbshipit-source-id: 10f91979cd42205c1d8abcaa9ab56f63eba31e93	2021-03-04 13:32:44 -08:00
Eric Lou	7d2394b56f	ioPath async - Fairseq unittests (#1669) Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/1669 Unit tests for async writes integration done in D26467815 (`3100d0b8e5`). Ongoing performance tests: https://fb.quip.com/kjM7Atb1kKbO Reviewed By: myleott Differential Revision: D26732660 fbshipit-source-id: faf8cac67b9167af4195358c1a2592804c13562c	2021-03-03 10:50:39 -08:00
alexeib	15867e1284	migrate translation task (#1569) Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/1569 Test Plan: Imported from OSS tests + ran ``` python fairseq_cli/train.py \ ~/data/iwslt14.de-en \ --arch transformer_iwslt_de_en --share-decoder-input-output-embed \ --optimizer adam --adam-betas '(0.9, 0.98)' --clip-norm 0.0 \ --lr 5e-4 --lr-scheduler inverse_sqrt --warmup-updates 4000 \ --dropout 0.3 --weight-decay 0.0001 \ --criterion label_smoothed_cross_entropy --label-smoothing 0.1 \ --max-tokens 4096 \ --eval-bleu \ --eval-bleu-args '{"beam": 5, "max_len_a": 1.2, "max_len_b": 10}' \ --eval-bleu-detok moses \ --eval-bleu-remove-bpe \ --eval-bleu-print-samples \ --best-checkpoint-metric bleu --maximize-best-checkpoint-metric ``` Reviewed By: myleott Differential Revision: D25967217 Pulled By: alexeib fbshipit-source-id: 808f3cb0939fa13e1e05f39bfa02a7fb0b152940	2021-01-20 18:01:18 -08:00
Myle Ott	9cf0bd96d6	Add/fix tests (#1468) Summary: - add test for loading ensemble checkpoints (and confirmed it fails if I revert: `265791b727`) - add test for LayerDrop (and fix it) Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/1468 Reviewed By: alexeib Differential Revision: D25223272 Pulled By: myleott fbshipit-source-id: 3f06f753605af251567c70d2961f5506ea423499	2020-11-30 14:20:36 -08:00

8 Commits