fairseq

mirror of https://github.com/facebookresearch/fairseq.git synced 2024-08-16 20:10:40 +03:00

History

Myle Ott 1cc8e95cec Don't cache epoch iterators when using sharded datasets (#1268 ) Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/1268 We previously had a memory leak when using sharded datasets. In particular, each sharded dataset is a new FairseqDataset instance, and the cache is keyed by the `dataset` instance. Since we never clear the cache, this would eventually cause the system to run out of CPU RAM. This diff disables caching when using sharded datasets. Note that we also change the signature to `get_batch_iterator`, which needs to propagate to many places. We previously avoided this update when adding `data_buffer_size`, so I'm also adding that everywhere. Reviewed By: ngoyal2707 Differential Revision: D23319135 fbshipit-source-id: 6bcd6aee141ad9cc234448c49106a8dbf8ea1800		2020-09-09 06:20:31 -07:00
..
_static	Add documentation	2018-09-03 19:15:23 -04:00
command_line_tools.rst	Small fixes (#1215 )	2020-07-14 14:17:13 -07:00
conf.py	v0.8.0 -> v0.9.0 (#1452 )	2019-12-03 15:19:33 -08:00
criterions.rst	Merge internal changes (#283 )	2019-01-04 20:03:19 -08:00
data.rst	Miscellaneous documentation improvements: (#868 )	2019-09-23 12:27:12 -07:00
docutils.conf	Add documentation	2018-09-03 19:15:23 -04:00
fairseq_logo.png	Miscellaneous fixes (#1196 )	2020-06-24 10:08:53 -07:00
fairseq.gif	Miscellaneous fixes (#1196 )	2020-06-24 10:08:53 -07:00
getting_started.rst	Misc fixes (#2492 )	2020-08-20 06:42:10 -07:00
index.rst	Add documentation	2018-09-03 19:15:23 -04:00
lr_scheduler.rst	Add fairseq to PyPI (#495 )	2019-02-08 22:03:29 -08:00
make.bat	Add documentation	2018-09-03 19:15:23 -04:00
Makefile	Add documentation	2018-09-03 19:15:23 -04:00
models.rst	Updates to model API (#561 )	2019-05-15 07:12:41 -07:00
modules.rst	Updates to model API (#561 )	2019-05-15 07:12:41 -07:00
optim.rst	Merge internal changes (#654 )	2019-04-29 19:50:58 -07:00
overview.rst	add vq-wav2vec (#1029 )	2020-02-29 18:25:34 -08:00
requirements.txt	Update documentation	2018-09-03 20:03:37 -04:00
tasks.rst	Add documentation	2018-09-03 19:15:23 -04:00
tutorial_classifying_names.rst	Don't cache epoch iterators when using sharded datasets (#1268 )	2020-09-09 06:20:31 -07:00
tutorial_simple_lstm.rst	Misc fixes (#2448 )	2020-08-14 10:24:51 -07:00