fairseq/docs
Myle Ott 1cc8e95cec Don't cache epoch iterators when using sharded datasets (#1268)
Summary:
Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/1268

We previously had a memory leak when using sharded datasets. In particular,
each sharded dataset is a new FairseqDataset instance, and the cache is keyed
by the `dataset` instance. Since we never clear the cache, this would
eventually cause the system to run out of CPU RAM.

This diff disables caching when using sharded datasets.

Note that we also change the signature to `get_batch_iterator`, which needs to
propagate to many places. We previously avoided this update when adding
`data_buffer_size`, so I'm also adding that everywhere.

Reviewed By: ngoyal2707

Differential Revision: D23319135

fbshipit-source-id: 6bcd6aee141ad9cc234448c49106a8dbf8ea1800
2020-09-09 06:20:31 -07:00
..
_static Add documentation 2018-09-03 19:15:23 -04:00
command_line_tools.rst Small fixes (#1215) 2020-07-14 14:17:13 -07:00
conf.py v0.8.0 -> v0.9.0 (#1452) 2019-12-03 15:19:33 -08:00
criterions.rst Merge internal changes (#283) 2019-01-04 20:03:19 -08:00
data.rst Miscellaneous documentation improvements: (#868) 2019-09-23 12:27:12 -07:00
docutils.conf Add documentation 2018-09-03 19:15:23 -04:00
fairseq_logo.png Miscellaneous fixes (#1196) 2020-06-24 10:08:53 -07:00
fairseq.gif Miscellaneous fixes (#1196) 2020-06-24 10:08:53 -07:00
getting_started.rst Misc fixes (#2492) 2020-08-20 06:42:10 -07:00
index.rst Add documentation 2018-09-03 19:15:23 -04:00
lr_scheduler.rst Add fairseq to PyPI (#495) 2019-02-08 22:03:29 -08:00
make.bat Add documentation 2018-09-03 19:15:23 -04:00
Makefile Add documentation 2018-09-03 19:15:23 -04:00
models.rst Updates to model API (#561) 2019-05-15 07:12:41 -07:00
modules.rst Updates to model API (#561) 2019-05-15 07:12:41 -07:00
optim.rst Merge internal changes (#654) 2019-04-29 19:50:58 -07:00
overview.rst add vq-wav2vec (#1029) 2020-02-29 18:25:34 -08:00
requirements.txt Update documentation 2018-09-03 20:03:37 -04:00
tasks.rst Add documentation 2018-09-03 19:15:23 -04:00
tutorial_classifying_names.rst Don't cache epoch iterators when using sharded datasets (#1268) 2020-09-09 06:20:31 -07:00
tutorial_simple_lstm.rst Misc fixes (#2448) 2020-08-14 10:24:51 -07:00