fairseq/tests
Changhan Wang 1d1c145387 speech-to-text OSS
Summary:
Imported from https://github.com/fairinternal/fairseq-py/pull/1284. Updated according to PR comments.

Main changes:
* New task: `fairseq.tasks.speech_to_text`
  * Multilingual support: multiple train sub-splits, temperature-based sampling, language ID tokens
* New dataset: `fairseq.data.audio.speech_to_text_dataset`
* Added accuracy metrics and BOS prefix removal to label smoothed cross entropy
* New models: Transformer (`fairseq.models.speech_to_text.s2t_transformer`) and BLSTM (`fairseq.models.speech_to_text.berard`)
* Extended scorers:
  * Added a base scorer class: `fairseq.scorers.BaseScorer` (the parent class for all scorers except the C++-based BLEU scorer)
  * Added an evaluation tokenizer: `fairseq.scorers.eval_tokenizer`, which leverages sacreBLEU's built-in tokenizers and supports character-level tokenization as well as punctuation removal (for WER scoring)
  * Added chrF scorer: `fairseq.scorers.chrf`
* Online Mel-filter bank speech feature extraction (via C++-based PyKaldi or Python-based torchaudio): `fairseq.data.audio.audio_utils`
* Online speech feature transforms: `fairseq.data.audio.feature_transforms.*`
* Fixed the subsampled sequence lengths in VGGTransformer (`examples.speech_recognition.models.vggtransformer`)
* Examples under `examples/speech_to_text`:
  * LibriSpeech (ASR): better results than VGGTransformer with smaller Transformer-based models
  * MuST-C (ST): comparable to [SOTA results](https://arxiv.org/pdf/2004.10234.pdf) but with fewer tricks
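
For readers unfamiliar with the temperature-based sampling mentioned under multilingual support, the following is a minimal sketch of the general technique, not the code added by this commit. It assumes the common formulation in which sub-split `i` with `n_i` examples is sampled with probability proportional to `(n_i / N) ** (1 / T)`; the function name `temperature_sampling_probs` and the sub-split sizes are hypothetical.

```python
def temperature_sampling_probs(sizes, temperature=1.0):
    """Return sampling probabilities proportional to (n_i / N) ** (1 / T).

    Illustrative helper (hypothetical name); not part of the fairseq API.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    total = sum(sizes)
    # Raise each sub-split's share of the data to the power 1/T.
    scaled = [(n / total) ** (1.0 / temperature) for n in sizes]
    # Renormalize so the probabilities sum to 1.
    norm = sum(scaled)
    return [s / norm for s in scaled]


# Hypothetical sizes for a high-resource and a low-resource train sub-split.
sizes = [900, 100]
probs_t1 = temperature_sampling_probs(sizes, temperature=1.0)
probs_t5 = temperature_sampling_probs(sizes, temperature=5.0)
```

With `temperature=1.0` the probabilities match the raw data proportions; larger temperatures flatten the distribution, so the low-resource sub-split is sampled more often than its share of the data.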

Reviewed By: jmp84

Differential Revision: D24065273

fbshipit-source-id: 5f842ca9c826f92d4af660705611885fe440a9ab
2020-10-14 12:30:05 -07:00
| Name | Last commit | Date |
| --- | --- | --- |
| gpu | Fix hub (#2687) | 2020-10-02 19:02:01 -07:00 |
| speech_recognition | hydra fairseq 3 - inherit from legacy for fairseq classes | 2020-09-09 17:02:13 -07:00 |
| __init__.py | remediation of S205607 | 2020-07-17 17:21:51 -07:00 |
| test_average_checkpoints.py | Small fixes | 2019-08-19 15:08:25 -07:00 |
| test_backtranslation_dataset.py | Deprecate the SequenceGenerator with the Scripted vision (#1120) | 2020-04-07 13:28:30 -07:00 |
| test_binaries.py | remove max_sentences from args, use batch_size instead (#1333) | 2020-10-05 19:09:01 -07:00 |
| test_bmuf.py | Fix BMUF using 1 GPU | 2020-04-16 11:25:35 -07:00 |
| test_character_token_embedder.py | Relicense fairseq under MIT license (#786) | 2019-07-30 07:48:23 -07:00 |
| test_concat_dataset.py | Relicense fairseq under MIT license (#786) | 2019-07-30 07:48:23 -07:00 |
| test_constraints.py | Added constrained decoding (#1536) (#2402) | 2020-08-20 11:59:53 -07:00 |
| test_convtbc.py | Relicense fairseq under MIT license (#786) | 2019-07-30 07:48:23 -07:00 |
| test_dictionary.py | Allow dictionaries to overwrite entries with #fairseq:overwrite comment (#1073) | 2020-03-08 06:52:00 -07:00 |
| test_export.py | hydra fairseq 3 - inherit from legacy for fairseq classes | 2020-09-09 17:02:13 -07:00 |
| test_file_io.py | Added unit test for PathManager file io (with or without fvcore). | 2019-12-09 14:19:51 -08:00 |
| test_fp16_optimizer.py | Fix hub (#2687) | 2020-10-02 19:02:01 -07:00 |
| test_inference_dropout.py | Misc fixes (#2492) | 2020-08-20 06:42:10 -07:00 |
| test_iterators.py | Account for checkpoint updates when calling take on CountingIterator | 2020-09-04 14:26:53 -07:00 |
| test_label_smoothing.py | speech-to-text OSS | 2020-10-14 12:30:05 -07:00 |
| test_lstm_jitable.py | hydra fairseq 3 - inherit from legacy for fairseq classes | 2020-09-09 17:02:13 -07:00 |
| test_memory_efficient_fp16.py | Clean up tests | 2020-01-22 11:29:20 -08:00 |
| test_metrics.py | Fix logging of training sets (fixes #1632) (#1634) | 2020-01-20 16:34:33 -08:00 |
| test_multi_corpus_sampled_dataset.py | Relicense fairseq under MIT license (#786) | 2019-07-30 07:48:23 -07:00 |
| test_multihead_attention.py | Fixing key padding mask during transformer generation | 2019-11-05 06:50:53 -08:00 |
| test_noising.py | Relicense fairseq under MIT license (#786) | 2019-07-30 07:48:23 -07:00 |
| test_reproducibility.py | Fix validation happening twice at the end of epoch (#1934) | 2020-04-03 16:38:39 -07:00 |
| test_resampling_dataset.py | Add dataset class for weighted sampling with replacement. (#861) | 2019-09-19 10:36:00 -07:00 |
| test_sequence_generator.py | hydra fairseq 3 - inherit from legacy for fairseq classes | 2020-09-09 17:02:13 -07:00 |
| test_sequence_scorer.py | Relicense fairseq under MIT license (#786) | 2019-07-30 07:48:23 -07:00 |
| test_sparse_multihead_attention.py | Relicense fairseq under MIT license (#786) | 2019-07-30 07:48:23 -07:00 |
| test_token_block_dataset.py | Relicense fairseq under MIT license (#786) | 2019-07-30 07:48:23 -07:00 |
| test_train.py | Misc fixes (#2492) | 2020-08-20 06:42:10 -07:00 |
| test_utils.py | Updates full to no longer use deprecated integer fill_value type inference | 2020-06-22 11:56:58 -07:00 |
| utils.py | remove max_sentences from args, use batch_size instead (#1333) | 2020-10-05 19:09:01 -07:00 |