Generic Datasets

HDF Dataset

class HDFDataset.HDFDataset(files=None, use_cache_manager=False, **kwargs)

Bases: CachedDataset

Dataset based on HDF files. This was the main original dataset format of RETURNN.

Parameters:
  • files (None|list[str]) – paths of the HDF files

  • use_cache_manager (bool) – if True, each file path is passed through Util.cf() (the cache manager) before the file is used
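The parameters above map directly onto a dataset entry in a RETURNN config. The following is a minimal sketch, assuming the usual convention that a dataset is declared as a dict whose "class" key names the dataset class and whose remaining keys are passed as constructor arguments. The variable name `train` and the file paths are illustrative placeholders, not taken from this page.

```python
# Sketch of a dataset entry in a RETURNN config file (the config is plain Python).
# Assumption: "class" selects the dataset class; the other keys are forwarded
# to HDFDataset(files=..., use_cache_manager=...).
train = {
    "class": "HDFDataset",
    # Hypothetical file paths, replace with your own HDF files.
    "files": ["data/train.part1.hdf", "data/train.part2.hdf"],
    # Leave False unless a cache manager (Util.cf) is available on your system.
    "use_cache_manager": False,
}
```

Keeping `use_cache_manager` at its default of False is the safer choice on machines where no cache manager is set up.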

Next Gen HDF Dataset

class HDFDataset.NextGenHDFDataset(input_stream_name, files=None, **kwargs)

Bases: CachedDataset2

A separate dataset implementation that also uses HDF files to store the data.

Parameters:
  • input_stream_name (str) –

  • files (None|list[str]) – paths of the HDF files
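A config entry for this class might look as follows. This is a sketch under the same assumption that a dataset is declared as a dict with a "class" key. The stream name "features" and the file path are hypothetical values chosen for illustration; the actual stream name must match whatever your HDF files contain.

```python
# Sketch of a NextGenHDFDataset entry in a RETURNN config file.
# Assumption: keys other than "class" are forwarded to
# NextGenHDFDataset(input_stream_name=..., files=...).
train = {
    "class": "NextGenHDFDataset",
    # Required argument; "features" is a hypothetical stream name.
    "input_stream_name": "features",
    # Hypothetical file path, replace with your own HDF file(s).
    "files": ["data/train.nextgen.hdf"],
}
```

Unlike HDFDataset, `input_stream_name` has no default in the signature above, so it must always be given.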