NumpyDumpDataset

class NumpyDumpDataset.NumpyDumpDataset(prefix, postfix='.txt.gz', start_seq=0, end_seq=None, num_inputs=None, num_outputs=None, **kwargs)[source]
file_format_data = '%i.data'[source]
file_format_targets = '%i.targets'[source]
cached_seqs = None[source]
Type:list[DatasetSeq]
init_seq_order(epoch=None, seq_list=None)[source]
Parameters:| None seq_list (list[str]) – In case we want to set a predefined order.
Return type:bool

:returns whether the order changed (True is always safe to return)

This is called when we start a new epoch, or at initialization. Call this when you reset the seq list.

get_input_data(seq_idx)[source]
Return type:numpy.ndarray
Returns features:
 format 2d (time,feature) (float)
get_targets(target, seq_idx)[source]
Return type:numpy.ndarray
Returns targets:
 format 1d (time) (int: idx of output-feature)
get_ctc_targets(seq_idx)[source]
get_seq_length(seq_idx)[source]
Return type:NumbersDict
num_seqs[source]
len_info()[source]
Return type:str

:returns a string to present the user as information about our len. Depending on our implementation, we can give some more or some less information.