reservoirpy.datasets.japanese_vowels#
- reservoirpy.datasets.japanese_vowels(one_hot_encode=True, repeat_targets=False, data_folder=None, reload=False)[source]#
Load the Japanese vowels [16] dataset.
This is a classic audio discrimination task. Nine male Japanese speakers pronounced the ` ae` vowel. The task consists in inferring the speaker identity from the audio recording.
Audio recordings are series of 12 LPC cepstrum coefficient. Series contains between 7 and 29 timesteps. Each series (or “block”) is one utterance of ` ae` vowel from one speaker.
Classes
9
Samples per class (training)
30 series of 7-29 timesteps
Samples per class (testing)
29-50 series of 7-29 timesteps
Samples total
640
Dimensionality
12
Features
real
Data is downloaded from: https://doi.org/10.24432/C5NS47
- Parameters:
one_hot_encode (bool, default to True) – If True, returns class label as a one-hot encoded vector.
repeat_targets (bool, default to False) – If True, repeat the target label or vector along the time axis of the corresponding sample.
data_folder (str or Path-like object, optional) – Local destination of the downloaded data.
reload (bool, default to False) – If True, re-download data from remote repository. Else, if a cached version of the dataset exists, use the cached dataset.
- Returns:
Lists of arrays of shape (timesteps, features) or (timesteps, target) or (target,).
- Return type:
X_train, Y_train, X_test, Y_test
References