Overall process of speech signal processing (Mel-spectrogram & MFCCs) and loading data using Pytorch dataloader