nnAudio
Audio processing using PyTorch 1D convolution networks
I am working on multi-channel sEMG signals. Is it possible to apply the API to a multi-channel signal of shape (n_channels, n_samples) to produce STFT data in shape...
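One possible approach to the multi-channel question above is to fold the channel axis into the batch axis, run a single-channel STFT, and unfold again. The sketch below uses plain `torch.stft` rather than nnAudio's own layers; the helper name `multichannel_stft` and the default `n_fft` and `hop_length` values are hypothetical. The same reshape should carry over to any layer that accepts input of shape (batch, n_samples), but that is an assumption, not something confirmed in this thread.

```python
import torch


def multichannel_stft(x, n_fft=256, hop_length=64):
    """Magnitude STFT of a multi-channel signal (hypothetical helper).

    x: tensor of shape (n_channels, n_samples) or (batch, n_channels, n_samples).
    Returns a tensor with the same leading dimensions followed by
    (freq_bins, n_frames).
    """
    lead_shape = x.shape[:-1]                  # (n_channels,) or (batch, n_channels)
    flat = x.reshape(-1, x.shape[-1])          # fold channels into the batch axis
    window = torch.hann_window(n_fft, dtype=x.dtype, device=x.device)
    spec = torch.stft(
        flat,
        n_fft=n_fft,
        hop_length=hop_length,
        window=window,
        return_complex=True,
    )
    mag = spec.abs()                           # (batch * n_channels, freq_bins, n_frames)
    return mag.reshape(*lead_shape, mag.shape[-2], mag.shape[-1])


signal = torch.randn(8, 1000)                  # 8 channels, 1000 samples each
spectrogram = multichannel_stft(signal)
```

Each channel is transformed independently, so no information is mixed across channels; any cross-channel processing would have to happen downstream.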
Hello, thanks for putting in place a really useful library! I'm working on the pneumonia detection problem. My dataset is super imbalanced, with 2000+ non-pneumonia cases and 142 pneumonia cases, but...
I've been playing around with trying to reconstruct an STFT spectrogram from a Mel spectrogram (derived using the `MelSpectrogram` class) and wondered if you might be interested in incorporating something...
Hello! While training my model, I used the MelSpectrogram function and realized that after a few epochs (10-15), the mel basis (mel filterbank) gets stuck and does not change at all. I even tried increasing...
CQT doesn't work on short waveform chunks. Is this expected?
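A likely reason is that the lowest-frequency CQT bin needs a long analysis kernel. The sketch below estimates that length from the standard constant-Q relation, Q = filter_scale / (2^(1/bins_per_octave) - 1), with the longest kernel spanning roughly Q * sr / fmin samples. The function name `min_cqt_input_length` is hypothetical, and whether nnAudio requires exactly this many samples (it may round up or pad internally) is an assumption.

```python
import math


def min_cqt_input_length(sr, fmin, bins_per_octave=12, filter_scale=1.0):
    """Approximate length, in samples, of the longest CQT kernel.

    Hypothetical helper based on the generic constant-Q relation; it does
    not call into nnAudio.
    """
    # Constant Q factor shared by every bin in the filterbank.
    q = filter_scale / (2 ** (1.0 / bins_per_octave) - 1)
    # The lowest bin (fmin) has the longest kernel.
    return math.ceil(q * sr / fmin)


# Example values chosen for illustration: 22.05 kHz audio, fmin near C1.
longest = min_cqt_input_length(sr=22050, fmin=32.70)
```

If a chunk is shorter than this estimate, raising `fmin` or using fewer bins per octave shortens the longest kernel; doubling `fmin` roughly halves it.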
I think it would be nice to have normalization tools for STFT kernels (they exist in CQT in the forward pass via the normalization_type parameter) in order to control the...
Could you please elaborate on why Learnable_window is not used in STFT, Mel Spectrograms, and MFCC, but is used in their inverse counterparts?
Hi, I converted the STFT model to ONNX, but an error is raised: RuntimeError: Exporting the operator col2im to ONNX opset version 12 is not supported. Please...
Setting: Spectrogram.iSTFT(n_fft=1024, win_length=1024, freq_bins=None, hop_length=300, fmax=7600, fmin=80, sr=24000, trainable_window=False, trainable_kernels=False, verbose=False) I used Spectrogram.iSTFT to convert spectrograms back to waveforms. For most batches the backward pass is fast, but sometimes the backward pass is very slow...