diffwave-sr
diffwave-sr copied to clipboard
 Hello, I'm trying to train a model on the Opensinger dataset, but the Loss_T keeps going up and the resulting speech is almost incomprehensible, do you have any suggestions...
Excuse me, what bandwidth range was your 16kHz model trained on, and can it extend waveforms from 2k, 4k, and 8k to 16k?
Hello. I have a question about below formula. How did you derive this? https://github.com/yoyololicon/diffwave-sr/blob/cab5c4e330c8b6d8b329a6c85812a7328fe3431c/loss.py#L20 In this research, audio data is used and is it continuous? I would appreciate your cooperation.
Hi, I am now working on the evaluation on audio super metrics, and i am wondering whether the LSD metric lead to sub-optimal results? For example, the following STFT-image consists...