VocGAN
VocGAN copied to clipboard
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Hello, I tried to use a pretrained model with kss data, but a 1 second wav file was printed out. Can you only make a 1-second file with that model?...
F:\ProgramData\Anaconda3\python.exe F:/work/VocGAN-master/trainer.py F:\ProgramData\Anaconda3\lib\site-packages\torchaudio\extension\extension.py:14: UserWarning: torchaudio C++ extension is not available. warnings.warn('torchaudio C++ extension is not available.') F:\ProgramData\Anaconda3\lib\site-packages\torchaudio\backend\utils.py:64: UserWarning: The interface of "soundfile" backend is planned to change in 0.8.0 to...
Thanks for sharing great result. I want to train kss data for training vocoder to use in fastspeech2 with master branch code, is it possible? ``` Avg : g 1.3729...
Hi, thanks for your implementation which already helps me a lot, but I still have several questions: 1. As for the JCU discriminator, author mentioned that they use a convolution...
The biggest bottleneck in your model, regarding speed, is the Resample layer in the Hierarchical Discriminator. The speed on it is just awful. If you resample your dataset beforehand and...
I am facing assertion error and the log is as follows: Traceback (most recent call last): File "/home/stuart/sagar/speech_analysis_synth/VocGAN/utils/train.py", line 98, in train disc_real, disc_real_multiscale = model_d(audioG, melG) File "/home/stuart/sagar/speech_analysis_synth/VocGAN/venv/lib/python3.6/site-packages/torch/nn/modules/module.py", line...
Bumps [torch](https://github.com/pytorch/pytorch) from 1.6.0 to 2.2.0. Release notes Sourced from torch's releases. PyTorch 2.2: FlashAttention-v2, AOTInductor PyTorch 2.2 Release Notes Highlights Backwards Incompatible Changes Deprecations New Features Improvements Bug fixes...