VocGAN icon indicating copy to clipboard operation
VocGAN copied to clipboard

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Results 8 VocGAN issues
Sort by recently updated
recently updated
newest added

Hello, I tried to use a pretrained model with kss data, but a 1 second wav file was printed out. Can you only make a 1-second file with that model?...

F:\ProgramData\Anaconda3\python.exe F:/work/VocGAN-master/trainer.py F:\ProgramData\Anaconda3\lib\site-packages\torchaudio\extension\extension.py:14: UserWarning: torchaudio C++ extension is not available. warnings.warn('torchaudio C++ extension is not available.') F:\ProgramData\Anaconda3\lib\site-packages\torchaudio\backend\utils.py:64: UserWarning: The interface of "soundfile" backend is planned to change in 0.8.0 to...

Thanks for sharing great result. I want to train kss data for training vocoder to use in fastspeech2 with master branch code, is it possible? ``` Avg : g 1.3729...

Hi, thanks for your implementation which already helps me a lot, but I still have several questions: 1. As for the JCU discriminator, author mentioned that they use a convolution...

The biggest bottleneck in your model, regarding speed, is the Resample layer in the Hierarchical Discriminator. The speed on it is just awful. If you resample your dataset beforehand and...

I am facing assertion error and the log is as follows: Traceback (most recent call last): File "/home/stuart/sagar/speech_analysis_synth/VocGAN/utils/train.py", line 98, in train disc_real, disc_real_multiscale = model_d(audioG, melG) File "/home/stuart/sagar/speech_analysis_synth/VocGAN/venv/lib/python3.6/site-packages/torch/nn/modules/module.py", line...

Bumps [torch](https://github.com/pytorch/pytorch) from 1.6.0 to 2.2.0. Release notes Sourced from torch's releases. PyTorch 2.2: FlashAttention-v2, AOTInductor PyTorch 2.2 Release Notes Highlights Backwards Incompatible Changes Deprecations New Features Improvements Bug fixes...

dependencies