VocGAN issues

Can't I change the length of my voice length?

1

Hello, I tried to use a pretrained model with kss data, but a 1 second wav file was printed out. Can you only make a 1-second file with that model?...

Koeunseooooo

KeyError: 'getstate'

1

F:\ProgramData\Anaconda3\python.exe F:/work/VocGAN-master/trainer.py F:\ProgramData\Anaconda3\lib\site-packages\torchaudio\extension\extension.py:14: UserWarning: torchaudio C++ extension is not available. warnings.warn('torchaudio C++ extension is not available.') F:\ProgramData\Anaconda3\lib\site-packages\torchaudio\backend\utils.py:64: UserWarning: The interface of "soundfile" backend is planned to change in 0.8.0 to...

c1a1o1

Is it possible to train KSS dataset in master branch?

2

Thanks for sharing great result. I want to train kss data for training vocoder to use in fastspeech2 with master branch code, is it possible? ``` Avg : g 1.3729...

taewhankim

JCU Discriminator implementation details

Hi, thanks for your implementation which already helps me a lot, but I still have several questions: 1. As for the JCU discriminator, author mentioned that they use a convolution...

twinklecc

What did you use to generate the mel files from text?

1

Thanks

ghost

[Proposal] Reduce training time by resampling beforehand

1

The biggest bottleneck in your model, regarding speed, is the Resample layer in the Hierarchical Discriminator. The speed on it is just awful. If you resample your dataset beforehand and...

Craq

Assertion error: torchaaudio resample_waveform related

3

I am facing assertion error and the log is as follows: Traceback (most recent call last): File "/home/stuart/sagar/speech_analysis_synth/VocGAN/utils/train.py", line 98, in train disc_real, disc_real_multiscale = model_d(audioG, melG) File "/home/stuart/sagar/speech_analysis_synth/VocGAN/venv/lib/python3.6/site-packages/torch/nn/modules/module.py", line...

raikarsagar

Bump torch from 1.6.0 to 2.2.0

Bumps [torch](https://github.com/pytorch/pytorch) from 1.6.0 to 2.2.0. Release notes Sourced from torch's releases. PyTorch 2.2: FlashAttention-v2, AOTInductor PyTorch 2.2 Release Notes Highlights Backwards Incompatible Changes Deprecations New Features Improvements Bug fixes...

dependabot[bot]

dependencies

VocGAN
VocGAN copied to clipboard

Metadata

Can't I change the length of my voice length?

KeyError: 'getstate'

Is it possible to train KSS dataset in master branch?

JCU Discriminator implementation details

What did you use to generate the mel files from text?

[Proposal] Reduce training time by resampling beforehand

Assertion error: torchaaudio resample_waveform related

Bump torch from 1.6.0 to 2.2.0

← Metadata

Owner

Metadata

VocGAN VocGAN copied to clipboard

Metadata

← Metadata

Owner

Metadata

VocGAN
VocGAN copied to clipboard