yxlllc comments

Results: 35 comments by yxlllc

How does RVC handle reconstructing audio from the spectrogram?

RVC and So-Vits-Svc are similar end-to-end architectures. In fact, the spectrum is not explicitly generated during the conversion process, although HifiGAN is used (the input is a 192-dimensional hidden space...

How does RVC handle reconstructing audio from the spectrogram?

> > RVC and So-Vits-Svc are similar end-to-end architectures. In fact, the spectrum is not explicitly generated during the conversion process, although HifiGAN is used (the input is a 192-dimensional...

Standard for most of recorded music is 44khz instead of 40khz

I guess the reason is that the RVC pre-trained model was trained on the VCTK dataset, which has a sampling rate of 48 kHz.
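To make the sampling rates in this thread concrete, here is a small arithmetic sketch. It only computes the Nyquist limit (half the sampling rate) and the reduced resampling ratio between two rates; the helper names are hypothetical and nothing here is taken from the RVC code.

```python
from fractions import Fraction


def nyquist_hz(sample_rate_hz: int) -> float:
    """Highest frequency representable at a given sampling rate."""
    return sample_rate_hz / 2


def resample_ratio(source_hz: int, target_hz: int) -> Fraction:
    """Return target/source as a reduced fraction (up factor / down factor)."""
    return Fraction(target_hz, source_hz)


# Rates mentioned in the thread: 40 kHz, 44.1 kHz (CD audio), and 48 kHz.
for rate in (40_000, 44_100, 48_000):
    print(f"{rate} Hz -> Nyquist limit {nyquist_hz(rate)} Hz")

# Converting 48 kHz material down to 40 kHz is a simple 5/6 ratio,
# while 44.1 kHz to 40 kHz needs the less convenient 400/441.
print(resample_ratio(48_000, 40_000))
print(resample_ratio(44_100, 40_000))
```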

GANs

We have tried it, but BigVGAN is somewhat difficult to train and the improvement is not obvious.

Asking about the long-promised RVCv3 pre-trained model

We do have some experimental models, but the performance improvements have not met expectations. A key point may be that the performance of the pre-trained ContentVec model (hubert_base.pt) limits the final upper...

Unknown config.yaml

The wrong model was loaded.

size mismatch

Check whether the configuration file and the pre-trained model you are using are the correct, matching pair.
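One way to carry out that check is to compare parameter shapes by name before loading. The following is a minimal sketch under stated assumptions: it works on plain name-to-shape dictionaries, and the function name and the example parameter names are hypothetical, not taken from the project.

```python
from typing import Dict, List, Tuple

Shape = Tuple[int, ...]


def find_shape_mismatches(
    model_shapes: Dict[str, Shape],
    checkpoint_shapes: Dict[str, Shape],
) -> List[str]:
    """List every parameter whose presence or shape differs between the two."""
    problems = []
    for name in sorted(set(model_shapes) | set(checkpoint_shapes)):
        expected = model_shapes.get(name)
        found = checkpoint_shapes.get(name)
        if expected is None:
            problems.append(f"unexpected key in checkpoint: {name} {found}")
        elif found is None:
            problems.append(f"missing key in checkpoint: {name} {expected}")
        elif expected != found:
            problems.append(
                f"size mismatch for {name}: model {expected}, checkpoint {found}"
            )
    return problems


# Hypothetical shapes: the config builds a 256-channel input layer,
# but the checkpoint was trained with 768 input channels.
model = {"encoder.weight": (192, 256), "encoder.bias": (192,)}
checkpoint = {"encoder.weight": (192, 768), "encoder.bias": (192,)}
for line in find_shape_mismatches(model, checkpoint):
    print(line)
```

With PyTorch, the model dictionary can be built as `{k: tuple(v.shape) for k, v in model.state_dict().items()}`. How the checkpoint file nests its weights depends on the project, so that part is left out here.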

ERROR: Could not open requirements file: [Errno 2] No such file or directory: 'requirements.txt'

Check that the pip command is being run from the correct directory, that is, the one that contains requirements.txt.
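A quick way to verify the directory is to look for the file from the current working directory. This is only a sketch: the helper name is hypothetical, and searching the parent directories is an extra convenience, assuming the command may have been run from a subfolder of the repository.

```python
from pathlib import Path
from typing import Optional


def find_requirements(
    start: Path, filename: str = "requirements.txt"
) -> Optional[Path]:
    """Search `start` and its parent directories; return the first match or None."""
    start = start.resolve()
    for directory in (start, *start.parents):
        candidate = directory / filename
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    found = find_requirements(Path.cwd())
    if found is None:
        print("requirements.txt not found here or in any parent directory")
    else:
        # Run pip from this directory, or pass the full path to `pip install -r`.
        print(f"found: {found}")
```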

Failure at "audio_callback" in gui_diff.py preventing usage

According to my tests, MME is the only stable driver. The others behave unpredictably, which may be a problem with the sounddevice library.
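To restrict device selection to MME, the devices can be filtered by host API. The sketch below is written as a pure function over the lists of dictionaries that `sounddevice.query_hostapis()` and `sounddevice.query_devices()` return (it relies on their `name` and `hostapi` keys). The function name is hypothetical, and this is not how gui_diff.py selects devices.

```python
from typing import Any, Dict, List, Sequence


def devices_for_hostapi(
    hostapis: Sequence[Dict[str, Any]],
    devices: Sequence[Dict[str, Any]],
    api_name: str,
) -> List[Dict[str, Any]]:
    """Return the devices whose host API name contains `api_name`."""
    wanted = {
        index
        for index, api in enumerate(hostapis)
        if api_name.lower() in api["name"].lower()
    }
    # Attach each device's position in the list, since that is the device ID.
    return [
        dict(dev, index=i)
        for i, dev in enumerate(devices)
        if dev["hostapi"] in wanted
    ]


# Intended use on a machine with sounddevice installed (not run here):
#   import sounddevice as sd
#   mme = devices_for_hostapi(sd.query_hostapis(), sd.query_devices(), "MME")
#   for dev in mme:
#       print(dev["index"], dev["name"])
```

The resulting indices can then be chosen in the GUI or assigned to `sounddevice.default.device`.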

Training

If you start from a pre-trained model, a few hours of training will be enough.