bzp83
It would be great to have some tips on how to train different languages... I have datasets of different languages and would be happy to train with those datasets, but...
I'm not sure if my question makes sense... but I'll try to explain. I trained a voice with ~50 hours of audio divided into 57500 WAV files in 44100 Hz 16...
Hello! I trained 2 voices from scratch, one in medium and the other in high quality. When I export them to ONNX and test, the medium has an RTF of...
Could someone help clarify a few questions for me? What exactly is the difference between medium and high quality? I have a fairly extensive dataset, approximately 100 hours of high-quality...
I trained a model twice from scratch with the same dataset and same everything, except that one used use_sdp=True and the other use_sdp=False. I can't see any difference, except the...
https://github.com/jik876/hifi-gan/blob/4769534d45265d52a904b850da5a622601885777/models.py#L81 Hi, why is 80 hardcoded here? Should it match [num_mels](https://github.com/jik876/hifi-gan/blob/4769534d45265d52a904b850da5a622601885777/config_v1.json#L18)? Thanks
Hi! Thanks for this work! I'm using this in a model, and when I try to export the model to ONNX, I get: `Missing key(s) in state_dict: "model_g.dec.stft.forward_basis", "model_g.dec.stft.inverse_basis".`...
Hi, can you tell me whether I can use your vocoder with VITS (https://github.com/jaywalnut310/vits)? Also, does your vocoder have better quality than the...
Hi, first, I wanted to congratulate you on this amazing work and share my experience. I decided to give it a go and tested a dataset on a...
Is it possible to fine-tune a Vocos model with mel specs and ground-truth (GT) audio?