Tim von Känel comments

Results 22 comments of


                                            Tim von Känel

Progress stuck on writing image (#19)

You may need to install poetry https://python-poetry.org/docs/ but as I said, it's a work in progress and you will run into other bugs.

Progress stuck on writing image (#19)

@DrAnime-byte I am currently working on removing docker as a dependency, so if you come back in a few weeks the issue should be resolved.

How can I train this model with data in sample_rate=16k ?

If you want to use your model with the pretrained HiFiGAN as Vocoder you need to mimic its short-time fourier transform window and hop length. The window length for pretrained...

How can I train this model with data in sample_rate=16k ?

No, you dont need to train a new hifigan, the output of hifigan will be 22050 hz even though you trained fastspeech on 16khz mel spectrograms

Delightful TTS implementation

Thanks for opening the issue! Pre-training the model will take me roughly one more week. Afterward, I will refactor the code, and get the project into a usable state, then...

Delightful TTS implementation

I fine-tuned the model on the voices of Twilight Sparkle (~6000 samples, My Little Pony) and Demoman (~500 samples, Team Fortress 2) now. There is definitely still a lot of...

I also noticed the multilingual G2P stuff and unusual phone set (phone set from [Montreal Forced Aligner](https://montreal-forced-aligner.readthedocs.io/en/latest/)) will probably make it a pain to implement this into coqui. It's probably...

Delightful TTS implementation

Just noticed coqui already has support for multiple languages, that is nice. It doesn't really matter which G2P model we use, we just need a way to extract the phoneme...

🐸Coqui <> VoiceSmith ?

@snufas Hey thanks! I work full-time in text-to-speech now and developing this repository is way out of scope at the moment, unfortunately. Maybe one day I will update and finish...