AutoPST
AutoPST copied to clipboard
Global Rhythm Style Transfer Without Text Transcriptions
Hi there, I am trying to follow the code with my own dataset and could run Main_1.py and main_2.py to get xxx-A.ckpt and xxx-B.ckpt files. Now I am not able...
Why did the speech content of the converted voice with my own trained model changed? Do you know the reason?
Hello. I have referred to your paper. Based on your experiment, I conducted experiment on accent transformation using English accent data from different countries. But the result is very unsatisfactory,...
The pretrained model sea.ckpt just fit dataset which have 82 speaker, However, I have a huge dataset including 300 speaker at least. How could I train a corresponding SAE model?
 Hi, I got an error like this, when run prepare_train_data.py Is spk2emb has vctk16-train-wav key? vctk16-train-wav Traceback (most recent call last): File "prepare_train_data.py", line 52, in submeta.append(spk2emb[subdir]) KeyError: 'vctk16-train-wav'
Hi, I run into an error about onmt --------------------------------------------------------------------------- ModuleNotFoundError Traceback (most recent call last) in 5 import torch.nn.functional as F 6 from collections import OrderedDict ----> 7 from onmt.utils.misc...
Hi and thank you for this amazing project! I was trying to create a notebook in colab that would allow me to input an audio file, then select the speaker...
Sorry, I’m not familiar with English grammar, please forgive me if I offend. I want to try to execute this Github project, but failed. The only changed part of the...
The mean and std I created are different from the values in mfcc_stats.pkl you provided. Can you please check if I am doing something wrong? I attached a simple code...
How to make 'mfcc_stats.pkl' and 'spk2emb_82.pkl'? I want to test another dataset. Thanks.