vits icon indicating copy to clipboard operation
vits copied to clipboard

Computing power requirement

Open AlphaMind123 opened this issue 2 years ago • 3 comments

May I ask what is the minimum machine power configuration requirement for training vits and reasoning? Unfortunately,I only have 2060, 6GB of graphics memory in my laptop.

AlphaMind123 avatar Jul 01 '23 08:07 AlphaMind123

I'm training a model in Brazilian Portuguese using RTX 3060 12gb. I'm running the training for 5 days. I can understand what the generated voice is speaking. However the generated voice does not have a good quality... yet

vidigal avatar Aug 03 '23 00:08 vidigal

I'm training a model in Brazilian Portuguese using RTX 3060 12gb. I'm running the training for 5 days. I can understand what the generated voice is speaking. However the generated voice does not have a good quality... yet

Hello, I have been training my model for 3 days, but my computer has had a power outage. I'm struggling to know how to train to continue from

2023-09-27 17:04:19,901 vietnamese_base INFO [2.2566256523132324, 2.8984131813049316, 5.876196384429932, 24.848127365112305, 1.80074548 72131348, 2.167569398880005, 32800, 0.00019621110994425385]
2023-09-27 17:04:27,901 vietnamese_base INFO ====> Epoch: 154

can you help me ? thank you !

ctimict avatar Sep 27 '23 11:09 ctimict

I'm using a similar laptop with RTX 3060 6GB. Using 1150 sentences (wavs), it took around 15 days for 10,000 epochs. It was a good enough test to know that I can get decent quality. Some words have strange pronunciations and the speaking cadence seems odd at times. Planning for another run with at least 2,000 sentences but will likely not run it on the 3060, takes too long. I bought 3 of the Tesla P40 24GB cards and the training seems much faster on those.

aaronnewsome avatar Dec 23 '23 21:12 aaronnewsome