Grad-SVC
Grad-SVC copied to clipboard
Is it necessary to trim the audio tracks in the dataset to under 30 seconds?
Thank you for the amazing open-source project. However, I am facing an issue when inference, so I have two questions.
- Is it necessary to separate the vocals and the music from each other? (I have separated using demucsv4)
- Is it necessary to trim the audio tracks in the dataset to under 30 seconds? (I have not trimmed the audio tracks)
Best regards