
Is it necessary to trim the audio tracks in the dataset to under 30 seconds?

Open: ducchung2444 opened this issue 1 year ago • 0 comments

Thank you for the amazing open-source project. I am running into a problem during inference, so I have two questions.

  1. Is it necessary to separate the vocals from the music? (I have separated them using Demucs v4.)
  2. Is it necessary to trim the audio tracks in the dataset to under 30 seconds? (I have not trimmed the audio tracks.)

Best regards

ducchung2444, Nov 22 '24 07:11