Mayur
Hi @vokshin, I am experiencing the same issue as well with LRS2 where the network does not seem to start converging even when using the repo code as is.
Hi @GGaryuk , @vokshin , Thanks for the great advice - I have yet to try out the new audio processing method suggested by @GGaryuk - it's great to hear...
Hi! I have been experimenting with a few variants of the original implementation and, as advised in this [issue](https://github.com/Rudrabha/Wav2Lip/issues/195#issuecomment-762986057), I have verified that the model is able to converge much quicker...
Hi @vokshin , @GGaryuk , I've tried to implement the solution suggested by @GGaryuk - unfortunately, I have been unable to observe any form of convergence (with the LRS2 dataset) even after...
@vokshin, I think your proposed solution sounds promising. I will try it out as well. I'm not sure how applicable it will be in the case of LRS2 since to the...
I'm not sure how big of a difference this would make when preprocessing, but I noticed that the length of the extracted audio is not the same as the video....
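For anyone who wants to check this on their own clips, here is a rough sketch of how the mismatch could be measured. It is not code from the repo: the function name and the example numbers are made up for illustration, and it assumes you already know the audio sample count, the sample rate, the video frame count and the frame rate.

```python
def av_duration_mismatch(num_audio_samples, sample_rate, num_video_frames, fps):
    """Compare audio and video durations for one clip.

    Returns (audio_seconds, video_seconds, diff_in_frames), where a positive
    diff_in_frames means the audio track is longer than the video track.
    """
    audio_seconds = num_audio_samples / sample_rate
    video_seconds = num_video_frames / fps
    diff_in_frames = (audio_seconds - video_seconds) * fps
    return audio_seconds, video_seconds, diff_in_frames


# Hypothetical clip (values assumed for illustration only):
# 16 kHz audio with 48640 samples, 25 fps video with 75 frames.
audio_s, video_s, diff = av_duration_mismatch(48640, 16000, 75, 25)
print(f"audio: {audio_s:.3f}s, video: {video_s:.3f}s, diff: {diff:.2f} frames")
```

If the difference comes out to a frame or more, it may be worth checking whether the audio windows still line up with the intended video frames.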
Hi @vokshin, When I attempted to further train the provided pre-trained weights for SyncNet, the validation loss started at 0.2581 and the training loss at 0.2785. It seems that...
When I trained the model from scratch for 1.75 million steps, the lowest validation loss that I could achieve was 0.3 -...
I made several changes to the code structure and used several alternate libraries that are more efficient.
Hi @TejaswiniiB , I primarily used PyTorch. For audio management I used torchaudio, for image handling I used torchvision. From my experience, I don't think that the choice of software...