syncnet_python
syncnet_python copied to clipboard
Model taking lip video as input
Hi! Thank you for your excellent work in the paper! As is said in you paper, your model takes lip video as input, while this repo however only provides a model taking face video as input. Could you please provide a pre-trained model which takes the lip video as input? It will be very helpful for me!
Or could you please make it clearer how I can train the model by myself with lip videos as input?