The input is face image or lip image?

Open ltl7155 opened this issue 4 years ago • 1 comments

In this paper, it's said that the input is lip image. But in this repo and the example.avi, the whole faces are kept and processed without cropping face part. In your Keras version, you only use lip. So for this pretrained model syncnet_v2.model, what kind of input image should we use?

Nov 08 '21 06:11 ltl7155

#9 It seems to be full face, can't understand why it is inconsistent with the paper, but there isn't any explanation.

Jan 10 '22 02:01 xiao-keeplearning