LipNet How to do real-time inference?

Thanks for this awesome repo! I want to test Lipnet on real-time video, i.e. Given a stream of image frames from the video (at 25 fps using the provided model) and output results at real time. However, am not sure how to do it.

I thought about splitting the incoming frames into mutually exclusive chunks, and generating outputs from each chunk. However, this approach may suffer when chunks end in the middle of utterance. Any suggestion?

Jul 11 '18 18:07 harryxu-yscz

me too

Mar 28 '19 12:03 HamdiKaptan

me too

Apr 01 '19 07:04 pinest94

me too

Apr 30 '19 08:04 purpletpig

Thanks for this awesome repo! I want to test Lipnet on real-time video, i.e. Given a stream of image frames from the video (at 25 fps using the provided model) and output results at real time. However, am not sure how to do it.

I thought about splitting the incoming frames into mutually exclusive chunks, and generating outputs from each chunk. However, this approach may suffer when chunks end in the middle of utterance. Any suggestion?

Did you achieve ?

Apr 21 '20 08:04 chahatagarwal