LipNet icon indicating copy to clipboard operation
LipNet copied to clipboard

How to do real-time inference?

Open harryxu-yscz opened this issue 7 years ago • 4 comments

Thanks for this awesome repo! I want to test Lipnet on real-time video, i.e. Given a stream of image frames from the video (at 25 fps using the provided model) and output results at real time. However, am not sure how to do it.

I thought about splitting the incoming frames into mutually exclusive chunks, and generating outputs from each chunk. However, this approach may suffer when chunks end in the middle of utterance. Any suggestion?

harryxu-yscz avatar Jul 11 '18 18:07 harryxu-yscz

me too

HamdiKaptan avatar Mar 28 '19 12:03 HamdiKaptan

me too

pinest94 avatar Apr 01 '19 07:04 pinest94

me too

purpletpig avatar Apr 30 '19 08:04 purpletpig

Thanks for this awesome repo! I want to test Lipnet on real-time video, i.e. Given a stream of image frames from the video (at 25 fps using the provided model) and output results at real time. However, am not sure how to do it.

I thought about splitting the incoming frames into mutually exclusive chunks, and generating outputs from each chunk. However, this approach may suffer when chunks end in the middle of utterance. Any suggestion?

Did you achieve ?

chahatagarwal avatar Apr 21 '20 08:04 chahatagarwal