Nickyang4900

Results 2 issues of Nickyang4900

Any input image is resized to [448,448] by CLIPImageProcessor. So the model input size is actually fixed, right?

Your work is very interesting and inspiring. Is it possible to share your test sequences on VTM?