Nickyang4900
Results
2
issues of
Nickyang4900
Any input image is resized to [448,448] by CLIPImageProcessor. So the model input size is actually fixed, right?
Your work is very interesting and inspiring. Is it possible to share your test sequences on VTM?