Video-Super-Resolution-Library icon indicating copy to clipboard operation
Video-Super-Resolution-Library copied to clipboard

[REQ] Better VIDEO super resolution models

Open MarcoRavich opened this issue 11 months ago • 2 comments

Hi there, just discovered this interesting HW-optimized library for IMAGE super resolution.

As well explained in this demo video for VideoGigaGAN (paper here) by University of Maryland/Adobe Research, there are multiple factors to consider for "correct" VIDEO super resolution inferencing:

https://github.com/videogigagan/videogigagan.github.io/raw/refs/heads/main/assets/videos/demo.mp4

Anyway, since VideoGigaGAN sources aren't available, I recommend you to "port" alternative (more contemporary, if possible) VIDEO processing-oriented algos for better/consistent results... ...here are some interesting projects - you can already exploit - to carefully evaluate:

  • StableVSR: Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models (paper);
  • FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring (paper)
  • EvTexture: Event-driven Texture Enhancement for Video Super-Resolution (paper)
  • MIA-VSR: Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention (paper)
  • STVSR-NO: Space-Time Video Super-resolution with Neural Operator (paper);
  • STINet: Enhancing Space-time Video Super-resolution via Spatial-temporal Feature Interaction (paper);
  • GIRNet: Global Spatial-Temporal Information-based Residual ConvLSTM for Video Space-Time Super-Resolution (paper); ...

Resources collections:

Last but not least, I strongly suggest you to enable the Discussion section for this repository in order to let 3rd-party devs/users to exchange ideas & knowledge to push the evolution.

Hope that inspires.

MarcoRavich avatar Feb 20 '25 10:02 MarcoRavich

Thank you @MarcoRavich

To enable DL VSR models, we also have the library and FFmpeg plugin: https://github.com/OpenVisualCloud/iVSR, it's based on OpenVINO. For example, it supports the Enhanced BasicVSR model which was optimized based on the public BasicVSR model.

djie1 avatar Feb 28 '25 09:02 djie1

Just discovered @ArcticHare105's S3Diff (Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors paper implementation) that looks VERY promising too (for image upscale/enhance)...

Hope that inspires !

MarcoRavich avatar Mar 29 '25 20:03 MarcoRavich