
Reproducing the results in 'Understanding Deformable Alignment in Video Super-Resolution'

Open • EricMingze opened this issue 3 years ago • 1 comment

Hi, thanks for your great work!

I'm trying to reproduce the visualization presented in the paper 'Understanding Deformable Alignment in Video Super-Resolution'. According to the paper, I expected the offsets to be similar to optical flow. However, when I visualized the offsets in Layer 1, their magnitudes were much smaller than those of the optical flow.

Here is what I did: I used the pretrained EDVR model 'EDVR_L_x4_SR_Vimeo90K_official-162b54e4.pth' and visualized the L1 offset right before the cascading part. The offset tensor has shape (1, 128, h, w); I reshaped it to (1, 2, 8=group_number, 8=kernel_h*kernel_w, h, w) and randomly picked one (2, h, w) slice to visualize.
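
For reference, one common way to render a (2, h, w) displacement field is to map each vector's direction to hue and its magnitude to saturation. The sketch below does this for a single vector using only the standard library. It is a simplified stand-in for the usual flow color wheel, not the paper's plotting code, and `vector_to_rgb` and `max_mag` are hypothetical names.

```python
import colorsys
import math


def vector_to_rgb(dy, dx, max_mag):
    """Map one displacement vector to an RGB triple in [0, 1].

    Direction becomes hue, magnitude (clipped at max_mag) becomes saturation.
    A zero vector maps to white.
    """
    mag = math.hypot(dx, dy)
    # atan2 returns an angle in [-pi, pi]; wrap it into a hue in [0, 1).
    hue = (math.atan2(dy, dx) / (2 * math.pi)) % 1.0
    sat = min(mag / max_mag, 1.0) if max_mag > 0 else 0.0
    return colorsys.hsv_to_rgb(hue, sat, 1.0)
```

Passing the same `max_mag` for both the offset field and the optical flow keeps a difference in magnitude visible in the images. Normalizing each field by its own maximum would hide it.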

From what I observed, there is no big difference between the L1 offset and the cascading offset mentioned in https://github.com/XPixelGroup/BasicSR/issues/404.

Can you help me with that? Thanks.

EricMingze, Jul 21 '22 21:07

  1. The offsets should be re-grouped. You can contact Kelvin for the corresponding scripts.
  2. EDVR contains several DCNs. After the features have been warped several times by the preceding DCNs, the offsets in the cascading part are expected to be small. You can train a new network with a single DCN and then visualize its offsets.
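
As a rough illustration of what re-grouping could look like (this is not the script mentioned above), the sketch below assumes the offset channels are laid out group-major, then kernel point, then (dy, dx), which is the convention used by torchvision's `deform_conv2d`. Whether the DCN op in BasicSR uses the same layout is an assumption to verify against its source. The 3x3 kernel is also assumed, and the helper names are hypothetical.

```python
def expected_offset_channels(groups=8, kernel_h=3, kernel_w=3):
    """Number of offset channels: one (dy, dx) pair per group and kernel point."""
    return 2 * groups * kernel_h * kernel_w


def offset_channel(group, point, comp, kernel_points=9):
    """Channel index of one component under the ASSUMED layout.

    comp is 0 for dy and 1 for dx; point indexes the kernel position.
    """
    return (group * kernel_points + point) * 2 + comp


def regroup(channels, groups=8, kernel_points=9):
    """Turn a flat sequence of per-channel maps into [group][point] -> (dy, dx)."""
    expected = 2 * groups * kernel_points
    if len(channels) != expected:
        raise ValueError(f"expected {expected} channels, got {len(channels)}")
    return [
        [
            (
                channels[offset_channel(g, k, 0, kernel_points)],
                channels[offset_channel(g, k, 1, kernel_points)],
            )
            for k in range(kernel_points)
        ]
        for g in range(groups)
    ]
```

Under these assumptions, 8 groups with a 3x3 kernel give 144 offset channels, so a tensor with a different channel count may be an intermediate feature map rather than the raw offsets passed to the deformable convolution.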

xinntao, Jul 26 '22 11:07