zzwei1
Hi, I want to use TSM on my own dataset, which has video-like input (each gesture has 32 frames, so my input shape is (N, C, T, H, W)). But when I use a 2D conv...
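For reference, here is a minimal sketch of how I currently fold the temporal dimension into the batch so a 2D conv accepts my 5-D tensor. The shapes and the plain `nn.Conv2d` are from my own setup, not the official TSM pipeline:

```python
import torch
import torch.nn as nn

# Hypothetical shapes from my gesture dataset: N clips, C channels, T=32 frames.
N, C, T, H, W = 4, 3, 32, 112, 112
x = torch.randn(N, C, T, H, W)

# 2D convs expect (batch, channel, height, width), so fold T into the batch dim.
x = x.permute(0, 2, 1, 3, 4).contiguous()   # (N, T, C, H, W)
x = x.view(N * T, C, H, W)                  # (N*T, C, H, W)

conv2d = nn.Conv2d(C, 64, kernel_size=3, padding=1)
y = conv2d(x)                               # (N*T, 64, H, W)

# Restore the temporal dimension if the next block needs it.
y = y.view(N, T, 64, H, W).permute(0, 2, 1, 3, 4)  # (N, 64, T, H, W)
```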
Nice work! I'd like to know whether it's possible to extend this deformable kernel to a 3D version and apply it to action recognition? And what should I do? Thanks...
Hi, nice work! I'm interested in your other work, "Fourier Space Losses for Efficient Perceptual Image Super-Resolution". I want to study your code, but I cannot reach a working URL...
Nice work! I want to use context-gated-convolution in action recognition. I noticed that you use torch.nn.Unfold in layer.py, and it needs a 4-D input (batch x channel x height x width). But...
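To make the question concrete, here is a rough sketch of the workaround I am considering: merging the temporal dimension into the batch so torch.nn.Unfold sees a 4-D tensor. The shapes are from my own setup and only illustrate the idea, not how layer.py actually calls Unfold:

```python
import torch

# Hypothetical clip: N clips, C channels, T frames of size H x W.
N, C, T, H, W = 2, 64, 8, 56, 56
x = torch.randn(N, C, T, H, W)

# torch.nn.Unfold only accepts (batch, channel, height, width),
# so treat every frame as an independent sample.
x2d = x.permute(0, 2, 1, 3, 4).reshape(N * T, C, H, W)

unfold = torch.nn.Unfold(kernel_size=3, padding=1)
patches = unfold(x2d)                      # (N*T, C*3*3, H*W)

# Reshape back so the temporal dimension is available downstream again.
patches = patches.view(N, T, C * 9, H * W)
```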
Thanks for the repo. I don't understand the difference between DeformConv_d and DeformConvPack_d. The main difference I found in the source code (deform_conv.py) is that, in DeformConv_d, offset = temp.clone().resize_(b,...
Hi, nice work! I'm confused about how to visualize the learned offsets as shown in Fig. 6. Could you give some reference code?
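In case it helps to clarify what I'm after, here is a rough sketch of how I imagine the offsets could be plotted: capture the offset map with a forward hook and draw it as arrows. The toy `offset_conv` layer and the channel layout (dy before dx, one pair per sampling point) are assumptions on my side, not taken from this repo:

```python
import torch
import torch.nn as nn
import matplotlib.pyplot as plt

# Toy stand-in for the layer that predicts 2*K*K offset channels
# (a dy/dx pair per kernel sampling point).
offset_conv = nn.Conv2d(3, 2 * 3 * 3, kernel_size=3, padding=1)

offsets = {}
def save_offset(module, inputs, output):
    # Cache the predicted offset map, assumed shape (N, 2*K*K, H, W).
    offsets["value"] = output.detach().cpu()

hook = offset_conv.register_forward_hook(save_offset)
with torch.no_grad():
    offset_conv(torch.randn(1, 3, 32, 32))
hook.remove()

off = offsets["value"][0]                # (2*K*K, H, W) for one sample
dy, dx = off[0].numpy(), off[1].numpy()  # first sampling point (layout assumed)

ys, xs = torch.meshgrid(torch.arange(off.shape[1]),
                        torch.arange(off.shape[2]), indexing="ij")
plt.quiver(xs.numpy(), ys.numpy(), dx, dy)
plt.gca().invert_yaxis()                 # image coordinates: y grows downward
plt.title("Learned offsets (first sampling location)")
plt.show()
```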
Nice work! I notice that you pretrain a VQ-VAE to compress the image sequence into a discrete latent space, and explore an auto-regressive decoder named Earthformer-AR. I'm interested in...
Nice work! However, I'm a bit confused about the "many to one" training style. Does it mean that, in such a video reconstruction network, you have N reconstructions that...