GuideNet icon indicating copy to clipboard operation
GuideNet copied to clipboard

Question for the kernel design in multi-batch input.

Open LifeBeyondExpectations opened this issue 6 years ago • 5 comments

Thank you for your nice work. Since the code is not yet open, I write down my question for your kernel design.

In the paper, given the image feature information, you set this feature as the convolution weight. My question is when the size of the batch of the input images is larger than 1, how did you compute the convolution operation?

In pytorch, kernel shape of CNN is [out_channels, in_channels, kernel_size, kernel_size] which means that the same kernels are applied to different multi-batch tensor (e.g. img.shape = [batch, channel, height, width]). So I guess that you may split the multi-batch-tensor into single-batch-tensor for computation... or did you personally re-implement the convolution operation??..

LifeBeyondExpectations avatar Jan 14 '20 14:01 LifeBeyondExpectations

@LifeBeyondExpectations Have you released the paper successfully?

LiangliangGuo avatar Feb 27 '20 10:02 LiangliangGuo

Thank you for your nice work. Since the code is not yet open, I write down my question for your kernel design. In the paper, given the image feature information, you set this feature as the convolution weight. My question is when the size of the batch of the input images is larger than 1, how did you compute the convolution operation? In pytorch, kernel shape of CNN is [out_channels, in_channels, kernel_size, kernel_size] which means that the same kernels are applied to different multi-batch tensor (e.g. img.shape = [batch, channel, height, width]). So I guess that you may split the multi-batch-tensor into single-batch-tensor for computation... or did you personally re-implement the convolution operation??..

The author claimed that he wrote some CUDA code. I have a reimplemented version of guideconv version using conv and bmm. But I got even worse results with the guideconv fusion blocks than the naïve concatenating strategy. That's weird.

JUGGHM avatar Mar 03 '20 04:03 JUGGHM

@JUGGHM could you please share your code with me ?maybe i can provide some suggestions

dv0340 avatar May 17 '20 02:05 dv0340

@JUGGHM could you please share your code with me ?maybe i can provide some suggestions

The implemention actually refered to the CSPN naïve implemention by the author of CSPN, and it's easy to understand.

JUGGHM avatar Jun 01 '20 23:06 JUGGHM

Hi guys, I've shared my naive implementation based on CSPN in https://github.com/kakaxi314/GuideNet/issues/10, let me known if it helps, or you have any questions or suggestions.

AlanDecode avatar Jun 07 '20 10:06 AlanDecode