[QST] in implicit gemm conv, why does not support split-k when group !=1 ?
the file is include/cutlass/conv/device/implicit_gemm_convolution.h
in implicit gemm conv, why does not support split-k when group !=1 ?
is this for performance ?
@Junkai-Wu
For group fprop/dgrad, usually C/K per group is small, e.g. < 64, and RS dimension of filter is not large, e.g. 3x3, so there is no need to do split k. In other words, group is already used to split the C/K dimension in fprop/dgrad, there is no need to use split_k to split it again.
thank you. does wgrad support splitk, which seems necessary in scenario like depthwise conv ?
This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.
This issue has been labeled inactive-90d due to no recent activity in the past 90 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.