cutlass
CUDA Templates for Linear Algebra Subroutines
Results
1
cutlass issues
Sometimes when the tensor format changes after this conv (e.g., NCHW -> NHWC for layer normalization), calling backward will raise an "input must be contiguous" error. Making the grad contiguous...
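The issue text is cut off, so the exact fix it proposes is not known here. As background only, the following is a minimal pure-Python sketch of why a layout change such as NCHW -> NHWC tends to produce a non-contiguous tensor (and so a non-contiguous gradient). The helper names `row_major_strides`, `permute`, and `is_contiguous` are hypothetical and are not part of CUTLASS or any framework; strides are counted in elements, and a permutation is modeled as a view that reorders shape and strides without moving data.

```python
def row_major_strides(shape):
    """Strides (in elements) of a densely packed row-major tensor."""
    strides = []
    step = 1
    for dim in reversed(shape):
        strides.append(step)
        step *= dim
    return tuple(reversed(strides))


def permute(shape, strides, order):
    """Reorder dimensions the way a view would: no data is moved."""
    new_shape = tuple(shape[i] for i in order)
    new_strides = tuple(strides[i] for i in order)
    return new_shape, new_strides


def is_contiguous(shape, strides):
    """True when strides match a dense row-major layout for this shape.

    Dimensions of size 1 are ignored, since their stride never matters.
    """
    expected = row_major_strides(shape)
    return all(s == e for d, s, e in zip(shape, strides, expected) if d > 1)


# Hypothetical example sizes: N=2, C=3, H=4, W=5.
nchw_shape = (2, 3, 4, 5)
nchw_strides = row_major_strides(nchw_shape)

# View the same memory in NHWC order (N, H, W, C).
nhwc_shape, nhwc_strides = permute(nchw_shape, nchw_strides, (0, 2, 3, 1))

print(nhwc_shape, nhwc_strides, is_contiguous(nhwc_shape, nhwc_strides))

# A contiguous copy would instead carry dense strides for the new shape.
print(row_major_strides(nhwc_shape))
```

In this model the permuted view fails the contiguity check because its strides still describe the original NCHW memory order. A kernel that assumes dense input would therefore need a contiguous copy of the gradient first, which appears to be the direction the truncated issue text is heading.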