gpu-sum-reduction
gpu-sum-reduction copied to clipboard
CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable.
Results
2
gpu-sum-reduction issues
Sort by
recently updated
recently updated
newest added
Hi, I'm learning the reduction of cuda with [nvidia doc](https://developer.download.nvidia.cn/compute/cuda/1.1-Beta/x86_website/projects/reduction/doc/reduction.pdf), I unroll the last warp with device funciton `WarpReduce`, the result is correct. however, the result is wrong when i...
I have tried the `reduce0`, and it return the wrong result. How can I fix it?