RustaCUDA icon indicating copy to clipboard operation
RustaCUDA copied to clipboard

Add cooperative kernel launch API

Open LutzCle opened this issue 4 years ago • 0 comments

Pascal and newer devices support cooperative groups. These groups enable kernels to, e.g., globally synchronize the grid without terminating the kernel.

A special function cuLaunchCooperativeKernel was added to CUDA 9.0 to launch a cooperative group. This PR adds this launch API to RustaCUDA.

Example:

unsafe { launch_cooperative!( kernel<<<grid, block>>>() ).unwrap() };

See the CUDA documentation and this Nvidia Developer blog post for more information.

LutzCle avatar Oct 15 '21 20:10 LutzCle