cuda-python
cuda-python copied to clipboard
Allow user-provided annotation for kernels generated from `Program` and`ObjectCode`
We have examples like this today
kernel = module.get_kernel("vectorAdd<float>")
which does not tell us what args are expected on the device side, and so when launch(s, config, kernel, ...) is called we cannot do type checking for users.