Andreas Klöckner
Could you explain how this relates to pycuda? Sorry if I'm being slow here. FWIW, if you're including `` in kernel code, I don't think that works. AFAIK, it needs...
I see. What do you propose? Wrapping the kernel in `extern "C"` by default was done to make symbol lookup more straightforward, and as a matter of compatibility, that's not...
I'd view that as about as complicated as managing `extern "C"` manually. What's the advantage vs. that in your view?
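To make the trade-off in the two comments above concrete, here is a minimal sketch of what default `extern "C"` wrapping amounts to, next to managing it by hand. The helper `wrap_extern_c` and both kernel strings are hypothetical and only mimic the idea; this is not PyCUDA's actual implementation. The flag name mirrors the `no_extern_c` argument of `pycuda.compiler.SourceModule`. The sketch runs without a GPU because it only manipulates source text.

```python
def wrap_extern_c(source: str, no_extern_c: bool = False) -> str:
    """Hypothetical helper: wrap kernel source in extern "C" unless told not to.

    With C linkage, the kernel's symbol name is not mangled, so it can be
    looked up by its plain name (e.g. "double_it").
    """
    if no_extern_c:
        # The caller takes responsibility for linkage.
        return source
    return 'extern "C" {\n%s\n}\n' % source


# Case 1: a plain kernel; the default wrapping is all that is needed.
kernel = """
__global__ void double_it(float *a)
{
    a[threadIdx.x] *= 2;
}
"""
wrapped = wrap_extern_c(kernel)

# Case 2: the source contains a template. Templates cannot have C linkage,
# so the wrapping is switched off and extern "C" is applied by hand to the
# kernel entry point only.
manual_source = """
template <typename T>
__device__ T twice(T x) { return 2 * x; }

extern "C" __global__ void double_it(float *a)
{
    a[threadIdx.x] = twice(a[threadIdx.x]);
}
"""
unwrapped = wrap_extern_c(manual_source, no_extern_c=True)

print(wrapped)
```

In both cases the kernel stays reachable under the unmangled name `double_it`; the difference is only in who writes the `extern "C"`.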
Thanks for this request! I would be happy to consider a PR that adds this. This should be simple to do. Within the `stream` object here, simply track whether the...
Thanks for following up! I took a look at why there is no warning about the silenced exception. It turns out it's getting caught here: https://github.com/inducer/pycuda/blob/a25ed986bb6acfdc6e4e64371510a577860eec19/src/cpp/cuda.hpp#L1010 and then simply silenced here: https://github.com/inducer/pycuda/blob/a25ed986bb6acfdc6e4e64371510a577860eec19/src/cpp/cuda.hpp#L161-L171...
This issue tracker is for bugs, not technical support. Please send a message to the mailing list for tech support. Look up PCIe bandwidth to help figure out why your...
This looks great, thanks for working on this! To be merged, it'd of course need docs and tests. For lack of GPUs, I don't have usable CI for PyCUDA on...
Made an account for you, you should have that info in your email. The site is at https://gitlab.tiker.net/inducer/pycuda.
It's clear that this should happen, ideally soon. As it happens, there are now two (draft) versions of this, one here: https://gitlab.tiker.net/kaushikcfd/pycuda/-/merge_requests/2/diffs and the other one in this PR. (They...
I agree that it wouldn't be very complicated. You would need to use the [driver API functions](https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__EXEC.html#group__CUDA__EXEC). I'd be happy to review a PR.