Andreas Klöckner comments

Results 957 comments of


                                            Andreas Klöckner

Is there a way to share context among threads, if not why?

This also came up recently in https://github.com/inducer/pycuda/issues/305#issuecomment-887761332. PyCUDA currently assumes that each context can only be active in a single thread. It appears that this was true up until CUDA...

Is there a way to share context among threads, if not why?

How come this got closed? The question you raised is a real concern to my mind, and I wouldn't be opposed to the issue staying open.

Is there a way to share context among threads, if not why?

PyCUDA isn't doing anything special with memcpy. It just calls the corresponding CUDA function. For an additional speedboost, you can use "page-locked" memory (on the host side).

[codegen, bug]: Callee kernel name generation is incorrect

Yep, agree the the name should be decided once and then not messed with.

Type inference failure

Thanks for reporting this! #483 fixes the unassigned variable (which is definitely a bug). ed5d1458abb07f7d30de4854b1e4f427480e52df was an attempt to make type inference succeed, but given that `insn1` is self-referential, I...

Type inference failure

That seems similar to ed5d1458abb07f7d30de4854b1e4f427480e52df in its approach, however I'm not sure either approach necessarily yields a consistent treatment of self-referential assignments. Just skipping them basically ignores type constraints that...

Andreas Klöckner

Is there a way to share context among threads, if not why?

Is there a way to share context among threads, if not why?

Is there a way to share context among threads, if not why?

[codegen, bug]: Callee kernel name generation is incorrect

Type inference failure

Type inference failure

Shipped Boost.Python is incompatible with Python 3.11

Shipped Boost.Python is incompatible with Python 3.11

Shipped Boost.Python is incompatible with Python 3.11

Shipped Boost.Python is incompatible with Python 3.11