Add dynamic shared memory allocation

Open michael-kenzel opened this issue 2 years ago • 0 comments

This adds a parameter to allocate a given amount of dynamic shared memory upon kernel launch. Wrapper functions that just pass 0 are provided for backwards compatibility with existing code. Currently implemented for CUDA only, other platforms will error.

corresponding thorin changes: https://github.com/AnyDSL/thorin/pull/144

Aug 04 '23 21:08 michael-kenzel