Matt Wala
Results
2
comments of
Matt Wala
Here's a reduced C++ reproducer: ```c++ #include #include #include #include #include const char *SOURCE = R"RAW( #define lid(N) ((int) get_local_id(N)) #define gid(N) ((int) get_group_id(N)) __kernel void __attribute__ ((reqd_work_group_size(16, 16, 1)))...
A set of transformation/code generation benchmarks might be useful.