Victor Lomuller comments

Results: 11 comments of Victor Lomuller

[Testing][SYCL] Do not decompose structs with pointers

> Maybe we can add a compiler option to avoid the wrapping, i.e. if the user sets the flag, it promises that all pointers captured inside device code are USM pointers....

[Testing][SYCL] Do not decompose structs with pointers

> I see at least two directions to explore:
>
> 1. pass "packed" lambda object instead of original lambda object. "Packed" object should have only "live" data (i.e. referenced...
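The "packed" idea in the quote above can be illustrated by hand in plain C++. This is only a minimal sketch, not the compiler transformation being discussed: `KernelState`, `make_original`, and `make_packed` are hypothetical names, and the packing is done manually by capturing just the field the body reads.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical state: only `scale` is read by the lambda body ("live" data);
// `scratch` is never referenced inside it.
struct KernelState {
    int scale;
    int scratch[64];
};

// "Original" lambda object: captures the whole struct by value.
inline auto make_original(const KernelState& state) {
    return [state](int x) { return x * state.scale; };
}

// "Packed" lambda object: captures only the field the body actually uses.
inline auto make_packed(const KernelState& state) {
    const int scale = state.scale;
    return [scale](int x) { return x * scale; };
}

// Checks that both objects compute the same result while the packed one is
// smaller. Closure size is not fixed by the C++ standard, but mainstream
// compilers store exactly the captured members.
inline bool packed_is_smaller_and_equivalent() {
    KernelState state{};
    state.scale = 3;
    auto original = make_original(state);
    auto packed = make_packed(state);
    return sizeof(packed) < sizeof(original) && original(7) == packed(7);
}
```

Calling `packed_is_smaller_and_equivalent()` compares the two closures; how a SYCL compiler would derive the packed form automatically is outside what this sketch shows.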

Application aborted when passing nullptr as a kernel argument on OpenCL backend

I guess the SYCL to OpenCL kernel lowering could be improved to handle `std::nullptr_t`, which is the type captured when your bool condition is false. Otherwise your bug seems to be...
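The type difference mentioned above can be seen in plain C++, without any SYCL code. The reporter's original code is not shown here, so this is a sketch under assumptions: `is_null_literal`, `branch_types_differ`, and `typed_null_is_pointer` are hypothetical helpers, and giving the null case an explicit pointer type is only one possible way to avoid capturing `std::nullptr_t`, not something the comment itself recommends.

```cpp
#include <cassert>
#include <cstddef>
#include <type_traits>

// Hypothetical helper: true when the argument's type is std::nullptr_t,
// i.e. the type of the bare `nullptr` literal rather than a real pointer type.
template <typename T>
constexpr bool is_null_literal(const T&) {
    return std::is_same_v<T, std::nullptr_t>;
}

// With `auto`, a valid address deduces a pointer type, while the bare literal
// deduces std::nullptr_t. A lambda capturing the second variable by value
// would therefore hold a std::nullptr_t member, not an int*.
inline bool branch_types_differ() {
    int value = 0;
    auto taken = &value;       // deduced as int*
    auto not_taken = nullptr;  // deduced as std::nullptr_t
    return !is_null_literal(taken) && is_null_literal(not_taken);
}

// Assumed workaround: spell out the pointer type so the null case is still
// an ordinary pointer.
inline bool typed_null_is_pointer() {
    int* typed_null = nullptr;
    return std::is_pointer_v<decltype(typed_null)> && typed_null == nullptr;
}
```

Both helpers return `true` on a conforming C++17 compiler; whether a given backend accepts a `std::nullptr_t` kernel argument is a separate question that this sketch does not answer.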

Is a 2d array treated internally as 1d array at assembly level in dpcpp for NVIDIA BACKEND?

(I'm assuming you are comparing DPC++ against NVCC) I think you are just noticing a general LLVM/NVPTX vs NVCC optimization difference. In your sample, looking at the output of the...
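As background to the question in the title, a built-in C++ 2D array is already laid out as one contiguous row-major block, so element `[row][col]` sits at flat offset `row * cols + col`. The sketch below checks that in host-only C++; `kRows`, `kCols`, `flat_index`, and `layout_matches_flat_index` are hypothetical names, and nothing here describes what DPC++ or NVCC actually emit for the sample in the issue.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

constexpr std::size_t kRows = 3;
constexpr std::size_t kCols = 4;

// Row-major mapping from a 2D index to a 1D offset.
inline std::size_t flat_index(std::size_t row, std::size_t col) {
    return row * kCols + col;
}

// Fills a 2D array, copies its bytes into a 1D array, and checks that every
// element is found at the row-major offset.
inline bool layout_matches_flat_index() {
    int grid[kRows][kCols];
    for (std::size_t r = 0; r < kRows; ++r)
        for (std::size_t c = 0; c < kCols; ++c)
            grid[r][c] = static_cast<int>(10 * r + c);

    int flat[kRows * kCols];
    std::memcpy(flat, grid, sizeof(grid));  // same size: arrays have no padding

    for (std::size_t r = 0; r < kRows; ++r)
        for (std::size_t c = 0; c < kCols; ++c)
            if (flat[flat_index(r, c)] != grid[r][c])
                return false;
    return true;
}
```

Because the layout is the same either way, differences in generated code between two compilers would come from how each optimizes the index arithmetic, which is consistent with the comment above.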

[SYCL] Don't set PI_USM_INDIRECT_ACCESS if platform don't support it

A few points:
- I'm not too sure how to create a test for that, I'm happy to try suggestions if you have any
- another way to do this...

[SYCL] Don't set PI_USM_INDIRECT_ACCESS if platform don't support it

@steffenlarsen After discussion with Beni, I cut the link to the UR patch (will be caught with another bump). So if you are happy you can approve it, no risk...

[SYCL] Don't set PI_USM_INDIRECT_ACCESS if platform don't support it

@intel/llvm-gatekeepers Ready to merge (failing CI job is unrelated and common to other PRs)

[SYCL] Add a CUDA compatibilty mode

Part of the idea is to allow users to call CUDA device functions from a SYCL kernel. The underlying motivation is actually to have a mode that would support the...

cl code within C++ source code?

CUDA relies on directives such as `__device__`, `__host__` and `__global__` to drive what goes on the host or the device. OCL doesn't have such things (the spec even has some...

__builtin_printf not diagnosed but results in invalid SPIR-V

> Despite the fact that in SPIR-V, it does not and cannot work.

It can: https://registry.khronos.org/SPIR-V/specs/unified1/OpenCL.ExtendedInstructionSet.100.html#printf It is just improperly lowered by the translator. Note: DPCPP is also using an...