oneDPL
oneDPL copied to clipboard
Remove local (in-group) atomic usage from `__parallel_find_or`
In this PR we remove local (in-group) atomic usage from __parallel_find_or implementation:
-
__found_localnow is local variable (for each item); - barriers doesn't required anymore;
- we updates the state of
__found_localwithin one group through__dpl_sycl::__reduce_over_groupoperation :
__found_local = __dpl_sycl::__reduce_over_group(__item_id.get_group(), __found_local,
typename _BrickTag::FoundLocalReduceOp{});
where FoundLocalReduceOp is :
-
__dpl_sycl::__minimum(for the__parallel_find_forward_tag); -
__dpl_sycl::__maximum(for the__parallel_find_backward_tag); -
__dpl_sycl::__plus(for the__parallel_or_tag).
FPGA emulation CI seems to time out. @SergeyKopienko do you know why?
FPGA emulation CI seems to time out. @SergeyKopienko do you know why?
Will check.
FPGA emulation CI seems to time out. @SergeyKopienko do you know why?
- fixed.
Let's continue the work in https://github.com/oneapi-src/oneDPL/pull/1668