Jakub Szuppe

Results 13 issues of Jakub Szuppe

https://github.com/google/benchmark/commit/e776aa0275e293707b6a0901e0e8d8a8a3679508 This commit adds `list(FILTER SOURCE_FILES EXCLUDE REGEX "benchmark_main\\.cc")` line to https://github.com/google/benchmark/blob/master/src/CMakeLists.txt. `list(FILTER` was introduced in CMake 3.6, however you script sets minimum required CMake version at `2.8.12`.

enhancement
help wanted

```cpp ... int input = 1; int init = 10; int inclusive_output, exclusive_output; using WarpScan = cub::WarpScan; __shared__ typename WarpScan::TempStorage storage[warps_no]; WarpScan(storage[warp_id]) .Scan(input, inclusive_output, exclusive_output, init, cub::Sum()); ... ``` Should...

type: enhancement
P1: should have
area: docs
area: tests

- [ ] In [documentation on boost.org](http://www.boost.org/doc/libs/1_61_0/libs/compute/doc/html/boost_compute/reference.html#boost_compute.reference.api_overview) links to APIs of core classes are not there, however, everything is fine [here](http://boostorg.github.io/compute/boost_compute/reference.html#boost_compute.reference.api_overview) on GitHub. - [ ] snippets do not work...

bug
documentation
help wanted

If you look at [enqueue_write_buffer()](https://github.com/boostorg/compute/blob/master/include/boost/compute/command_queue.hpp#L432) implementation (and other writes methods without `_async` suffix), you can think that all those operations are 100% blocking, i.e., they do not return until memory...

question
design

All over the documentation and in the tests we use fundamental C++ types like `int`, `short`, `long`. That may cause problems when their sizes do not match sizes of OpenCL...

test
task

`valarray` needs some fixing. Currently, it's possible to create a `valarray` object with buffer in some custom context that is not the context of `system::default_queue()` which is used in every...

bug
task

This is a "beta" version of distributed work infrastructure: `distributed::context`, `distributed::command_queue`, `distributed::vector`. I'm open to comments and suggestions. The idea is pretty simple: `distributed::context` is a wrapper for many OpenCL...

Hi, Tests `fill_svm_buffer`, `copy_svm_ptr` and `sum_svm_kernel` fail on my AMD Radeon R9 380. I managed to figure out that is all because of `command_queue::enqueue_svm_memcpy(..)` not working. It does not copy...

driver bug

```cpp // When functions in classes tuple_value and tuple_impl are marked with // hc and cpu attributes there are linking problems: // // /tmp/tmp.xgmMisjvuh/test_hc.cpp.host.o: In function `main': // (.text+0x7af): undefined...

We don't know why, but sometimes HIP's shuffle functions can produce incorrect results. We assume that incorrect ISA is produced. If we replace those `__shfl*` functions with HCC's `hc::__shfl*`, everything...