gtensor
gtensor copied to clipboard
GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.
adds benchmarks and improvements for sycl backend
As of now, usage of half precision is not straightforward. Not only for extension libraries as mentioned in issue #266 , but also for the generation of standard kernels. E.g.,...
Building codes that use `gt::adapt_device` can result in a lot of spurious warnings: ``` include/gtensor/gtensor_span.h(263): warning #20011-D: calling a __host__ function("gt::sarray ::sarray(const int *, unsigned long)") from a __host__ __device__...
Note that for some reason the gtensor specified AssignN kernel names no longer work with the additional template parameter on sycl ext complex type; the name is missing the void...
While much of gtensor is type independent, the extension libraries like gt-blas, gt-fft, gt-solver, and the complex helpers, have some type specific details that may need to be modified to...
I've seen this in two places now: ```cxx template inline void gtensor_span::fill(const value_type v) { if (v == T(0)) { auto data = gt::backend::raw_pointer_cast(this->data()); backend::standard::memset(data, 0, sizeof(T) * this->size()); }...
If a `stream` or `stream_view` object is const, it becomes useless, as no methods will be allowed. This came up in GENE where there is a helper class which has...
The current caching allocator has a separate cache per instance of the class, which is templated on ``. Separate per space is necessary, but per-type is not. It would be...