cmpfeil
cmpfeil
As of now, usage of half precision is not straightforward. Not only for extension libraries as mentioned in issue #266 , but also for the generation of standard kernels. E.g.,...
## Mixed-precision extension to gtensor Provides template functions `mxp::adapt` (and `mxp::adapt_device`) inspired from and extending `gt::adapt`. An `mxp::mxp_span` (derived from `gt::gtensor_span`) is returned, enabling mixed precision computations in gtensor kernels,...
Uses 16 bit FP types from HIP headers (``, ``) when CUDA headers not available. BF16 tests pass on AMD MI300A when built with module `rocm/6.3` loaded via ``` cmake...