cmpfeil issues

Repositories
Issues
Comments

Results 3 issues of


                                            cmpfeil

half precision axpy

As of now, usage of half precision is not straightforward. Not only for extension libraries as mentioned in issue #266 , but also for the generation of standard kernels. E.g.,...

mxp: Mixed precision extension

## Mixed-precision extension to gtensor Provides template functions `mxp::adapt` (and `mxp::adapt_device`) inspired from and extending `gt::adapt`. An `mxp::mxp_span` (derived from `gt::gtensor_span`) is returned, enabling mixed precision computations in gtensor kernels,...

Enable gt half precision types for HIP

Uses 16 bit FP types from HIP headers (``, ``) when CUDA headers not available. BF16 tests pass on AMD MI300A when built with module `rocm/6.3` loaded via ``` cmake...