stdBLAS
stdBLAS copied to clipboard
Reference Implementation for stdBLAS
Implement P3222R0 ("Add `transposed` special cases for P2642 layouts"). The corresponding paper PR is https://github.com/ORNL/cpp-proposals-pub/issues/448. Add tests for previously supported cases and the new cases. Implement P3050R2 ("Optimize `linalg::conjugated` for...
See https://github.com/ORNL/cpp-proposals-pub/pull/454 (paper PR).
Putting `#include ` in a source file in Visual Studio 2022, using the VS compiler, results in a compiler error. There is no error when I switch to the LLVM...
PR #263 (merged today) required a hack to get use of `submdspan` to build correctly. The issue is that this repository currently assumes that `#include ` puts all `mdspan` components...
This is a follow-on to PR #87.
* Rename `accessor_scaled` to `scaled_accessor` * Change `scaled_accessor::reference` to the actual product value, and change `element_type` accordingly * Add full converting constructor (scaling factor and/or input `scaled_accessor`'s nested accessor could...
Make `scale` work for `std::execution::par`. This is rebased atop PR #256; please merge that first; thanks!
GCC, even 13.1.0, needs TBB in order for `` features to compile. Thus, we added a `LINALG_ENABLE_TBB` CMake option, and documented the option in README.md. We've tested the option using...
@crtrott @dalg24 @amklinv-nnl has been working on parallel specialization of stdblas algorithms. The two of us tried to specialize `matrix_vector_product` for `std::execution::parallel_policy`, but keep getting run-time recursion. We're guessing that...
@mhoemmen @crtrott Please review at your convenience and I'll do the same for the other functions.