Elmar Peise

6 comments by Elmar Peise

`dgemm` is part of BLAS, not LAPACK. ReLAPACK must be linked with a BLAS implementation, which will provide `dgemm`. `dgemmt` is not part of BLAS but is needed by a LAPACK...

A ReLAPACK-style recursive `dgemm` implementation would outperform the reference BLAS, but almost certainly not reach the performance of high-performance BLAS implementations. Such libraries are tuned for specific CPU architectures and...
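To make the idea of a recursive `dgemm` concrete, here is a minimal sketch in C. It is not ReLAPACK code: the names `rec_gemm` and `gemm_base` and the `CROSSOVER` value are hypothetical, and the update is simplified to `C += A * B` on column-major matrices (no transposition, no `alpha`/`beta`). It halves the largest of the three dimensions until the problem is small, then falls back to a plain triple loop.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical crossover size below which recursion stops. */
enum { CROSSOVER = 8 };

/* Base case: plain triple loop, C += A * B, column-major storage. */
static void gemm_base(size_t m, size_t n, size_t k,
                      const double *A, size_t ldA,
                      const double *B, size_t ldB,
                      double *C, size_t ldC)
{
    for (size_t j = 0; j < n; j++)
        for (size_t l = 0; l < k; l++) {
            const double b = B[l + j * ldB];
            for (size_t i = 0; i < m; i++)
                C[i + j * ldC] += A[i + l * ldA] * b;
        }
}

/* Recursive C += A * B, where A is m x k, B is k x n, C is m x n.
 * Each level halves the largest of m, n, k. */
void rec_gemm(size_t m, size_t n, size_t k,
              const double *A, size_t ldA,
              const double *B, size_t ldB,
              double *C, size_t ldC)
{
    assert(ldA >= m && ldB >= k && ldC >= m);

    if (m <= CROSSOVER && n <= CROSSOVER && k <= CROSSOVER) {
        gemm_base(m, n, k, A, ldA, B, ldB, C, ldC);
        return;
    }

    if (m >= n && m >= k) {
        /* Split the rows of A and C. */
        const size_t m1 = m / 2;
        rec_gemm(m1, n, k, A, ldA, B, ldB, C, ldC);
        rec_gemm(m - m1, n, k, A + m1, ldA, B, ldB, C + m1, ldC);
    } else if (n >= k) {
        /* Split the columns of B and C. */
        const size_t n1 = n / 2;
        rec_gemm(m, n1, k, A, ldA, B, ldB, C, ldC);
        rec_gemm(m, n - n1, k, A, ldA, B + n1 * ldB, ldB, C + n1 * ldC, ldC);
    } else {
        /* Split the inner dimension: two successive updates of the same C. */
        const size_t k1 = k / 2;
        rec_gemm(m, n, k1, A, ldA, B, ldB, C, ldC);
        rec_gemm(m, n, k - k1, A + k1 * ldA, ldA, B + k1, ldB, C, ldC);
    }
}
```

The recursion only improves data locality. The base case here is an untuned loop nest, and that architecture-specific kernel is where optimized BLAS libraries invest their effort.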

Would you mind checking if #8 solves this?

No problem. I'm only maintaining this on the side, but if you find small things to change/fix, please let me know. Also, of course any pull requests are welcome :-)

Regarding GBTRF: For a small bandwidth _kl_, that function degenerates into tail recursion of depth _O(n/kl)_ (instead of the desired _O(log(n))_): https://github.com/HPAC/ReLAPACK/blob/d24495f875e5c6ee208b981b43b9b63b8bfcbba6/src/dgbtrf.c#L107 Instead of disabling the BR recursion, we might want to...
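The gap between the two depths can be illustrated with a small counting model in C. This is a simplified sketch, not the logic in `dgbtrf.c`: the functions `depth_peel` and `depth_halve` are hypothetical, and they only count recursion levels under the assumption that one scheme peels off a panel of width _kl_ per level while the other halves the remaining columns.

```c
#include <assert.h>

/* Depth when each level peels off a panel of width kl and tail-recurses
 * on the remaining n - kl columns (assumed model of the small-kl case). */
int depth_peel(int n, int kl)
{
    assert(n >= 0 && kl > 0);
    int depth = 0;
    while (n > kl) {
        n -= kl;
        depth++;
    }
    return depth;
}

/* Depth when each level splits the n columns in half, stopping once the
 * problem is no larger than a (hypothetical) crossover size. */
int depth_halve(int n, int crossover)
{
    assert(n >= 0 && crossover > 0);
    int depth = 0;
    while (n > crossover) {
        n = (n + 1) / 2; /* size of the larger half */
        depth++;
    }
    return depth;
}
```

Under this model, `depth_peel(1000, 10)` is 99 while `depth_halve(1000, 10)` is 7, which is the _O(n/kl)_ versus _O(log(n))_ difference described above.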

Never mind my last suggestion: Since we want to do pivoting on the whole diagonal, we need to do tail recursion. But I suppose we could express that in a...