Mike Bauer

Results 22 comments of Mike Bauer

Thanks @martin-frbg for the information. Unfortunately I really do need these calls to run concurrently and with OpenMP for performance reasons so serializing them inside OpenBLAS is not a reasonable...

@brada4 Read my full post again more carefully. I'm not using one of the normal implementations of the OpenMP runtime. I have a special one that supports concurrent instances of...

@martin-frbg I see calls into OpenMP being serialized as you describe and the results are deterministic. Performance is 2X less than it should be, which for this particular workloads is...

@TiborGY using OpenMP is a requirement for me for performance reasons. @brada4 You statement that "the library you point to just permits to next KMP with OMP" is incorrect. That...

I think it would still be good to document it somewhere in the readme just like is done with USE_OPENMP and DEBUG options. @brada4 I don't want it to be...

Yes, unfortunately the runtime under Legion (called Realm) still needs you to tell it how to configure the machine. In this case you'll want to use the `-ll:ocpu ` command...

I suspect none of the tests in the test suite are actually big enough to exercise the bug. Legate's default partitioning heuristics won't over-partition the input arrays to span multiple...

> I'll just try looping the test for now until something breaks and/or running on a really big machine (provided I can get it to run on arm64 or power9...

Where in the wiki can I find the change?

To be clear, that documentation doesn't actually cover the scenario described in this issue (somebody renamed this issue incorrectly). The issue being discussed here is NOT multiple threads calling into...