Switch to use CUDA driver APIs in `Device` constructor
~~Blocked by #459 & https://github.com/NVIDIA/cuda-python/issues/439#issuecomment-2673234572.~~
Before this PR:
In [7]: %timeit Device()
622 ns ± 1.17 ns per loop (mean ± std. dev. of 7 runs, 1,000,000 loops each)
With this PR:
In [20]: %timeit Device()
391 ns ± 1.86 ns per loop (mean ± std. dev. of 7 runs, 1,000,000 loops each)
(Bindings are built from the main branch.)
This pull request requires additional validation before any workflows can run on NVIDIA's runners.
Pull request vetters can view their responsibilities here.
Contributors can view more details about this message here.
/ok to test
Doc Preview CI :---: |
:rocket: View preview athttps://nvidia.github.io/cuda-python/pr-preview/pr-460/
|
https://nvidia.github.io/cuda-python/pr-preview/pr-460/cuda-core/
|
https://nvidia.github.io/cuda-python/pr-preview/pr-460/cuda-bindings/
|
Preview will be ready when the GitHub Pages deployment is complete.
/ok to test c9fac0b
This is ready.
/ok to test d70ec240
/ok to test d279e50
Blocked by #687.
/ok to test 708fd70f
/ok to test 4015f9c8
Thanks, Ralf/Keith!
Doc Preview CI :---: Preview removed because the pull request was closed or merged.