cuda-python icon indicating copy to clipboard operation
cuda-python copied to clipboard

[BUG]: Randomly recurring test_cufile.py::test_get_stats_l3 Segmentation faults

Open rwgk opened this issue 1 month ago • 1 comments

Is this a duplicate?

  • [x] I confirmed there appear to be no duplicate issues for this bug and that I agree to the Code of Conduct

Type of Bug

Runtime Error

Component

cuda.bindings

Describe the bug

In routine testing on a bare-metal (NOT WSL) Ubuntu 24.04 linux-64 workstation I'm seeing randomly recurring test_cufile.py::test_get_stats_l3 Segmentation faults, e.g.:

smc120-0004.ipp2a2.colossus.nvidia.com:/wrk/logs $ grep -a Segmentation *
qa_bindings_linux_2025-12-04+171342_tests_log.txt:Fatal Python error: Segmentation fault
qa_bindings_linux_2025-12-04+171342_tests_log.txt:../ctk-next/qa/13.1.0/qa_bindings_linux_tests.sh: line 60:  5457 Segmentation fault      (core dumped) python -m pytest -ra -s -vv tests/
qa_bindings_linux_2025-12-05+214218_tests_log.txt:Fatal Python error: Segmentation fault
qa_bindings_linux_2025-12-05+214218_tests_log.txt:../ctk-next/qa/13.1.0/qa_bindings_linux_tests.sh: line 61: 48514 Segmentation fault      (core dumped) CUDA_PYTHON_CUDA_PER_THREAD_DEFAULT_STREAM=1 python -m pytest -ra -s -vv tests/
qa_bindings_linux_2025-12-06+224850_tests_log.txt:Fatal Python error: Segmentation fault
qa_bindings_linux_2025-12-06+224850_tests_log.txt:../ctk-next/qa/13.1.0/qa_bindings_linux_tests.sh: line 60: 84340 Segmentation fault      (core dumped) python -m pytest -ra -s -vv tests/

I'm attaching one of the log files. Please see there for details.

qa_bindings_linux_2025-12-06+224850_tests_log.txt

How to Reproduce

See commands in attached log file. Essentially:

cd cuda_bindings/
pip install ...
pytest -ra -s -v tests/

rwgk avatar Dec 07 '25 18:12 rwgk