open-gpu-kernel-modules icon indicating copy to clipboard operation
open-gpu-kernel-modules copied to clipboard

CUDA fails with CUDA call "cu->cuInit(0)" failed with CUDA_ERROR_UNKNOWN (999): unknown error

Open virtualfuzz opened this issue 1 year ago • 1 comments

NVIDIA Open GPU Kernel Modules Version

560.28.03

Please confirm this issue does not happen with the proprietary driver (of the same version). This issue tracker is only for bugs specific to the open kernel driver.

  • [X] I confirm that this does not happen with the proprietary driver package.

Operating System and Version

NixOS 24.11 (Vicuna)

Kernel Release

Linux laptop 6.6.42-xanmod1 #1-NixOS SMP PREEMPT_DYNAMIC Tue Jan 1 00:00:00 UTC 1980 x86_64 GNU/Linux

Please confirm you are running a stable release kernel (e.g. not a -rc). We do not accept bug reports for unreleased kernels.

  • [X] I am running on a stable kernel release.

Hardware: GPU

NVIDIA GeForce RTX 3050 Ti Laptop GPU (UUID: GPU-357aa0ab-a71e-7d0a-b7bd-4d2a99d4e383)

Describe the bug

When I try to use anything that uses CUDA (trying to record with OBS, re-encoding a video, running hashcat), it fails with CUDA call "cu->cuInit(0)" failed with CUDA_ERROR_UNKNOWN (999): unknown error.

To Reproduce

  • Just run a program that uses CUDA

For exemple:

  • Try to record something using OBS.

Bug Incidence

Always

nvidia-bug-report.log.gz

nvidia-bug-report.log.gz

More Info

This bug disappears after running nvidia-bug-report.sh for some reason, CUDA works again after running nvidia-bug-report.sh.

Edit: After more research I found out that running nvidia-settings as root somehow fixes the problem. Edit 2: Loading the nvidia-uvm module though modprobe nvidia-uvm fixes this issue

virtualfuzz avatar Aug 02 '24 16:08 virtualfuzz

I got this running OBS from Flatpak on Ubuntu@latest.

Johnsoct avatar Nov 22 '24 20:11 Johnsoct