torehl
torehl
Which version of ROCm? Didn't see this with 5.2.3. I used ` $ ../configure --prefix=/cm/shared/apps/ucc/1.2.0 --with-avx --with-sse42 --with-ucx=/cm/shared/apps/ucx/1.14.1 --with-cuda=/cm/shared/apps/cuda11.8/toolkit/11.8.0 --with-nccl --with-profiling --with-rocm=/cm/shared/apps/amd/rocm/5.2.3 --with-rccl`
I see this with AMD ROCM 5.7.1 and ucc 1.2.0 with config `../configure --prefix=/cm/shared/apps/ucc/1.2.0 --with-avx2 --with-sse42 --with-ucx --with-cuda=/cm/shared/apps/cuda12.3/toolkit/12.3.2 --with-nccl --with-profiling --with-valgrind --with-avx --with-rocm=/cm/shared/apps/amd/rocm/5.7.1 --with-mpi=/cm/shared/apps/openmpi4-cuda11.8-ofed5-gcc11/4.1.4 --enable-gtest` and snippet `/cm/shared/apps/amd/rocm/5.7.1/bin/hipcc -c ../../../../../../src/components/ec/rocm/kernel/ec_rocm_executor_kernel.cu...
Is there any way to get around this?
Seeing the same. But it is for all 3rdparty tools. The problem is that the 3rdparty tools are not adhering to -DCMAKE_INSTALL_PREFIX. WorkARound(WAR) is to go through each subdir and...
Installed and built it on 6.3.5 fc37. Works fine. Did you rebuilt initramfs after installing it? # dkms status r8152 r8152/2.16.3-20221209, 6.3.5-100.fc37.x86_64, x86_64: installed (original_module exists) [root@srl-torel01 src]# modinfo r8152...
Same issue on gcc-11.3.0. Thanks for workaround.
@Maxzor > I am doing an effort, alongside the dedicated packaging team, for native Debian packaging, and the general assumption that the ROCm toolkit is all under one directory -...
``` [ 10.068955] ------------[ cut here ]------------ [ 10.068955] UBSAN: array-index-out-of-bounds in /home/torel/workspace/VMware-Workstations/vmware-host-modules-works tation-17.5.1/vmmon-only/common/vmx86.c:3652:38 [ 10.068957] index 1 is out of range for type 'MSRReply [*]' [ 10.068957] CPU: 11...
> How you installed the vmware-modules? Yes. As I said, and as confirmed by @priiduneemre it works perfectly on 6.8.7, while I'm getting the above kernel oops on 6.8.8. Haven't...