guangzlu

Results 11 comments of guangzlu

> The hipRTC equivalent APIs for nvrtcGetPTXSize and nvrtcGetPTX are hiprtcGetCodeSize and hiprtcGetCode respectively. Please use these and check if it helps. Thank you very much. But I've found that...

> @yxsamliu Any suggestions regarding this? I've added '--save-temps' to the function 'hiprtcCompileProgram', and I can get the readable isa now, thank you very much for help.

> @yxsamliu Any suggestions regarding this? Sorry, I have a other question. When I am using '--save-temps' to get the isa, it returns HIPRTC_ERROR_COMPILATION for the function 'hiprtcCompileProgram'. Do you...

> It seem to work fine when verified with latest hip using --save-temps . Please share the sample and the release with which you are trying in case you still...

@satyanveshd that worked, thank you very much!

[raylet_out_2024-05-20-16-54-06.txt](https://github.com/ray-project/ray/files/15375034/raylet_out_2024-05-20-16-54-06.txt) Here is our raylet.out log.

Update: it can run with specifying both num_cpus and num_gpus. But why it cannot work when only set num_gpus? And how do ray detect num_cpus by default?How to set the...

> If you don't set num_cpus or num_gpus, Ray will auto detect. Have you tried to not set num_gpus and see if it can detect the CPU and GPU counts?...

**Update:** we can use multiprocessing.cpu_count() to get cpu number successfully. But we cannot set num_cpus too large. We have 192 cpus on the machine, but we can only set num_cpus...

> I had the same error: > > `logging.cc:97: Unhandled exception: N5boost10wrapexceptINS_6system12system_errorEEE. what(): thread: Resource temporarily unavailable [system:11]` > > After making some changes to /etc/security/limits.conf,the error is gone `open...