Divyam Goel
Divyam Goel
Hey @calpt, I'd love to work on this. While the modifications are pretty straightforward, I think that it would save inexperienced users some time if the code were to be...
Hey @calpt, I will take this up then! Will send in a PR soon 😄
Hi @aishwaryap, The metrics file can be found [here](https://drive.google.com/file/d/1lVxHFNHM3Mx0UlKAoXNNlgdEotJOFMUh/view?usp=sharing). Please let me know if you need any other information from my side! Thanks, Divyam
Cool! I'll try it out and let you know how it goes.
Hi @aishwaryap I ran inference on a pertained ET baseline model using `--num_processes 1` on a small subset of EDH instances. I also removed the `try-catch` blocks around the `_run_edh_instance`...
Hi @aishwaryap I tried running the inference on 4GPUs using `--num_processes 2` on the full data. While I did get some successful instances this time around, the process still failed...
Hi @hangjieshi A total of 68 instances were successfully processed. I checked the GPU memory usage multiple times during the process, none of the GPUs ran out of memory at...
Hi @aishwaryap Thank you for your insights. I tried using `replay_timeout = 1000` and ran into the same issue with multiple threads ending the process on account of a `TimeoutError`....