Xavier Pillons
Xavier Pillons
> Good point that I meant to bring up with the PR, is there a preferred way to add tests? Not really. We trust your tests
I've seen this too and I don't know how to fix that. It's good to merge for me now.
@garvct what do you think about this fix ?
@edwardsp can you please have a look ?
Hi @imangohari1 I would suggest you to have a look at our new solution https://github.com/Azure/az-hop which will allow you to do 1. 4 is always done on these machines if...
SLURM is on our roadmap for az-hop, but no ETA defined yet. We will be happy to follow up once added.
Manually reran the pipeline. Gen2 passed. Gen1 failed with error Resource : gpumaster - OSProvisioningTimedOut Message : OS Provisioning for VM 'gpumaster' did not finish in the allotted time. The...
@edwardsp can you have a look to check why the prsync is failing ? I can see in the code that ssh is tested upfront, but I'm not 100% sure...
@lmiroslaw I've created issue #283 to track it and I'm working on it now.
Thanks for the suggestion. It is not only the CUDA option, but creating a build machine and later a GPU cluster with the right configuration. So doable, but it will...