Gautam Kumar

53 comments by Gautam Kumar

Also, to extend this, we should have `status` and `restarts` for running clusters?

```
$ kubectl get rayclusters
NAME                  AGE
raycluster-complete   7m48s
```

it used to be ``` $...

Try this out https://github.com/gautamkmr/examples/blob/master/imagenet/DistributedTraing.md with the fix https://github.com/gautamkmr/examples/commit/4f030db0f5e53e1920690fb6a33b0b7156b53fc0

I think I have changed it a bit; can you take another look? https://github.com/gautamkmr/examples/blob/master/imagenet/DistributedTraing.md

@curry111 Do you mean accessing cluster nodes from the training code, or in general?

@DARK-art108 Are you still facing the issue? I would recommend cloning the manifest repo from [here](https://github.com/awslabs/kubeflow-manifests/tree/v1.3-branch/distributions/aws/examples) if you're installing any version >=1.3.x.

> ons in which we pass an API key and the github username in the request to the API gateway to prevent unauthorized users fro

What's the risk here? Can...

This is important for customers, and it will help them do a single-command deployment.

@hemvats Take a look at this issue: https://github.com/awslabs/kubeflow-manifests/issues/238 I've also provided a sample cluster. https://github.com/weaveworks/eksctl/tree/main/examples Pick one that suits your requirements.

@Harikantipudi Currently the Kubeflow OSS community doesn't offer any upgrade path that keeps the system running (Kubeflow and its components should not go down). Are you referring here to an upgrade in...