et-operator icon indicating copy to clipboard operation
et-operator copied to clipboard

Use kubexec to replace ssh when launcher attach to worker

Open xiaozhouX opened this issue 5 years ago • 0 comments

Now launcher use ssh when attach to workers, there are some problem:

  1. it needs workers open sshd
  2. controller needs to create a keyPair as secret for every job
  3. it makes controller hard to know whether worker training process is shutdown when scaleIn job

A better way is using kubectl exec to replace ssh.

xiaozhouX avatar Jan 06 '21 02:01 xiaozhouX