et-operator
et-operator copied to clipboard
Use kubexec to replace ssh when launcher attach to worker
Now launcher use ssh when attach to workers, there are some problem:
- it needs workers open sshd
- controller needs to create a keyPair as secret for every job
- it makes controller hard to know whether worker training process is shutdown when scaleIn job
A better way is using kubectl exec to replace ssh.