flinkk8soperator
Failed deployment doesn't change the state in flink operator
According to the state machine docs, in Dual mode a failure in either ClusterStarting or SubmittingJob should lead to the RollingBackJob state. However, when I tested this, I noticed the following:
- In the `ClusterStarting` state, when the deployment fails (usually because the image does not exist, or because the sidecar is not injected properly so that some packages are missing), the Flink app gets stuck in `ClusterStarting` and never moves to the `RollingBackJob` state.
- In `SubmittingJob`, badly written Beam Python code sometimes leaves the app stuck in `SubmittingJob` as well. I can see that the operator keeps trying to resubmit the job (and shows error logs in the flink operator) instead of changing to the `RollingBackJob` state as mentioned in the doc.
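To make the expectation concrete, here is a minimal sketch of the transitions I expected from my reading of the docs versus what I observe. The function names and the set of states are hypothetical illustrations based only on the description above; this is not the operator's actual code.

```python
# Hypothetical model of the behavior described above (not the operator's code).
# Assumption: in Dual mode, a failure in either of these phases should roll back.
ROLLBACK_ON_FAILURE = {"ClusterStarting", "SubmittingJob"}


def expected_next_state(current: str, failed: bool) -> str:
    """Phase I expected the application to move to, per the docs."""
    if failed and current in ROLLBACK_ON_FAILURE:
        return "RollingBackJob"
    return current


def observed_next_state(current: str, failed: bool) -> str:
    """Phase I actually see: the application stays where it is on failure."""
    return current


if __name__ == "__main__":
    for phase in ("ClusterStarting", "SubmittingJob"):
        print(phase, "expected:", expected_next_state(phase, True),
              "observed:", observed_next_state(phase, True))
```

The gap between the two functions for a failed `ClusterStarting` or `SubmittingJob` phase is exactly what I am asking about.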
I am wondering whether I am missing some configuration that causes this behavior. Thanks!