Aleksei Partilov

6 comments by Aleksei Partilov

Completed pods are not a problem: they consume no resources, and the Airflow cleanup-pods CronJob deletes them when it runs. However, a TaskInstance can get stuck in...

> Hi @droppoint let us know what you find

My team and I ran an experiment that demonstrated that even if the scheduler shuts down abnormally, the TaskInstance still completes...

We've refactored the _adopt_completed_pods function into the _delete_orphaned_completed_pods function, and it now properly removes completed pods left behind by failed schedulers. Here's a step-by-step breakdown of our test:

1. Set the number...

Hi, everyone! I think I found the root cause of the problem.

Short answer: The [KubernetesExecutor._adopt_completed_pods](https://github.com/apache/airflow/blob/main/airflow/providers/cncf/kubernetes/executors/kubernetes_executor.py#L645-L676) function is not compatible with concurrently running schedulers.

Long answer: I encountered an issue...

@potiuk Thank you. Should I submit a PR that removes _adopt_completed_pods, or is it better to wait for the maintainers' decision on how to fix this problem?

@dirrao This is cool, but I already submitted PR #35800 almost a month ago. I think we are fixing different problems. Your PR addresses the issue when adoption is performed...