rust-extensions icon indicating copy to clipboard operation
rust-extensions copied to clipboard

Bug: pod restarted after containerd restarted when container IO is huge

Open Burning1020 opened this issue 2 years ago • 1 comments

We have found a bug that pods restarted after containerd restarted, especially on the condition that the container IO is very frequent. The parent process of the container processm that is our async containerd -shim process is not very careful to handle the pipe of container process IO. Thus when containerd restarts, some of containers, especially that has huge IO requests, would receive EPIPE and exited unexpectly.

Burning1020 avatar Nov 15 '23 06:11 Burning1020

We have to do some code optimization in handling container IO pipe in the async shim-runc crate, I can fix this lately.

Burning1020 avatar Nov 15 '23 06:11 Burning1020