
Feature: databend-meta: add backoff for send_append_entries, send_install_snapshot, and send_vote RPCs when an error occurs

Open drmingdrmer opened this issue 3 years ago • 0 comments

If a node encounters a StorageError, the raft node stays in a state where it receives requests but cannot handle them. In this case, the leader keeps sending replication data (logs or snapshots) and consumes all available bandwidth.

To avoid this, the leader should sleep for a while after an RPC fails. The sleep time should increase as more errors occur, up to a maximum sleep interval.
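A minimal sketch of what such a backoff could look like, assuming exponential growth (the issue only asks that the sleep time increase and be capped). The `Backoff` type and its `on_error`/`on_success` methods are hypothetical names for illustration, not existing databend-meta or openraft APIs:

```rust
use std::time::Duration;

/// Hypothetical helper: tracks failed RPCs to one target node
/// and computes how long the leader should sleep before retrying.
#[derive(Debug, Clone)]
pub struct Backoff {
    initial: Duration, // sleep after the first failure
    max: Duration,     // upper bound on the sleep interval
    failures: u32,     // failures since the last success
}

impl Backoff {
    pub fn new(initial: Duration, max: Duration) -> Self {
        Self { initial, max, failures: 0 }
    }

    /// Record a failed RPC and return the sleep time before the next attempt.
    /// The delay doubles with each failure (an assumption) and is capped at `max`.
    pub fn on_error(&mut self) -> Duration {
        // Clamp the exponent so the shift below cannot overflow.
        let exp = self.failures.min(16);
        self.failures = self.failures.saturating_add(1);
        let delay = self.initial.saturating_mul(1u32 << exp);
        delay.min(self.max)
    }

    /// Record a successful RPC: the next failure starts from `initial` again.
    pub fn on_success(&mut self) {
        self.failures = 0;
    }
}
```

In a replication loop, the leader might call `on_error()` after a failed `send_append_entries`, `send_install_snapshot`, or `send_vote`, sleep for the returned duration (for example with `tokio::time::sleep`), and call `on_success()` once an RPC goes through. Keeping one `Backoff` per target node would let a single faulty follower be throttled without slowing replication to healthy ones.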

drmingdrmer, Jul 30 '22 03:07