databend
databend copied to clipboard
Feature: databend-meta: add backoff for send_append_entries, send_install_snapshot, send_vote RPC if there is error occurs
If a node encounters StorageError, the raft node stays in a state where it receives a request but can not handle it. In this case, the leader will keep sending replication data(logs or snapshot), and consumes all bandwidth.
To avoid this, the leader should sleep for a while if a previous RPC fails. The sleep time should increase if more errors occur. And there should be a max sleep interval.