temporal
temporal copied to clipboard
Reduce aggressive retries in case of service busy error
Expected Behavior
When there's resource exhausted error, the retry between client/server and within history service is too aggressive. This amplifies the load on the server and often makes the situation worse.
Actual Behavior
We should slow down and/or stop retrying resource exhausted errors, so the system can stabilize by itself.
Steps to Reproduce the Problem
Send consistent load to server, then reduce history persistence max QPS, and then observer the behavior.
Specifications
- Version:
- Platform:
https://github.com/temporalio/temporal/pull/3069