Tony Allen comments

Results: 49 comments of Tony Allen

Failed to load kernel.

Are you running it from the Sia-GPU-Miner directory?

Peak EWMA load balancing

I'm all for introducing a more sophisticated load balancing technique, but we should really understand _why_ what we have today does not address your specific use-case. Before getting into...

Peak EWMA load balancing

> The reason is that when the source pod chooses a destination pod, the active_request queues are mostly empty or at a size of 1/2, so the algorithm works as...

This is great. Thanks for the extra information. So, the `cx_active` stats are showing us the number of active **_connections_**. Envoy's LEAST_REQUEST load balancer normalizes endpoint weights by the number...

Peak EWMA load balancing

Thanks for the extra graphs; that helps in seeing the differences between backends.

> We think that by applying different weights based on historical response time, the spread of the...

Peak EWMA load balancing

> I think that success latency + error rate can provide a good measurement for weight.

I'm not opposed to the idea; we just need to be more specific on...

Peak EWMA load balancing

@jizhuozhi can you just briefly describe how all of this works here? I don't know how mosn works and would appreciate it if you could clearly describe how this new...

Peak EWMA load balancing

> First, the LEAST_REQUEST load balancer (LRLB) calculates the expected completion time of all tasks based on the number of active requests. It assumes that all servers have the same...

Peak EWMA load balancing

> What I need to clarify is that I understand how Least Request is implemented in envoy, but what I want to express is that usually Least Request is abstracted...

Peak EWMA load balancing

That's some convincing data! I'll need to do a more in-depth read, but from my quick pass the data in section 5.2 makes a good case for implementing the **Prequal**...