Chenran Li

Results: 9 comments of Chenran Li

Thanks @nader-ziada and @vagababov! The use case is that our system achieves optimal performance with the `randomChoice2Policy` (this is also why we want to keep the activator in the request...
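(For readers of this thread: `randomChoice2Policy` refers to the activator's power-of-two-choices load-balancing policy: sample two ready backends at random and route the request to the one with fewer in-flight requests. The sketch below only illustrates that idea; the `target` type and `pickP2C` function are made up for this example and are not the activator's actual code.)

```go
package main

import (
	"fmt"
	"math/rand"
)

// target is an illustrative stand-in for a pod tracked by a load balancer.
type target struct {
	addr     string
	inFlight int // current number of in-flight requests
}

// pickP2C sketches the power-of-two-choices policy: sample two distinct
// random targets and return the one with the lower in-flight count.
func pickP2C(targets []target) *target {
	if len(targets) == 0 {
		return nil
	}
	if len(targets) == 1 {
		return &targets[0]
	}
	i := rand.Intn(len(targets))
	j := rand.Intn(len(targets) - 1)
	if j >= i { // shift so the two samples are distinct
		j++
	}
	if targets[j].inFlight < targets[i].inFlight {
		return &targets[j]
	}
	return &targets[i]
}

func main() {
	pods := []target{
		{addr: "10.0.0.1:8012", inFlight: 7},
		{addr: "10.0.0.2:8012", inFlight: 2},
		{addr: "10.0.0.3:8012", inFlight: 5},
	}
	fmt.Println("chosen backend:", pickP2C(pods).addr)
}
```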

Thanks @nader-ziada! Yes, the lb policy is based on the hard limit. Do you think either option 1 or 2 above makes sense? We'll start to work on the PR...

> We currently have `queueDepth := 10 * env.ContainerConcurrency`, and container concurrency can be set on a [per-revision basis](https://knative.dev/docs/serving/autoscaling/concurrency/#hard-limit). So what you're proposing would be to make the 10 configurable...
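(For context on the quoted line: the queue-proxy sizes its request queue as a fixed multiple of the revision's containerConcurrency. The snippet below is a rough, self-contained sketch of that relationship only; the `breakerParams` struct and variable names are illustrative, not the actual code in cmd/queue/main.go.)

```go
package main

import "fmt"

// breakerParams mirrors, for illustration, the knobs the queue-proxy's
// breaker is configured with.
type breakerParams struct {
	QueueDepth      int // how many requests may wait in the queue
	MaxConcurrency  int // how many requests may be in flight at once
	InitialCapacity int
}

func main() {
	containerConcurrency := 10 // set per revision via spec.containerConcurrency

	// Today the multiplier is hard-coded to 10, i.e.
	// queueDepth := 10 * env.ContainerConcurrency.
	params := breakerParams{
		QueueDepth:      10 * containerConcurrency,
		MaxConcurrency:  containerConcurrency,
		InitialCapacity: containerConcurrency,
	}
	fmt.Printf("breaker params: %+v\n", params)
}
```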

> So to put it slightly differently, you're seeing better performance with the activator as the load balancer (using `randomChoice2Policy`) than with a standard Kubernetes service (which is what handles the routing...

@dprotaso could you please take a look at __Feature Request 4__ here? If it makes sense, I'll go ahead and implement it first.

@nader-ziada do you think capping the queue size at `100 * env.ContainerConcurrency` makes sense? So the proposed change is, at [this line](https://sourcegraph.com/github.com/knative/serving@f4ea3ac779621ea133a78a746525f6c6ca9947de/-/blob/cmd/queue/main.go?L326), to set the queueDepth to `X * env.ContainerConcurrency`: *...
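(A minimal sketch of what that proposal could look like, assuming the multiplier `X` is read from a hypothetical `QUEUE_DEPTH_MULTIPLIER` environment variable, defaults to today's value of 10, and is capped so the queue never exceeds `100 * env.ContainerConcurrency`. The env var name and function are illustrative only, not an agreed-upon interface.)

```go
package main

import (
	"fmt"
	"os"
	"strconv"
)

// queueDepthFor sketches the proposed sizing: multiplier * containerConcurrency,
// capped at 100 * containerConcurrency. QUEUE_DEPTH_MULTIPLIER is hypothetical.
func queueDepthFor(containerConcurrency int) int {
	multiplier := 10 // current hard-coded default
	if v, err := strconv.Atoi(os.Getenv("QUEUE_DEPTH_MULTIPLIER")); err == nil && v > 0 {
		multiplier = v
	}
	if multiplier > 100 {
		multiplier = 100 // cap the queue at 100 * containerConcurrency
	}
	return multiplier * containerConcurrency
}

func main() {
	fmt.Println("queueDepth:", queueDepthFor(10))
}
```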

@nader-ziada @dprotaso @psschwei thanks for all the comments! Is it possible for us to get on a call with you guys? We just hope to facilitate the discussion. If you...

@nader-ziada @psschwei Thanks! We'll join the Serving & Networking WG call tomorrow morning at 9:30 PST

@dprotaso summary of our discussion in the Knative Serving Working Group meeting today: Reasons we want to make the queue-proxy queue size configurable (currently it's hard-coded to be 10...