lmdeploy icon indicating copy to clipboard operation
lmdeploy copied to clipboard

[Feature] Can we support parameter n in OpenAI compatible API?

Open Huarong opened this issue 3 months ago • 0 comments

Motivation

In /v1/completions and /v1/chat/completions endpoint, can we support the parameter n?

So that we can sampling multiple outputs for the same input.

Currently, we can only call the endpoint multiple times which is not efficient.

Related resources

No response

Additional context

No response

Huarong avatar Oct 24 '25 10:10 Huarong