lmdeploy
[Feature] Can we support parameter n in OpenAI compatible API?
Motivation
In the /v1/completions and /v1/chat/completions endpoints, can we support the parameter n?
That would let us sample multiple outputs for the same input in a single request.
Currently, we can only call the endpoint multiple times, which is not efficient.
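For illustration, here is what a request would look like if `n` were honored, following the standard OpenAI Chat Completions schema. The model name and sampling values below are placeholders, not lmdeploy defaults:

```python
import json

# Hypothetical request body for /v1/chat/completions. The "n" field is the
# standard OpenAI parameter asking the server to return n sampled
# completions for one prompt in a single call -- the behavior this issue
# requests lmdeploy to support.
payload = {
    "model": "internlm2-chat-7b",  # placeholder model name
    "messages": [{"role": "user", "content": "Write a haiku about GPUs."}],
    "n": 4,              # desired number of completions per request
    "temperature": 0.8,  # sampling must be stochastic for the n outputs to differ
}

body = json.dumps(payload)
# This body would be POSTed to the server, e.g.
# requests.post(f"{base_url}/v1/chat/completions", data=body),
# and the response's "choices" list would contain n entries.
```

Supporting `n` server-side allows the engine to batch the n samples over a single shared prompt prefix, which is the efficiency gain over issuing n separate requests.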
Related resources
No response
Additional context
No response