FlagEmbedding icon indicating copy to clipboard operation
FlagEmbedding copied to clipboard

Deploying reranker model on triton inference server

Open zeionara opened this issue 8 months ago • 0 comments

Hey, does it make sense to deploy the reranking model in triton inference server for efficiency? Or maybe there are other recommendations concerning reranking inference optimization?

Did anybody elaborate on that?

zeionara avatar May 05 '25 16:05 zeionara