wanghuihhh

Results 3 comments of wanghuihhh

Thank you! I try it and I find there is a large difference in using between python runtime and c++ runtime. If you could provided c++ runtime example, we will...

> Hi @wanghuihhh, would you be able to try enabling [ragged batching](https://github.com/triton-inference-server/server/blob/main/docs/user_guide/ragged_batching.md) for the input on the config? I have understand this blog. But I still don't success because the...

其实大家主要不是不满,而是一开始提及150ms,吊足了胃口,但是最后开出来的版本距离这个值差太多,并且也没有提及任何相关信息,即使这部分不打算开源,也应该说一下