Wang Yubo
Results
2
comments of
Wang Yubo
From my experiences, it seems the following combination works well - Build engine using trtllm 0.6.1 - Triton Server 23.11-trtllm-python-py3 to serve the engine
> Just to report back, `compute_transition_scores` is exactly what I needed. Thanks for the suggestion. One thing worth mentioning is that, when locating logprobs per token (rather than strings that...