chen lian
@nvpohanh could you please look at this?
> I think TRT should have good out-of-box performance for T5 now. Can you try to export it to ONNX and check the throughput? @zerollzeng Thanks for your answer. Does...
Thank you. > I think it's more convenient to export to onnx Yes, I think so. But unfortunately, I need a saved_model to deploy the service. I tried > Or using...
Thanks for your suggestion. I have some questions to discuss with you. Could I get your other contact details, such as email, work IM, or something else? It would be nice if...
> I no longer work in TF Sorry, my fault.
Never mind, I found a solution.
The original saved_model took 300 ms with batch_size=32 and sen_length=128, which is too slow for deployment. So I wanted to speed up T5 using TF-TRT. But when I convert the saved_model using...