Michael comments

Results 34 comments of


                                            Michael

Export model to ONNX

@vince62s i want to export pytorch model to onnx model like this (MT task using transformer,with no modifications to the original code): def main(opt): dummy_parser = configargparse.ArgumentParser(description='train.py') opts.model_opts(dummy_parser) dummy_opt =...

Export model to ONNX

@vince62s you said change the way attentions flow, currently a dictionary but this does not work, requires list or tuple (https://github.com/OpenNMT/OpenNMT-py/issues/638#issuecomment-434765232), but now you said but again given some code...

Export model to ONNX

@vince62s ONNX itself is native to pytorch 1.0 as the model output format, may the output be the format of ONNX ?

May i use triton-inference-server to run T5 model

[I do as README， but i met some errors as follows: Firstly, i start the server(the model path cannot be setted like --model-repository=/workspace/build/fastertransformer_backend/all_models/t5/fastertransformer/ or --model-repository=/workspace/build/fastertransformer_backend/all_models/t5/fastertransformer/1 , because the program will...

May i use triton-inference-server to run T5 model

After i mv the model_checkpoint_path to 2-gpu ![image](https://user-images.githubusercontent.com/5691554/161374992-37f68e55-7912-4575-a74d-b3c838345cfb.png) then i run the command:mpirun -n 1 --allow-run-as-root /opt/tritonserver/bin/tritonserver --model-repository =/workspace/build/fastertransformer_backend/all_models/t5 i got this error , it sames the file name is...

May i use triton-inference-server to run T5 model

I se t the model_checkpoint_path to 2-gpu ![image](https://user-images.githubusercontent.com/5691554/161485538-068dd845-c0e6-418e-adb0-e3e1a2312032.png) then i also get the error. ![image](https://user-images.githubusercontent.com/5691554/161485473-c9eaa00b-4e39-4d1c-afeb-b67351a3aa1c.png)

May i use triton-inference-server to run T5 model

![image](https://user-images.githubusercontent.com/5691554/161553080-42e3e1ce-7dda-478c-867c-21c864faa825.png) Thanx, it worked! but another question the cuda version must be 11.4 if i want to run t5 in triton ? ![image](https://user-images.githubusercontent.com/5691554/161553434-d7588747-06c4-44c8-afd9-a0a3cf1c4207.png)

May i use triton-inference-server to run T5 model

I set the CONTAINER_VERSION=20.10 it sames the github access errors in cmake file ![image](https://user-images.githubusercontent.com/5691554/162747332-b726f78b-ed54-4a1c-bd96-a5aeae35347f.png) or ![image](https://user-images.githubusercontent.com/5691554/162752730-51cf0f47-19dc-43b5-87ee-a93b578bb65e.png) Then i change the https to http in fastertransformer_backend/CMakeLists.txt, then i get this error:...

May i use triton-inference-server to run T5 model

I can build the Dockerfile successfully with default 21.08 CONTAINER_VERSION, then i met the cuda mismatch error. So i change the the CONTAINER_VERSION to 20.10 as you said and build...

May i use triton-inference-server to run T5 model

I try 21.07, it's the same error. ![image](https://user-images.githubusercontent.com/5691554/162880874-148c8c57-9b8c-4b50-9cb9-93f17da07b62.png) And i can find the 20.10 in the https://github.com/triton-inference-server/server/branches/all?page=5 Maybe it's the network error. Should i do some config about github in...