fangpings

Results 5 comments of fangpings

I have the same problem. We have an ensemble model which has preprocessing, inference and postprocessing. I observed that in the preprocessing phase, sometimes it will generate request whose batch_size...

For ensemble pipeline, the input is a web document. In the preprocessing model we tokenize the web document, but sometimes the number of tokens in the web document will exceed...

We are facing the same issues in our models. Any more updates on this? Also for the second issue where `/dev/shm` will not be cleaned after container restarts. If you...