zhangxin81

Results 3 issues of zhangxin81

Is there any fesature related to GPT-like models that can be applied to BERT-like models?

question
triaged

Is there a benchmark to compare with Tensorrt/fastertransformer/Tensorrt-llm In 【latency, Throughput】?