Patrick

Results 3 issues of Patrick

Is it normal to have higher latency than TGI with a low concurrency, such as 1 or 4?

使用没有指令得,原始文本微调模型