haoranlll

Results 1 issues of haoranlll

I want to use the mixtral 8X7B model for inference, but currently it only supports autoTP. How to add more support to enable it to use more parallelism (e.g. EP,...

enhancement