haoranlll
Results
1
issues of
haoranlll
I want to use the mixtral 8X7B model for inference, but currently it only supports autoTP. How to add more support to enable it to use more parallelism (e.g. EP,...
enhancement