DeepSpeed-MII
DeepSpeed-MII copied to clipboard
Can MII support quanted Llama2 of AWQ?