I tried to load a GPTQ version of Mixtral 8x7b and got an error, but a different one than posted here. I got: `config.py gptq quantization is not fully...`
@casper-hansen

> You need to use float16 or half for quantization.

I switched it to `torch.float16` in the config.json and my error changed to the one in https://github.com/vllm-project/vllm/issues/2251
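For anyone making the same change, a minimal sketch of editing config.json (the checkpoint directory name is a placeholder for wherever your quantized model lives):

```python
import json

# Hypothetical path to the quantized checkpoint's config.json
cfg_path = "Mixtral-8x7B-Instruct-v0.1-GPTQ/config.json"

with open(cfg_path) as f:
    cfg = json.load(f)

# Switch from bfloat16 to float16, which the quantized kernels expect
cfg["torch_dtype"] = "float16"

with open(cfg_path, "w") as f:
    json.dump(cfg, f, indent=2)
```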
I'll try doing that now
Yep! It seems like the latest vLLM has fixed this bug. Both GPTQ and AWQ are working for me now. Thanks for the help :)
ExLlamaV2 has overtaken ExLlama in quantization performance for most cases. I hope we can get it implemented in vLLM because it is also an excellent quantization technique. Benchmarks between...
I'm also having this issue after a fresh quantization of Mixtral 8x7b instruct. There is no issue when running directly with AutoAWQ across multiple GPUs. Only when using vLLM across...
I was able to get both GPTQ and AWQ working with tp=4. It took a long time to load the model in my case, but eventually it loaded and then...
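For anyone trying the same setup, a minimal sketch of the vLLM load (the checkpoint path is a placeholder; switch `quantization` to `"gptq"` for the GPTQ build):

```python
from vllm import LLM, SamplingParams

# Hypothetical local path to an AWQ-quantized Mixtral checkpoint
llm = LLM(
    model="./mixtral-8x7b-instruct-awq",
    quantization="awq",       # use "gptq" for a GPTQ checkpoint
    dtype="float16",
    tensor_parallel_size=4,   # tp=4 across 4 GPUs
)

outputs = llm.generate(["Hello, Mixtral!"], SamplingParams(max_tokens=32))
print(outputs[0].outputs[0].text)
```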
I used my own AWQ quantization. Try quantizing it yourself and maybe that will fix the problem.
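A minimal sketch of the AutoAWQ quantization flow, assuming the base Instruct checkpoint and typical 4-bit quant settings (adjust paths and config as needed):

```python
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "mistralai/Mixtral-8x7B-Instruct-v0.1"
quant_path = "mixtral-8x7b-instruct-awq"
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

# Load the unquantized model and tokenizer
model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# Quantize with the default calibration data, then save the AWQ checkpoint
model.quantize(tokenizer, quant_config=quant_config)
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```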
The first 7 consolidated.pth files of 70B-chat downloaded perfectly. The 8th failed with a 403, and now I can't download any models. This was the first download with my URL. Requesting a second download URL. Let's...
This error seems to have happened because c4 was updated with some `datasets` configuration options which aren't supported in older versions of `datasets`. To fix, upgrade `datasets` with `pip install...
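As a quick check after upgrading, the c4 calibration data that GPTQ tooling typically pulls should load cleanly — a sketch, assuming the current `allenai/c4` layout on the Hub:

```python
from datasets import load_dataset

# One shard of the English c4 split, commonly used for GPTQ calibration
calib = load_dataset(
    "allenai/c4",
    data_files={"train": "en/c4-train.00000-of-01024.json.gz"},
    split="train",
)
print(calib[0]["text"][:200])
```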