Tianhao Cheng
@lekurile @jeffra @HeyangQin Following https://github.com/microsoft/DeepSpeed/issues/2876 , I tried loading the model in FP16 and then setting dtype=torch.int8 in init_inference, but it still fails: ...
https://github.com/microsoft/DeepSpeed/issues/2865 mentions the same problem.
@HeyangQin Loading the BLOOM model from an FP16 checkpoint and then setting dtype=int8 in init_inference does not work :( Could you please answer this issue: https://github.com/microsoft/DeepSpeed/issues/2923 , and I found some people...
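For reference, a minimal sketch of the setup these reports describe, assuming DeepSpeed's public `init_inference` API; the helper name `int8_inference_kwargs` and the `mp_size` default are illustrative, not from the original thread:

```python
import torch

def int8_inference_kwargs(mp_size=1):
    """Arguments the reports above pass to deepspeed.init_inference:
    the checkpoint itself is loaded in FP16, and torch.int8 is only
    requested at init_inference time."""
    return dict(
        mp_size=mp_size,                  # tensor-parallel degree
        dtype=torch.int8,                 # quantize at inference init
        replace_with_kernel_inject=True,  # use DeepSpeed inference kernels
    )

def build_int8_engine(model):
    # Requires a CUDA-enabled DeepSpeed install; this call is the one
    # the linked issues report as failing for FP16-loaded checkpoints.
    import deepspeed
    return deepspeed.init_inference(model, **int8_inference_kwargs())
```

Whether this path works appears to depend on the DeepSpeed version and model; the issues linked above track the failure.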
I'm hitting the same problem.
+1, I hit the same problem here. I fine-tuned LLaVA with LoRA and want to run inference with ``` python -m llava.serve.cli --model-path /root/code/LLaVA/checkpoints/llava-v1.5-13b-lora --image-file /root/code/LLaVA/pic.png --model-base FlagAlpha/Llama2-Chinese-7b-Chat...
-200 is the image token; don't change that. Is the problem with the weights? Have you tried LoRA, and does it work well?
Nice idea! We'll develop this feature in the near future.
Got it, good idea! We'll think about how to resume an experiment.