Is it normal to take a long time (about 15 min) to generate an answer?

Open nicezic opened this issue 2 years ago • 0 comments

I'm using an RTX 3070 Ti with 8 GB of VRAM and a 32-core Ryzen CPU.

My parameters are:

model_name = "stabilityai/stablelm-tuned-alpha-7b" 
torch_dtype = "bfloat16" #@param ["float16", "bfloat16", "float"]
load_in_8bit = False #@param {type:"boolean"}
device_map = "auto"
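
For context on these settings, here is a rough back-of-the-envelope sketch of how much memory the weights alone would need for each dtype. The parameter count (about 7 billion, read off the "7b" in the model name) and the `int8` entry (standing in for `load_in_8bit = True`) are assumptions, and the estimate ignores activations and the KV cache, so treat the numbers as approximate.

```python
# Approximate bytes per parameter for each dtype option.
# "int8" is an assumed stand-in for load_in_8bit = True.
BYTES_PER_PARAM = {"float": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def weight_gib(num_params: float, dtype: str) -> float:
    """Estimate memory for the model weights only (no activations or KV cache)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


NUM_PARAMS = 7e9  # assumption: ~7B parameters, inferred from the model name
VRAM_GIB = 8.0    # from the post: 8 GB card, treated here as 8 GiB

for dtype in ("float", "bfloat16", "int8"):
    need = weight_gib(NUM_PARAMS, dtype)
    verdict = "fits" if need < VRAM_GIB else "does not fit"
    print(f"{dtype:>8}: ~{need:.1f} GiB of weights, {verdict} in {VRAM_GIB:.0f} GiB of VRAM")
```

If the weights do not fit in VRAM, `device_map = "auto"` will typically place the remaining layers in CPU memory, which could plausibly account for very slow generation.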

Is there a way to speed up generation?

May 17 '23 03:05 nicezic