StableLM
Is it normal for generating an answer to take a long time (about 15 min)?
I use an RTX 3070 Ti with 8 GB of VRAM and a 32-core Ryzen CPU.
My parameters are:
model_name = "stabilityai/stablelm-tuned-alpha-7b"
torch_dtype = "bfloat16" #@param ["float16", "bfloat16", "float"]
load_in_8bit = False #@param {type:"boolean"}
device_map = "auto"
Is there a way to speed up generation?
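For context, here is a rough back-of-the-envelope estimate of the memory the weights alone would need under these settings. It is only a sketch: the parameter count of about 7 billion is an assumption taken from the "7b" in the model name, the helper `weight_gib` is hypothetical, and activations and cache are ignored. If the weights do not fit in 8 GB of VRAM, `device_map = "auto"` places the remaining layers on the CPU (or disk), which may explain the slow generation.

```python
# Approximate bytes per parameter for each dtype option listed above,
# plus int8 for the load_in_8bit case.
BYTES_PER_PARAM = {"float16": 2, "bfloat16": 2, "float": 4, "int8": 1}

NUM_PARAMS = 7e9  # assumption: "7b" in the model name means ~7 billion parameters
VRAM_GIB = 8      # VRAM of the GPU described above


def weight_gib(num_params, dtype):
    """Return the approximate size of the model weights in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


for dtype in BYTES_PER_PARAM:
    size = weight_gib(NUM_PARAMS, dtype)
    verdict = "fits" if size < VRAM_GIB else "does not fit"
    print(f"{dtype:>8}: {size:5.1f} GiB of weights, {verdict} in {VRAM_GIB} GiB")
```

Under this assumption, the bfloat16 weights come to roughly 13 GiB, more than the card holds, while 8-bit weights would come to about 6.5 GiB. Setting `load_in_8bit = True` might therefore keep the whole model on the GPU, though it leaves little headroom for everything else.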