tl.dot updated https://github.com/openai/triton/issues/1098
I got this error when trying to run stable_diffusion inference with deepspeed TypeError: dot() got an unexpected keyword argument 'trans_b'
Issue found on triton with the same error : TypeError: dot() got an unexpected keyword argument 'trans_b'
solution suggested was to use qk += tl.dot(q, tl.trans(k)) , inplace of qk += tl.dot(q, k, trans_b=True)
@bmedishe have you tested this with a stable diffusion model? It causes DeepSpeed to hang for me.
@bmedishe have you tested this with a stable diffusion model? It causes DeepSpeed to hang for me.
@mrwyattii Hi , I see a hang on AMD GPU too, what version of triton should I be using to have working
qk += tl.dot(q, k, trans_b=True)
After this PR Merge triton-mlir branch - Complete rewrite of the backend from scratch qk += tl.dot(q, k, trans_b=True) throws TypeError
I am facing the same issue, any updates?
Same here. This fix hangs on Nvidia RTX 3090.
@mrwyattii Hi , I see a hang on AMD GPU too, what version of triton should I be using to have working
qk += tl.dot(q, k, trans_b=True)
https://github.com/microsoft/DeepSpeed/blob/d92539509b1e9a6178cfdb921d5080e76f690bce/requirements/requirements-sd.txt#L2