Simplify AttentionBlock

Open patil-suraj opened this issue 3 years ago • 1 comments

This simplifies AttentionBlock by always making q, k, and v 3D tensors, as we do in CrossAttention. This way we can also leverage sliced attention and xformers attention in this block.
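For illustration only, here is a minimal sketch of the idea, not the code in this PR. It is written with NumPy rather than PyTorch to stay dependency-light. The helper names (`heads_to_batch`, `attention`, `sliced_attention`) and the `(batch * heads, seq, dim_head)` layout are assumptions made for the example, and the q/k/v linear projections are skipped. Once q, k, and v share that 3D layout, slicing along the first axis is a simple loop:

```python
import numpy as np


def heads_to_batch(x, heads):
    # (batch, seq, heads * dim_head) -> (batch * heads, seq, dim_head)
    batch, seq, dim = x.shape
    x = x.reshape(batch, seq, heads, dim // heads)
    return x.transpose(0, 2, 1, 3).reshape(batch * heads, seq, dim // heads)


def softmax(x):
    # Numerically stable softmax over the last axis.
    x = x - x.max(axis=-1, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=-1, keepdims=True)


def attention(q, k, v):
    # Plain scaled dot-product attention on 3D tensors.
    scale = q.shape[-1] ** -0.5
    scores = q @ k.transpose(0, 2, 1) * scale
    return softmax(scores) @ v


def sliced_attention(q, k, v, slice_size):
    # Same result as attention(), computed a few (batch * heads) rows at a time
    # so that only one slice of the score matrix is materialized at once.
    out = np.empty((q.shape[0], q.shape[1], v.shape[2]), dtype=q.dtype)
    for start in range(0, q.shape[0], slice_size):
        end = start + slice_size
        out[start:end] = attention(q[start:end], k[start:end], v[start:end])
    return out


# Hypothetical feature map: batch=2, channels=8, height=width=4.
rng = np.random.default_rng(0)
feature_map = rng.standard_normal((2, 8, 4, 4))

# Flatten the spatial dims into a sequence: (batch, height * width, channels).
hidden = feature_map.reshape(2, 8, 16).transpose(0, 2, 1)

# Projections are omitted here, so q, k, and v are the same tensor.
q = k = v = heads_to_batch(hidden, heads=2)

full = attention(q, k, v)
sliced = sliced_attention(q, k, v, slice_size=3)
```

In this sketch the sliced and unsliced paths agree up to floating-point tolerance, which is why a shared 3D layout lets the block reuse the same slicing logic as CrossAttention.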

Nov 30 '22 15:11 patil-suraj

The documentation is not available anymore as the PR was closed or merged.

Nov 30 '22 15:11 HuggingFaceDocBuilderDev

I assume all the model slow tests are passing? Merging now and will check afterwards.

Dec 01 '22 15:12 patrickvonplaten