accelerated-scan icon indicating copy to clipboard operation
accelerated-scan copied to clipboard

gates (A matrix) with a shape of batch * dim * dim * seqlen

Open WeihanLikk opened this issue 1 year ago • 4 comments

Thank you for your excellent work!

I was wondering if it’s possible to modify your code to handle a state-space model case where the gates (A matrix) have a more general shape of batch × dim × dim × seqlen?

WeihanLikk avatar Dec 10 '24 05:12 WeihanLikk

Thanks @WeihanLikk! How big is the dim you'd like to use?

proger avatar Dec 18 '24 03:12 proger

Thanks for your reply! In my case, the maximum dim is around 100.

WeihanLikk avatar Dec 18 '24 03:12 WeihanLikk

That sounds reasonable! I would make an entirely new kernel though, that could be simpler because the thread block layout would be different.

proger avatar Dec 22 '24 02:12 proger

Thanks!

WeihanLikk avatar Dec 22 '24 02:12 WeihanLikk