DeepSeek Native Sparse Attention (NSA) Kernel
🚀 The feature, motivation and pitch
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention https://arxiv.org/abs/2502.11089
Potentially useful python reference https://github.com/dhcode-cpp/NSA-pytorch
Alternatives
No response
Additional context
No response
Would like to take this up!
I'm interested in it. Is there anything I can help?
Hi, I want to join this work as well. Is there any ongoing progress? Thanks!
Any updates / active branches on this? @Tcc0403 @shivam15s
@mRSun15 @AndreSlavescu I'm not working on it. Feel free to pick it up!
@mRSun15 @AndreSlavescu I'm not working on it. Feel free to pick it up!
@mRSun15 Are you working on this / Do you want to pick this up? If not, I can pick it up once I have more cycles @Tcc0403