iseg.code
Code details
Thanks for your elegant work!
I have a small question: it seems that you filter the different heads of attn_maps via a predefined index order.
Is this a carefully tuned trick, or could you kindly explain why this specific order was chosen?