BitNet icon indicating copy to clipboard operation
BitNet copied to clipboard

why arm don't support tl2 kernel

Open nigelzzz opened this issue 9 months ago • 3 comments

Hi , can i know why arm don't support tl2 kernel? i guess the simd instruction no support?
Image

nigelzzz avatar Apr 24 '25 03:04 nigelzzz

because TL2 optimized for model file size, to reduce memory io, while on ARM cpu the memory io is not a bottleneck for such 2B models.

sd983527 avatar Apr 24 '25 04:04 sd983527

Thanks for your response!! So using I2_s and TL1 can decrease lantency, TL2 can't improve it? if the simd lane can be 32 or 64, is it helpful?

nigelzzz avatar Apr 24 '25 08:04 nigelzzz

TL2 is faster than the others for larger b1.58 models (e.g. 70B or 100B) as from our experiment, however currently we do not have a checkpoint with that size.

sd983527 avatar May 07 '25 09:05 sd983527