nigelzzz

Results 4 comments of nigelzzz

Hi @QingtaoLi1 , can we use `test-backend-ops` to test flops, i would like to show the result from paper image 6, image 7. thanks and i checked the tmac repo,...

Can I known if we want to use llama3.2 1b q4 , do we need to recompile kernel?

Thanks for your response!! So using `I2_s` and `TL1` can decrease lantency, `TL2` can't improve it? if the simd lane can be 32 or 64, is it helpful?

Hi I have same issue on rpi5. `prompt eval rate: 27.45 tokens/s vs 2.27 tokens per second)` ollama.cpp total duration: 57.016716541s load duration: 35.149005ms prompt eval count: 948 token(s) prompt...