Qingyu Chen

Results 3 comments of Qingyu Chen

@RunningLeon 很感谢你的工作!想问下internvl-1.5的internlm2-20b 网络跟普通internlm2-20b有什么区别吗,我用了PR里的转换脚本转出来之后都是乱码。

> Which GPU? It should work with Hopper and Blackwell (B200). Ampere and Blackwell Geforce need a bit of fixing. b200

> Oh right varlen on B200 doesn't work yet (need some minor fix). You can use `flash_attn_func` KK, thanks for the reply, will stay tuned on this thread!