zer0py2c

Results 14 comments of zer0py2c

Is there any document on how to use it?

> > Is there any document on how to use it? > > This work is not ready, if you want to develop this together, follow this, > > 1....

@dkasa 请问你那部署后能支持openai接口访问Qwen3-Embedding吗?需不需要改动什么?

> Since we don't have 300I device, 300I device is not tested. But, huawei document shows that almost all operators supported on 800t are also supported on 300 inference series....

Now, I can tell everyone about the testing of the Atlas 300I Duo device: ### My environment - NPU Driver: 24.1.RC2 - CANN version: 8.0.RC2 - model used: Qwen2-7B-Instruct ###...

maybe this operator is not supported :( ![算子](https://github.com/user-attachments/assets/0beb7c31-fd2b-490f-bb80-fae89789a02a)

> We have prepared a plan on 300I Duo, could you help us test it? I'm honored! I will try and give feedback soon.

> > We have prepared a plan on 300I Duo, could you help us test it? @zer0py2c @wangyuanxiong-hub @qiling1345 > > You can try running it with https://github.com/yao-fengchen/dlinfer/tree/fix_attn and https://github.com/DeepLink-org/lmdeploy/tree/fix_attn...

@wangyuanxiong-hub 你的 NPU 驱动版本是多少?我看了下官网,我用的 24.1.RC2 版本应该跟 CANN 8.0.RC3.alpha001 不配套。 ![CANN8 0 RC3 alpha001对应驱动版本](https://github.com/user-attachments/assets/13abae29-ea45-4117-a5c5-2d6efd3276ab)