zer0py2c
zer0py2c
Is there any document on how to use it?
> > Is there any document on how to use it? > > This work is not ready, if you want to develop this together, follow this, > > 1....
@dkasa 请问你那部署后能支持openai接口访问Qwen3-Embedding吗?需不需要改动什么?
> Since we don't have 300I device, 300I device is not tested. But, huawei document shows that almost all operators supported on 800t are also supported on 300 inference series....
Now, I can tell everyone about the testing of the Atlas 300I Duo device: ### My environment - NPU Driver: 24.1.RC2 - CANN version: 8.0.RC2 - model used: Qwen2-7B-Instruct ###...
maybe this operator is not supported :( 
> We have prepared a plan on 300I Duo, could you help us test it? I'm honored! I will try and give feedback soon.
> > We have prepared a plan on 300I Duo, could you help us test it? @zer0py2c @wangyuanxiong-hub @qiling1345 > > You can try running it with https://github.com/yao-fengchen/dlinfer/tree/fix_attn and https://github.com/DeepLink-org/lmdeploy/tree/fix_attn...
@wangyuanxiong-hub 你的 NPU 驱动版本是多少?我看了下官网,我用的 24.1.RC2 版本应该跟 CANN 8.0.RC3.alpha001 不配套。 