dash-infer
DashInfer is a native LLM inference engine that aims to deliver industry-leading performance across a range of hardware architectures, including CUDA, x86, and ARMv9.
After installing dashinfer via `pip install`, running inference raises an error.
I would like to know the minimum hardware requirements for running 7B models. With my current configuration I can only run the 1.5B model, with an average throughput of 8.1...
Same as the title.
When executing `pip install dashinfer`, dependencies such as torch, pandas, and tabulate are not installed automatically, because `install_requires` in python/setup.py is not managed correctly.
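A minimal sketch of how the missing runtime dependencies could be declared in python/setup.py so that `pip install dashinfer` pulls them in automatically. The package names come from the issue report; the version bounds and the `setup()` metadata shown here are illustrative assumptions, not the project's actual configuration.

```python
# Hypothetical python/setup.py fragment: declaring the runtime
# dependencies reported missing in the issue. Version bounds are
# placeholders and should match what the engine is tested against.
from setuptools import setup, find_packages

install_requires = [
    "torch",      # required by the Python inference examples
    "pandas",     # used for tabular result handling
    "tabulate",   # used for pretty-printing benchmark tables
]

setup(
    name="dashinfer",
    packages=find_packages(),
    # pip resolves and installs these alongside the package itself
    install_requires=install_requires,
)
```

With this in place, `pip install dashinfer` would resolve and install the listed packages in one step instead of requiring a separate manual `pip install torch pandas tabulate`.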
While integrating dashinfer into FastChat, ~~when the prompt token count exceeds engine_max_length~~ when generation_config.max_length < prompt token count < engine_config.engine_max_length, the program cannot recover.
Here is an example config file from examples/python/model_config/config_qwen_v10_7b.json:

```json
{
    "model_name": "Qwen-7B-Chat",
    "model_type": "Qwen_v10",
    "model_path": "~/dashinfer_models/",
    "data_type": "float32",
    "device_type": "CPU",
    "device_ids": [0],
    "multinode_mode": false,
    "engine_config": {
        "engine_max_length": ...
```
