LIUKAI0815 issues

Results 19 issues of


                                            LIUKAI0815

KeyError: 'model.layers.0.self_attn.q_proj.qweight'

python3 convert_checkpoint.py --model_dir /workspace/lk/model/Qwen/14B --output_dir ./tllm_checkpoint_1gpu_gptq --dtype float16 --use_weight_only --weight_only_precision int4_gptq --per_group [TensorRT-LLM] TensorRT-LLM version: 0.10.0.dev2024042300 0.10.0.dev2024042300 Loading checkpoint shards: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:02

triaged

NotImplementedError: Cannot copy out of meta tensor; no data

python convert_checkpoint.py --model_dir /workspace/lk/model/Qwen/14B/ --output_dir ./tllm_checkpoint_1gpu_fp16_wq --dtype float16 --use_weight_only --weight_only_precision int8 [TensorRT-LLM] TensorRT-LLM version: 0.10.0.dev2024042300 0.10.0.dev2024042300 Loading checkpoint shards: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:02

triaged

Segmentation fault (core dumped)

[TensorRT-LLM] TensorRT-LLM version: 0.10.0.dev2024050700 [TensorRT-LLM][INFO] Engine version 0.10.0.dev2024050700 found in the config file, assuming engine(s) built by new builder API. [TensorRT-LLM][WARNING] [json.exception.out_of_range.403] key 'cross_attention' not found [TensorRT-LLM][WARNING] Optional value for...

triaged

neeed more info

LIUKAI0815

KeyError: 'model.layers.0.self_attn.q_proj.qweight'

NotImplementedError: Cannot copy out of meta tensor; no data

Segmentation fault (core dumped)

swift export 指定 --tensor_parallel_size --gpu_memory_utilization 感觉不管用

有没有零样本训练的技术细节，要怎么在自己的数据集训练

swift可以训练量化之后的模型吗，比如modelscope里面的awq或者gptq量化之后的模型

不支持--model_type gemma2-2b-instruct

使用这个 sh scripts/run_assistant_server.sh 部署模型之后，会不会比VLLM速度慢很多

BAAI/bge-multilingual-gemma2，请问可以将这个模型做int8或者int4量化之后用吗？

请问embedding模型和rerank模型怎么finetune，用的FlagEmbedding的吗？