LiMa-cas
Thanks in advance. Hi, I have a question about the model: in your script I see you use the FP16 model instead of the INT4 model. Could QLoRA use the quantized...
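For context, a minimal QLoRA-style loading sketch, assuming the usual transformers + peft + bitsandbytes stack (model name and LoRA hyperparameters are illustrative). QLoRA typically starts from FP16 weights and quantizes them to 4-bit NF4 on load, which may be why the script uses the FP16 checkpoint rather than a pre-quantized INT4 one:

```python
# Sketch: QLoRA loads FP16 weights and quantizes them on the fly.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize the FP16 weights on load
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B-Instruct",  # FP16 checkpoint (illustrative)
    quantization_config=bnb_config,
    device_map="auto",
)
lora = LoraConfig(r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"])
model = get_peft_model(model, lora)
```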
Hi, which transformers version is required? I used the latest, 4.46.2, but got this error: `TypeError: LlamaRotaryEmbedding.forward() got an unexpected keyword argument 'seq_len'`. Maybe it needs an older version of...
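A quick way to check whether an installed transformers version still exposes the old `seq_len` argument (a sketch; newer releases removed it, and the exact version cutoff is not confirmed here):

```python
# Check whether this transformers version still accepts `seq_len`
# in LlamaRotaryEmbedding.forward (the argument was removed in newer releases).
import inspect
import transformers
from transformers.models.llama.modeling_llama import LlamaRotaryEmbedding

print(transformers.__version__)
print("seq_len" in inspect.signature(LlamaRotaryEmbedding.forward).parameters)
```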
Hi, but this is what I used for fine-tuning: `NCCL_P2P_DISABLE=1 NCCL_IB_DISABLE=1 CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --nproc_per_node 4 --nnodes 1 --node_rank 0 --master_addr localhost --master_port 6666 ../finetune_llama3.py --model_name_or_path "/extra_data/mali36/GAOTONG/AWQMODEL/llama3-8B-instruct-awq" --data_path "../data/Belle_sampled_qwen.json" --bf16 True --output_dir "../output/llama3_8B_instruct_awq_qlora" --num_train_epochs 100 --per_device_train_batch_size 1 --per_device_eval_batch_size 1...
Hi, I removed the --load_in_4bit flag above, and then ran into a new problem:
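For reference, a minimal sketch of loading an already-quantized AWQ checkpoint and attaching LoRA adapters; since the weights are pre-quantized, passing the bitsandbytes --load_in_4bit option on top of them would conflict. The path is the one from the command above; LoRA hyperparameters are illustrative, and loading AWQ checkpoints assumes the autoawq package is installed:

```python
# Sketch: attach LoRA to an already-quantized AWQ checkpoint.
# No load_in_4bit here: the weights are pre-quantized.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_path = "/extra_data/mali36/GAOTONG/AWQMODEL/llama3-8B-instruct-awq"
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, device_map="auto")

lora = LoraConfig(r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"])
model = get_peft_model(model, lora)
model.print_trainable_parameters()
```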
Thanks very much. I might not have expressed myself clearly last time. 1. Why is block fine-tuning worse than global fine-tuning? 2. The inference time, rather than the quantization time, is much...
Thanks a lot!
Hi, when I use PV-tuning for an AWQ model, first I need to ... but I encountered a bug: ... Could you help me?
Hi, the problem was solved by using an 80GB GPU.
Hi, another question: if I want to quantize to 4 bits, are the following parameters right (--scale_nbits=4)? But during the process, current_avg_bits is 2.8: python main.py $MODEL_PATH $DATASET_PATH \ --nsamples=1024 \...
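For what it's worth, a back-of-envelope calculator for the average bits per weight in AQLM-style quantization. The dominant term is the codes, so raising --scale_nbits alone will not get the width to 4 bits; num_codebooks, nbits_per_codebook, and the group sizes matter more. The exact accounting of scale and codebook overhead below is an assumption inferred from the command-line parameters, not the repository's own formula:

```python
# Back-of-envelope average bits per weight for AQLM-style quantization.
# Assumption: each group of (in_group_size * out_group_size) weights stores
# num_codebooks codes of nbits_per_codebook bits; scale storage is approximate.
def avg_bits_per_weight(num_codebooks, nbits_per_codebook,
                        in_group_size, out_group_size=1, scale_nbits=0):
    weights_per_group = in_group_size * out_group_size
    code_bits = num_codebooks * nbits_per_codebook / weights_per_group
    scale_bits = scale_nbits / weights_per_group  # assumed: one scale per group
    return code_bits + scale_bits

# e.g. 1 codebook x 16 bits over groups of 8 weights ~= 2 bits/weight;
# adding 4-bit group scales only nudges it upward, not to 4 bits.
print(avg_bits_per_weight(1, 16, 8, scale_nbits=4))  # -> 2.5
```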
Hi, thanks a lot. There are two scripts called "finetune.py". By "global fine-tuning", do you mean the one called "finetune.py" as follows? But main.py above uses the "finetune.py" under src:...