Xin Yao
Great job. However, when I train to the 3rd epoch on the QNLI task, I encounter the following problem, which does not occur with the CoLA or SQuAD tasks....
OOMKilled with the default script:
```
set -x
export BS=${BS:-16}
export MEMCAP=${MEMCAP:-0}
export GPUNUM=${GPUNUM:-1}
export MODLE_PATH="facebook/opt-${MODEL}"
model_name_or_path=./opt6.7b
# HF_DATASETS_OFFLINE=1 TRANSFORMERS_OFFLINE=1
torchrun \
  --nproc_per_node ${GPUNUM} \
  --master_port 19198 \
  train_gemini_opt.py \
  --mem_cap ${MEMCAP} \
...
```
### Script:
```
set -x
export BS=${BS:-1}
export MEMCAP=${MEMCAP:-40}
export MODEL=${MODEL:-"30b"}
export GPUNUM=${GPUNUM:-8}
mkdir -p ./logs
export MODLE_PATH="facebook/opt-${MODEL}"
torchrun \
  --nproc_per_node ${GPUNUM} \
  --master_port 19198 \
  train_gemini_opt.py \
  --mem_cap ${MEMCAP}
...
```
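For context, a cap like `MEMCAP=40` is usually enforced inside the training script by limiting the CUDA allocator for the process. A minimal sketch of that idea, assuming the cap is given in gigabytes and that `0` means no cap (how `train_gemini_opt.py` actually applies `--mem_cap` may differ):
```python
import torch

def apply_mem_cap(mem_cap_gb: float, device: int = 0) -> None:
    """Cap this process's CUDA allocations at mem_cap_gb gigabytes (0 = no cap)."""
    if mem_cap_gb <= 0:
        return
    total_bytes = torch.cuda.get_device_properties(device).total_memory
    fraction = min(1.0, mem_cap_gb * 1024 ** 3 / total_bytes)
    # Limit the caching allocator so allocations beyond the cap fail early with OOM
    torch.cuda.set_per_process_memory_fraction(fraction, device)
```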
Amazing work! May I ask how to solve the following problem?
Cool job, but I am getting the following error after running `convert-gptq-to-ggml.py`.
No output after running `.\quantize.exe .\models\7B\ggml-model-f16.bin \models\7B\ggml-model-q4_0.bin 2`. Asking for help, thanks!
Is there a chat group for discussion?
As above; I'd be very grateful if someone could add me to it!
The code is:
```python
from transformers import LlamaForCausalLM
from peft import prepare_model_for_int8_training, get_peft_model

# Load in 8-bit, prepare for int8 training, then attach the PEFT adapters
model = LlamaForCausalLM.from_pretrained(path, load_in_8bit=True, device_map="auto")
model = prepare_model_for_int8_training(model)
model = get_peft_model(model, peft_config)
model.print_trainable_parameters()
```
Platform:
```
torch 1.12 (cu113)
python 3.8.16
peft 0.3.0.dev0
transformers 4.29.0.dev0
```
And the model works well...
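For reference, `peft_config` is not shown in the report. A minimal LoRA config matching this int8 setup might look like the sketch below; all hyperparameter values here are assumptions, not the reporter's actual settings:
```python
from peft import LoraConfig, TaskType

# Hypothetical config; the original report does not include the actual values.
peft_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                  # assumed LoRA rank
    lora_alpha=16,                        # assumed scaling factor
    target_modules=["q_proj", "v_proj"],  # common LLaMA attention projections
    lora_dropout=0.05,
    bias="none",
)
```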