guihonghao comments

Results 46 comments of


                                            guihonghao

Cuda OOM when fine-tuning 13B

> Thank you, fixing the version of `bitsandbytes` to 0.37.2 resolved the issue for me. ([TimDettmers/bitsandbytes#324](https://github.com/TimDettmers/bitsandbytes/issues/324)) > > ``` > bitsandbytes==0.37.2 > ``` Yes, I meet an OOM when fine-tuning...

The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.

你好，环境版本是下面这些。如果8bit量化存在问题，可以尝试使用4bit量化。 ``` accelerate==0.21.0 transformers==4.33.0 bitsandbytes==0.39.1 ```

The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.

``` quantization_config=BitsAndBytesConfig( load_in_4bit=True, llm_int8_threshold=6.0, llm_int8_has_fp16_weight=False, bnb_4bit_compute_dtype=torch.bfloat16, bnb_4bit_use_double_quant=True, bnb_4bit_quant_type="nf4", ) model = AutoModelForCausalLM.from_pretrained( model_path, config=config, device_map="auto", quantization_config=quantization_config, torch_dtype=torch.bfloat16, trust_remote_code=True, ) ``` 设置并传入quantization_config参数

baichuan2-13b-iepile-lora模型预测报错zjunlp/baichuan2-13b-iepile-lora does not appear to have a file named config.json.

你好，model_name_or_path 是底座模型即https://huggingface.co/baichuan-inc/Baichuan2-13B-Chat。另外，我们新发布的信息抽取大模型OneKE在信息抽取方面效果更佳 https://github.com/zjunlp/DeepKE/blob/main/example/llm/OneKE.md

baichuan2-13b-iepile-lora模型预测报错zjunlp/baichuan2-13b-iepile-lora does not appear to have a file named config.json.

你好，我们采用的环境是 ``` accelerate==0.21.0 transformers==4.33.0 bitsandbytes==0.39.1 ```

baichuan2-13b-iepile-lora模型预测报错zjunlp/baichuan2-13b-iepile-lora does not appear to have a file named config.json.

1、参考 [data](https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/data) 目录下各个任务中sample.json文件格式组织测试文件，schema.json组织schema信息。 2、参考 https://github.com/zjunlp/DeepKE/blob/main/example/llm/InstructKGC/README_CN.md#23%E6%B5%8B%E8%AF%95%E6%95%B0%E6%8D%AE%E8%BD%AC%E6%8D%A2 步骤，按照下面的代码转换sample.json文件为模型可输入的文件test.json ```python python ie2instruction/convert_func.py \ --src_path data/NER/sample.json \ --tgt_path data/NER/test.json \ --schema_path data/NER/schema.json \ --language zh \ --task NER \ --split_num 6 \ --split test...

guihonghao

Cuda OOM when fine-tuning 13B

The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.

The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.

baichuan2-13b-iepile-lora模型预测报错zjunlp/baichuan2-13b-iepile-lora does not appear to have a file named config.json.

baichuan2-13b-iepile-lora模型预测报错zjunlp/baichuan2-13b-iepile-lora does not appear to have a file named config.json.

baichuan2-13b-iepile-lora模型预测报错zjunlp/baichuan2-13b-iepile-lora does not appear to have a file named config.json.

baichuan2-13b-iepile-lora模型预测报错zjunlp/baichuan2-13b-iepile-lora does not appear to have a file named config.json.

baichuan2-13b-iepile-lora模型预测报错zjunlp/baichuan2-13b-iepile-lora does not appear to have a file named config.json.

您好，我想请问一下，如果我想抽取完整的三元组，有没有对应的prompt模版？我只看到实体、事件等单独的抽取方式

您好，我想请问一下，如果我想抽取完整的三元组，有没有对应的prompt模版？我只看到实体、事件等单独的抽取方式