peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
```python
import torch
from transformers import AutoModelForSeq2SeqLM, T5Tokenizer
from peft import get_peft_config, get_peft_model, TaskType, PrefixTuningConfig, PeftModelForSeq2SeqLM, PeftModel

model_name_or_path = "t5-small"
tokenizer_name_or_path = "t5-small"

model = AutoModelForSeq2SeqLM.from_pretrained(model_name_or_path)
tokenizer = T5Tokenizer.from_pretrained(tokenizer_name_or_path)

peft_config = PrefixTuningConfig(
    task_type=TaskType.SEQ_2_SEQ_LM,
    inference_mode=False,
    num_virtual_tokens=20,
)
model = get_peft_model(model, peft_config)
```
Hello, I am trying to finetune GPT-J for text generation by adapting [this notebook](https://colab.research.google.com/drive/1jCkpikz0J2o20FBQmYmAGdiKmJGOMo-o?usp=sharing). However, when I run `trainer.train()` I get a CUDA error that states the following: `RuntimeError:...
Also added links to datasets and models, plus enhanced config render with yaml command
# Feature request

We should leverage `trl` (https://github.com/lvwerra/trl), the recent library from Hugging Face for RLHF, to apply PPO using `peft` and LoRA. I think `peft` should just work...
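A minimal sketch of the `peft` side of this idea, assuming a LoRA-wrapped causal LM (the model name and hyperparameters below are illustrative); a PPO loop from `trl` (not shown) would then only ever update the small set of trainable LoRA parameters while the frozen base weights stay untouched:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, TaskType, get_peft_model

# Hypothetical base policy model; any causal LM supported by peft would do.
model_name = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
base_model = AutoModelForCausalLM.from_pretrained(model_name)

# Wrap the policy with LoRA so that PPO only trains the low-rank adapter weights.
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,
    lora_alpha=32,
    lora_dropout=0.1,
    inference_mode=False,
)
policy = get_peft_model(base_model, lora_config)
policy.print_trainable_parameters()  # only the LoRA matrices require grad
```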
Thank you very much for sharing this library; it is going to be very useful for fine-tuning big models. It would be cool if the [Donut](https://huggingface.co/docs/transformers/model_doc/donut) model were supported. This...
I think right now, the dtype of the prompt embeddings and the model are tied together, since the weights are copied. It would be nice to have a different dtype for...
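For illustration, a rough sketch of the kind of workaround this feature would avoid, assuming the prompt-encoder parameter names contain `prompt` (an assumption about the current module naming): load the frozen base in fp16, then cast only the trainable prompt weights back to fp32.

```python
import torch
from transformers import AutoModelForSeq2SeqLM
from peft import PrefixTuningConfig, TaskType, get_peft_model

# Base model in half precision; the copied prompt weights inherit this dtype.
base = AutoModelForSeq2SeqLM.from_pretrained("t5-small", torch_dtype=torch.float16)
peft_config = PrefixTuningConfig(
    task_type=TaskType.SEQ_2_SEQ_LM, inference_mode=False, num_virtual_tokens=20
)
model = get_peft_model(base, peft_config)

# Assumed workaround: cast only the trainable prompt parameters to fp32,
# keeping the frozen fp16 base untouched.
for name, param in model.named_parameters():
    if param.requires_grad and "prompt" in name:
        param.data = param.data.float()
```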
closes: https://github.com/huggingface/peft/issues/62
T-Few is a PEFT method for few-shot learning that is currently the SOTA on many NLP benchmarks. It uses a nifty technique called (IA)^3 to update a small number of...
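For anyone curious about the core mechanism, here is a toy sketch (not the actual T-Few implementation) of what (IA)^3 does to a single linear projection: the pretrained weight stays frozen, and only a per-feature rescaling vector is learned.

```python
import torch
import torch.nn as nn

class IA3Linear(nn.Module):
    """A frozen linear layer whose output is rescaled element-wise by a
    learned vector, illustrating the core (IA)^3 idea (names are illustrative)."""

    def __init__(self, base_linear: nn.Linear):
        super().__init__()
        self.base = base_linear
        for p in self.base.parameters():
            p.requires_grad = False  # the pretrained weights stay frozen
        # One learned scale per output feature, initialized to 1 (identity).
        self.ia3_scale = nn.Parameter(torch.ones(base_linear.out_features))

    def forward(self, x):
        return self.base(x) * self.ia3_scale  # element-wise rescaling

# Only the tiny rescaling vector is trainable.
layer = IA3Linear(nn.Linear(768, 768))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # 768
```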
Changes made by this PR can be summarized as follows:
- Set the `cache` and `cache-dependency-path` arguments to enable caching and speed up CI times.
Why is this happening?

```python
batch = tokenizer("Two things are infinite: ", return_tensors="pt")

with torch.cuda.amp.autocast():
    output_tokens = model.generate(**batch, max_new_tokens=50)

print("\n\n", tokenizer.decode(output_tokens[0], skip_special_tokens=True))
```

It gives the following error: **AttributeError: 'NoneType' object...