peft

Add support for T-Few

Open lewtun opened this issue 3 years ago • 2 comments

T-Few is a PEFT method for few-shot learning that is currently the SOTA on many NLP benchmarks. It uses a nifty technique called (IA)^3 to update a small number of parameters during training and would be an impactful method to include IMO.
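For readers unfamiliar with (IA)^3, here is a rough sketch of the core idea as I understand it from the paper: the pretrained weights stay frozen, and a small learned vector rescales activations element-wise (in the paper, the attention keys and values and the intermediate feed-forward activations). The function and variable names below are made up for illustration and are not from the T-Few codebase or from peft; plain Python lists stand in for tensors.

```python
# Minimal sketch of the (IA)^3 idea, not the T-Few or peft implementation.
# All names here are hypothetical and chosen for illustration only.

def linear(x, weight):
    """Frozen linear layer; `weight` has shape (out_features, in_features)."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]


def ia3_linear(x, weight, scale):
    """Apply the frozen layer, then rescale each output feature by `scale`."""
    return [s * y for s, y in zip(scale, linear(x, weight))]


frozen_weight = [[1.0, 2.0], [0.5, -1.0], [0.0, 3.0]]  # 3x2, never updated
x = [1.0, 2.0]

# Initialising the scaling vector to ones leaves the frozen layer unchanged.
identity_scale = [1.0, 1.0, 1.0]
unchanged = ia3_linear(x, frozen_weight, identity_scale)

# During training only the scaling vector would be updated; these values
# are arbitrary stand-ins for a trained vector.
trained_scale = [2.0, 1.0, 0.5]
rescaled = ia3_linear(x, frozen_weight, trained_scale)

# One trainable value per output feature versus the full frozen matrix.
num_trainable = len(trained_scale)
num_frozen = sum(len(row) for row in frozen_weight)
print(rescaled, num_trainable, num_frozen)
```

Even in this toy case the trainable vector has 3 entries against 6 frozen weights; for real model dimensions the gap is far larger, which is where the parameter efficiency comes from.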

Although research code exists, it is tightly bound to the paper and doesn't run easily on hardware other than an 80GB A100. The peft library could help make this work more accessible to industry practitioners (where few-shot is actually valuable).

cc @craffel

Paper: https://arxiv.org/abs/2205.05638
GitHub: https://github.com/r-three/t-few

lewtun, Jan 26 '23 09:01

(IA)^3 is already in the backlog: https://github.com/huggingface/peft#backlog

sayakpaul, Jan 27 '23 04:01

(IA)^3 is already implemented here: https://github.com/adapter-hub/adapter-transformers

I will give it a try

savasy, Mar 16 '23 15:03

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

github-actions[bot], May 05 '23 15:05

Any updates on this? Is there a timeline for when this would be supported, or have the team's priorities changed?

patricebechard, May 18 '23 18:05

Quick update: PR #578 implements this and is quite advanced, so it will probably land soon.

BenjaminBossan, Jul 10 '23 09:07