starcoder icon indicating copy to clipboard operation
starcoder copied to clipboard

Is it possible to fine-tune the model to obtain better results in Swift?

Open drewparo opened this issue 2 years ago • 1 comments

I would like to train the model specifically on swift documents that are not the the classical chat format ( instruction/input/output). Can I use the script in starcoder/finetune/finetune.py? In which form I should put the data?

drewparo avatar May 17 '23 15:05 drewparo

The fine-tuning script, i.e. finetune.py is designed to fine-tune Starcoder to map an input text to an output text . If you have a dataset which follows that template (or if you can modify a dataset in order to have that format), you can use the provided code to perform your fine-tuning without any further issue.

ArmelRandy avatar May 30 '23 10:05 ArmelRandy

If you want a similar data preparation to the pre-training you can use this dataset template which packs sequences of constant length.

loubnabnl avatar Jun 13 '23 08:06 loubnabnl