starcoder icon indicating copy to clipboard operation
starcoder copied to clipboard

Question about training data.

Open suoych opened this issue 2 years ago • 1 comments

Hi, thanks for sharing the great work! May I ask that where you get the PDDL(Planning Domain Definition Language) data? I run the demo on huggingface and found that starcoder has the ability to write the pddl code. However, I did not find pddl language data in language list of The Stack dataset. Could you shed light on how to acquire pddl data? Thank you so much.

suoych avatar Jul 02 '23 12:07 suoych

Hi. StarCoder was not explicitly trained on PDDL. However, the model could have encountered its syntax in the markdown/HTML files that are part of the training dataset.

ArmelRandy avatar Jul 11 '23 14:07 ArmelRandy