AutoAudit icon indicating copy to clipboard operation
AutoAudit copied to clipboard

为什么不使用增训方式构造大模型

Open zt1112 opened this issue 2 years ago • 1 comments

很棒的工作,有两个疑惑希望作者帮助解答下:

1、类似的行业大模型会采用先增训再用指令数据集SFT的方案,请教下这里为什么考虑直接使用SFT呢? 2、SFT方案对安全领域的知识扩充是否足够,不知道作者有没有这方面的实验,多谢

zt1112 avatar Oct 10 '23 03:10 zt1112

Apologies for the delayed response. Due to some matters in 2023, I was unable to reply in a timely manner, and I apologize once again. Regarding your questions:

Due to our lack of experience during the initial exploration, we did not consider data augmentation, which would undoubtedly enhance the diversity of the dataset.

Recently, there has been some work on data augmentation in the cybersecurity field as well. IBM’s CyberPal mentioned specific details on this topic (unfortunately, the article is closed-source, and the code and dataset were not published). We are working on reproducing it and plan to release our dataset and corresponding code. We look forward to your continued interest in our work.

ddzipp avatar Nov 13 '24 16:11 ddzipp