[Question] Does PKU-Alignment/Align-DS-V utilize the LLF technique?

Open shuaijiang opened this issue 1 year ago • 1 comments

Required prerequisites

[x] I have read the documentation https://align-anything.readthedocs.io.
[x] I have searched the Issue Tracker and Discussions that this hasn't already been reported. (+1 or comment there if it has.)
[ ] Consider asking first in a Discussion.

Questions

Has the technical report for Align-DS-V not been released yet? I'm interested in knowing whether the model utilizes the LLF technique.

Feb 10 '25 02:02 shuaijiang

This repository serves as the official implementation of the paper Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback. Beyond LLF itself, align-anything (this repo) can be used for post-training alignment across various modalities. We provide the relevant code and implementations for this purpose.

Considering resource and time constraints, the currently open-sourced align-ds-v version does not yet incorporate LLF technology. However, our paper has already demonstrated the effectiveness of LLF on the TI2T modality, where employing just 25% of LLF achieves the same effect as 100% binary feedback.

The align-ds-v model series is undergoing continuous updates. We plan to integrate LLF into the training process in future iterations and further refine the models.

Feb 13 '25 17:02 zmsn-2077