About interaction
Could this work achieve caption everything without any interaction like SAM?
@PilgrimMay Thank you for your suggestion. Currently, the repository does not support captioning "everything" in a dense caption format. However, we will be adding this feature within the next few days.
Thanks for your continuous upgrading. Note that this job now seems to support caption everything. Is it possible to try this function in the demo?
yes, try demo with chatGPT
OK! Thanks for your excellent work. Would you release the code about training? So that I could train my own datasets.
Hi, the model combines pretrained models like SAM, ChatGPT, and BLIP-2 for interactive usage. No additional training is needed. Please refer to the paper at https://arxiv.org/pdf/2305.02677.pdf for more details, and the Acknowledgement Section in the Readme for the official training code of each pretrained model .