
Where do the bounding boxes used in creating the AS-V2 dataset come from?

Open • littlepenguin89106 opened this issue 1 year ago • 1 comment

Thank you for the excellent work on ASMv2. In the paper, you mention that when creating the AS-V2 dataset, the bounding boxes of objects are used as part of the prompt for GPT-4V. However, the process of obtaining these bounding boxes wasn't explained. Could you describe the workflow for acquiring the bounding boxes?

littlepenguin89106 • Sep 11 '24 01:09

Hello, may I ask if you have solved this problem?

jjl212 • Dec 20 '24 02:12