BLIP icon indicating copy to clipboard operation
BLIP copied to clipboard

Confidence scores for image captions?

Open key88sf opened this issue 3 years ago • 2 comments

Hello - is there any way for the model to also return a [0,1] confidence score when generating an image caption?

key88sf avatar Feb 18 '23 19:02 key88sf

Hi, you can pass in the generated caption to our ITM model to get a similarity matching score. You may want to refer to examples in our LAVIS library: https://github.com/salesforce/LAVIS/blob/main/examples/blip_image_text_matching.ipynb

LiJunnan1992 avatar Feb 20 '23 00:02 LiJunnan1992

Hello, What about the visual question answering? How can we get the probabilities? Thanks

kosarkazemi avatar Mar 31 '23 16:03 kosarkazemi