Ask-Anything
How is it related to Video ChatCaptioner?
There is another work called Video ChatCaptioner, and the two ideas look closely related. Could you explain the main difference between your work and Video ChatCaptioner? https://github.com/Vision-CAIR/ChatCaptioner/tree/main/Video_ChatCaptioner