InternVideo
InternVideo copied to clipboard
zeroshot video-retrieval
Thank you for your work! But I have a question about zero shot video-retrieval task on activitynet dataset, which pretrain model I should use to reproduce the performance?Is Clip ViT-L-14.pt? Thank you for your response!