DiCoSA issues

MSVD checkpoint

5

Hi, Congrats on your amazing work! Can you please upload the MSVD checkpoint and steps for inference?

Questions about the inference stage

Thank you for sharing such a great job! You concatenated the latent factors of text and video subspace to calculate similarity through MLP, which means that during the testing phase,...

qjyyyy

Strange results occur when reproducing code on a GPU

I'm getting strange results when running the code on an RTX 3090 GPU. I first used the code in CLIP4Clip to compress the video size to 3fps : https://github.com/ArrowLuo/CLIP4Clip/blob/master/preprocess/compress_video.py and...

11362p

i met some issue when i cry to train on the msvd dataset.

1

Hi, I am facing the issue when trying to train on the MSVD dataset. I got the errors as the message below. command： torchrun main_retrieval.py --do_train 1 --workers 8 --n_display...

Asakinevergup

Questions about disentangled representation learning

Hello, Thank you for the nice work. I have a question on the representation projection. In your paper, the text and video representation are independently project into K components with...

Ray-Zhen

question about QB-norm inference

Hello, I found that you used QB-Norm postprocessing in inference stage while there is no mention about qb-norm in paper, can you show the result without qb-norm? thank you for...

musicman217

Unable to Reproduce Paper’s MSRVTT Results

I'm unable to reproduce the scores reported in the paper. Below are my MSRVTT training/testing results. Could you please advise? ![Image](https://github.com/user-attachments/assets/ba042e03-53dc-4d26-a8ad-47fca45a0c77) ![Image](https://github.com/user-attachments/assets/6177ab58-fc49-43ca-8499-4d64aa0694ea) My settings are as follows: ``` CUDA_VISIBLE_DEVICES=0,1 \...

hankyuwon

DiCoSA
DiCoSA copied to clipboard

Metadata

MSVD checkpoint

Questions about the inference stage

Strange results occur when reproducing code on a GPU

i met some issue when i cry to train on the msvd dataset.

Questions about disentangled representation learning

question about QB-norm inference

Unable to Reproduce Paper’s MSRVTT Results

← Metadata

Owner

Metadata

DiCoSA DiCoSA copied to clipboard

Metadata

MSVD checkpoint

Questions about the inference stage

Strange results occur when reproducing code on a GPU

i met some issue when i cry to train on the msvd dataset.

Questions about disentangled representation learning

question about QB-norm inference

Unable to Reproduce Paper’s MSRVTT Results

← Metadata

Owner

Metadata

DiCoSA
DiCoSA copied to clipboard