Fida Mohammad Thoker

Results 12 issues of Fida Mohammad Thoker

Hello everyone, has anyone successfully converted Kinetics pre-trained weights to PyTorch TSN?

@yjxiong Can you explain why there is a different loss function mentioned in the paper as opposed to the normal cross entropy? From the implementation, I can see you have...

@yjxiong I successfully trained a new model with the PKUMMD dataset. However, during testing I am getting NaN values for mAP. I have attached both training and testing logs below. **These...

Hi, thanks for releasing the code and this amazing work. I am using the MAE and MoCo-V3 baselines in my current work; however, I can't reproduce your results in Table...

Hi, thanks for releasing the code and the R(2+1)D model. Can you please share the Kinetics-pretrained 3D-ResNet18 weights too? I am comparing 3D-ResNet18 models from other self-supervised methods too in...

Thanks for releasing the code and pretrained models of your amazing work "Audio-Visual Instance Discrimination with Cross-Modal Agreement". I noticed that you used different architectures for R(2+1)D in different experiments...

While running the notebook, I get the following error: IOError Traceback (most recent call last) in () 24 25 write_image_annotation_pairs_to_tfrecord(filename_pairs=overall_train_image_annotation_filename_pairs, ---> 26 tfrecords_filename='pascal_augmented_train.tfrecords') /home/fmthoker/bonn/lab_vision_systems/slim/models/tf-image-segmentation/tf_image_segmentation/utils/tf_records.py in write_image_annotation_pairs_to_tfrecord(filename_pairs, tfrecords_filename) 36 37 img...

Dear authors, can you share some details about how we can generate captions for new videos in the same manner as was done for InternVid? From the paper, you generated...

Dear authors, great work, and thanks for releasing the code for ViCLIP pretraining on InternVid-10M-FLT. First, it would be really great if the pre-training instructions were more detailed, like which...