ViLT
ViLT copied to clipboard
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Hi, if I train a new dataset, do I still use the data preparation script? For example, MM-IMDb and UPMC Food-101 dataset. Thank you very much.
Hi Thanks for your impressive works, which helps me a lot to solve my tasks. Due to the surprised outcome we obtained by fine-tuning the ViLT, we plan to train...
Hi, I have Image and Description of Products which is in Spanish language and want to train a classifier model using ViLT. What kind of pretrained model shall I use...
Finetune is failing ValueError: operands could not be broadcast together with shapes (384,576) (3,)
Hi, I am trying to retrain the network further on same VQA dataset and this is failing with error ` data = [self.dataset[idx] for idx in possibly_batched_index] 50 else: 51...
Hey @dandelin , I just want to share the results I reproduced with my own recall implementation. Here is my ViltModel ```python from typing import List, Dict import torch from...
In [this line](https://github.com/dandelin/ViLT/blob/762fd3975c180db6fc88f577cf39549983fa373a/vilt/modules/objectives.py?fbclid=IwAR1YnT-PjjklnNLX-WSDmNCUW3ZQNz2kcmtoQtHGMqG65ecpM62cUJIljrU#L428) the answer is being initialized to zeros and never changed. I am not able to understand how this helps with both positive and negative examples. Can someone...
I am trying to run the vqa demo on jupyter notebook, but it keeps saying in the browser "waiting for localhost", do you know which port is it supposed to...
Validation sanity check: 0it [00:00, ?it/s]ERROR - ViLT - Failed after 0:00:13! Traceback (most recent call last): File "/home/zhurui10/.custom/cuda-10.2-cudnn8-devel-ubuntu18.04-pytorch1.8.0_full_tensorboard/pylib/Jupyter-kuplus/sacred/experiment.py", line 312, in run_commandline return self.run( File "/home/zhurui10/.custom/cuda-10.2-cudnn8-devel-ubuntu18.04-pytorch1.8.0_full_tensorboard/pylib/Jupyter-kuplus/sacred/experiment.py", line 276, in...
rror while read file idx 4315223 in conceptual_caption_train_0 -> cannot identify image file Error while read file idx 3711625 in conceptual_caption_train_0 -> image file is truncated (2 bytes not processed)...