ViLT issues

How to use the modal-type embedding in the output of encoder?

1

Train on custom data

1

Hi, if I train on a new dataset, do I still use the data preparation script? For example, the MM-IMDb and UPMC Food-101 datasets. Thank you very much.

guanhdrmq

ViLT training time

1

Hi, thanks for your impressive work, which has helped me a lot with my tasks. Given the surprising results we obtained by fine-tuning ViLT, we plan to train...

xii-rao

How to use ViLT model for Spanish Text ?

1

Hi, I have images and descriptions of products, with the descriptions in Spanish, and I want to train a classifier using ViLT. What kind of pretrained model should I use...

karndeepsingh

Finetune is failing ValueError: operands could not be broadcast together with shapes (384,576) (3,)

2

Hi, I am trying to train the network further on the same VQA dataset, and it fails with the error ` data = [self.dataset[idx] for idx in possibly_batched_index] 50 else: 51...

amitkayal

Got better results than in the paper:

2

Hey @dandelin, I just want to share the results I reproduced with my own recall implementation. Here is my ViltModel: ```python from typing import List, Dict import torch from...

JoanFM

Flickr30k Image and Text Retrieval - Query regarding training

2

In [this line](https://github.com/dandelin/ViLT/blob/762fd3975c180db6fc88f577cf39549983fa373a/vilt/modules/objectives.py?fbclid=IwAR1YnT-PjjklnNLX-WSDmNCUW3ZQNz2kcmtoQtHGMqG65ecpM62cUJIljrU#L428) the answer is being initialized to zeros and never changed. I am not able to understand how this helps with both positive and negative examples. Can someone...

gchhablani

Waiting for localhost

I am trying to run the VQA demo in a Jupyter notebook, but the browser keeps saying "waiting for localhost". Do you know which port is it supposed to...

seifmaged31

AttributeError: 'LightningDistributedDataParallel' object has no attribute '_sync_params'

1

Validation sanity check: 0it [00:00, ?it/s]ERROR - ViLT - Failed after 0:00:13! Traceback (most recent call last): File "/home/zhurui10/.custom/cuda-10.2-cudnn8-devel-ubuntu18.04-pytorch1.8.0_full_tensorboard/pylib/Jupyter-kuplus/sacred/experiment.py", line 312, in run_commandline return self.run( File "/home/zhurui10/.custom/cuda-10.2-cudnn8-devel-ubuntu18.04-pytorch1.8.0_full_tensorboard/pylib/Jupyter-kuplus/sacred/experiment.py", line 276, in...

KimSoybean

while read file idx 2740206 in conceptual_caption_train_0 -> image file is truncated

1

Error while read file idx 4315223 in conceptual_caption_train_0 -> cannot identify image file Error while read file idx 3711625 in conceptual_caption_train_0 -> image file is truncated (2 bytes not processed)...

campper
