Vision-Transformer icon indicating copy to clipboard operation
Vision-Transformer copied to clipboard

Implementation of Vision Transformer from scratch and performance compared to standard CNNs (ResNets) and pre-trained ViT on CIFAR10 and CIFAR100.

Results 2 Vision-Transformer issues
Sort by recently updated
recently updated
newest added

Hi, Thanks for sharing the code! The code for visualization is especially helpful. The authors of ViT also compute the mean attention distance. Are there any plans to support mean...

Add HN link of visual explanation