BrainScore-Transformers
BrainScore-Transformers copied to clipboard
Code from the paper "Joint rotational invariance and adversarial training of a dual-stream Transformer yields state of the art Brain-Score for Area V4" in BSW at Cosyne 2022
Adversarial Training + Rotational Invariance of Transformers
Introduction
We provide evidence against the unexpected trend of Vision Transformers (ViT) being not perceptually aligned with human visual representations by showing how a dual-stream Transformer (CrossViT) under a joint rotationally-invariant and adversarial optimization procedure yields 2nd place in the aggregate Brain-Score 2022 competition averaged across all visual categories, and currently (March 1st, 2022) holds the 1st place for the highest explainable variance of area V4. Against our initial expectations, these results provide tentative support for an ''All roads lead to Rome'' argument enforced via a joint optimization rule even for non biologically-motivated models of vision such as Vision Transformers
For more details please see our BSW 2022 paper.
Setup
- Install Python (>=3.7), PyTorch and other required python libraries with:
pip install -r requirements.txt - Download Imagenet dataset and valprep.sh for preparing validation set:
mkdir -p ./Dataset # Unzip data inside "Dataset" cd ./Dataset/val bash valprep.sh
Usage
- Generate or choose a config file from "Configs" folder and run the experiments:
python -u train_adv.py --data path/to/dataset --config path/to/config.yaml
Pretrained weights
| ID | Description | Val. Acc(%) | Avg | V1 | V2 | V4 | IT | Behavior |
|---|---|---|---|---|---|---|---|---|
| 1057 | CrossViT-18† | 83.05 | 0.442 | 0.473 | 0.274 | 0.478 | 0.484 | 0.500 |
| 1095 | CrossViT-18†+Rotation | 79.22 | 0.458 | 0.458 | 0.288 | 0.495 | 0.503 | 0.547 |
| 1084 | CrossViT-18†+Adv | 64.60 | 0.462 | 0.497 | 0.343 | 0.508 | 0.519 | 0.441 |
| 991 | CrossViT-18†+Rotation+Adv | 73.53 | 0.488 | 0.493 | 0.342 | 0.514 | 0.531 | 0.562 |
Citation
If you find this useful for your work, please consider citing
@inproceedings{
berrios2022joint,
title={Joint rotational invariance and adversarial training of a dual-stream Transformer yields state of the art Brain-Score for Area V4},
author={William Berrios and Arturo Deza},
booktitle={Brain-Score Workshop},
year={2022},
url={https://openreview.net/forum?id=SOulrWP-Xb5}
}