PREVALENT
PREVALENT copied to clipboard
large scale pretrain for navigation task
Prevalent: A Pretrained Generic VLN Agent
This repository contains source code to reproduce the results presented in the paper:
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training, CVPR 2020
Weituo Hao*,
Chunyuan Li*,
Xiujun Li,
Lawrence Carin,
Jianfeng Gao
Pretrain
Our collected triplets can be downloaded here
The pretrained model can be downloaded here
R2R
CVDN
HANNA
Citation
If you use this code for your research, please cite our paper:
@article{hao2020prevalent,
title={Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training},
author={Hao, Weituo and Li, Chunyuan and Li, Xiujun and Carin, Lawrence and Gao, Jianfeng},
journal={Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2020}
}