More details about tuning the visual tokenizer?

Open YAOYI626 opened this issue 2 years ago • 1 comments

Thanks authors for the insightful work!

I want to understand more details about tuning the visual tokenizer. Could you mind explaining about what kind of dataset used in training your own visual tokenizer?

It will be supe helpful to us! Thanks in advance.

Aug 17 '23 09:08 YAOYI626

Hi, Thank you for your interest in our work!

For visual tokenizer distillation, we followed the protocol in FD and performed the feature distillation on ImageNet-1K dataset.

Aug 17 '23 16:08 daoyuan98