GVT
GVT copied to clipboard
More details about tuning the visual tokenizer?
Thanks authors for the insightful work!
I want to understand more details about tuning the visual tokenizer. Could you mind explaining about what kind of dataset used in training your own visual tokenizer?
It will be supe helpful to us! Thanks in advance.
Hi, Thank you for your interest in our work!
For visual tokenizer distillation, we followed the protocol in FD and performed the feature distillation on ImageNet-1K dataset.