mmf
mmf copied to clipboard
How to train the VAE image tokenizer?
🚀 Feature
Thanks for your excellent work!
As mentioned in the paper,
we first train a discrete VAE as the image tokenizer on our collected fashion images with the perceputal loss.
Does the code support the image tokenizer training?