Can LlamaGen predict an [EOS] token during inference?

Open Xiaoyuan-Isaac-Wang opened this issue 1 year ago • 6 comments

Jul 17 '24 07:07 Xiaoyuan-Isaac-Wang

No, we do not need to.

Jul 17 '24 07:07 daiyixiang666

If I simply append an [EOS] token after tokenizing an image for training, and at inference time stop generating as soon as [EOS] is predicted, will that work? What are your thoughts on this? Thx!
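
A minimal sketch of the idea, not LlamaGen code. The codebook size, the `EOS_ID` value, and the `toy_next_token` stand-in for the model are all assumptions made for illustration:

```python
import random

CODEBOOK_SIZE = 16384      # assumed size of the image-token vocabulary
EOS_ID = CODEBOOK_SIZE     # hypothetical extra id reserved for [EOS]


def add_eos(image_tokens):
    """Training side: append [EOS] to the tokenized image."""
    return list(image_tokens) + [EOS_ID]


def generate(next_token, max_new_tokens):
    """Inference side: sample until [EOS] appears or the budget runs out."""
    out = []
    for _ in range(max_new_tokens):
        tok = next_token(out)
        if tok == EOS_ID:
            break          # stop as soon as [EOS] is predicted
        out.append(tok)
    return out


def toy_next_token(prefix, n_image_tokens=4):
    """Stand-in for the model: emits 4 image tokens, then [EOS]."""
    if len(prefix) >= n_image_tokens:
        return EOS_ID
    return random.randrange(CODEBOOK_SIZE)


tokens = generate(toy_next_token, max_new_tokens=10)
```

With the toy model, `generate` stops after 4 tokens even though the budget is 10.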

Jul 17 '24 07:07 Xiaoyuan-Isaac-Wang

Hmm, it will work, but it will be useless: the image has a fixed latent size, so the model runs exactly the same number of steps during inference anyway.
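
To make the fixed-size point concrete, here is a rough sketch. It assumes a tokenizer with a downsample factor of 16, which is an assumption and not something stated in this thread; the function names are hypothetical. Under that assumption a 256x256 image becomes a 16x16 grid, so the sampling loop always runs 256 steps and never needs a stop token:

```python
def num_image_tokens(height, width, downsample=16):
    """Number of latent tokens for an image, given an assumed downsample factor."""
    if height % downsample or width % downsample:
        raise ValueError("image size must be a multiple of the downsample factor")
    return (height // downsample) * (width // downsample)


def generate_fixed(next_token, n_tokens):
    """Sample exactly n_tokens; no [EOS] check is needed."""
    out = []
    for _ in range(n_tokens):
        out.append(next_token(out))
    return out


steps = num_image_tokens(256, 256)   # 16 * 16 grid
```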

Jul 17 '24 08:07 daiyixiang666

But it would be useful if you want to train on images with different aspect ratios and pass that information in as the start token.
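
One way this could look, as a sketch only. The bucket list, the token ids, and every function name below are hypothetical: each aspect-ratio bucket gets its own start token, and at inference the start token tells the sampler how many image tokens to generate.

```python
CODEBOOK_SIZE = 16384   # assumed size of the image-token vocabulary

# Hypothetical buckets, given as (grid_height, grid_width) in tokens.
ASPECT_BUCKETS = [(16, 16), (12, 20), (20, 12)]


def ratio_token_id(bucket_index):
    """Special start-token id for a bucket, placed after the image codes."""
    return CODEBOOK_SIZE + bucket_index


def nearest_bucket(height, width):
    """Index of the bucket whose height/width ratio is closest to the image's."""
    target = height / width
    return min(
        range(len(ASPECT_BUCKETS)),
        key=lambda i: abs(ASPECT_BUCKETS[i][0] / ASPECT_BUCKETS[i][1] - target),
    )


def build_sequence(image_tokens, height, width):
    """Training side: prepend the aspect-ratio token to the image tokens."""
    idx = nearest_bucket(height, width)
    grid_h, grid_w = ASPECT_BUCKETS[idx]
    if len(image_tokens) != grid_h * grid_w:
        raise ValueError("token count does not match the bucket's grid")
    return [ratio_token_id(idx)] + list(image_tokens)


def tokens_to_generate(start_token):
    """Inference side: the start token fixes the sequence length."""
    grid_h, grid_w = ASPECT_BUCKETS[start_token - CODEBOOK_SIZE]
    return grid_h * grid_w
```

Because the length follows from the start token, this variant still does not need an [EOS] token.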

Jul 17 '24 08:07 daiyixiang666

Hi~ Adding a special token to enable image generation at different aspect ratios is very promising. We will try this idea in the future if possible. Thx!

Jul 23 '24 06:07 PeizeSun

Yes. Another suggestion, based on my experience training many t2i models: adding cross-attention instead of prepending the token to the sequence will produce more promising results.
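
A toy contrast of the two conditioning styles, in plain Python. This is a sketch under simplifying assumptions (single head, no learned projections, hypothetical function names), not how any particular t2i model implements it:

```python
import math


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def prefix_conditioning(image_states, cond_states):
    """Prefix style: the condition is just extra positions at the front."""
    return list(cond_states) + list(image_states)


def cross_attention(image_states, cond_states):
    """Cross-attention style: every image position attends to the condition.

    Queries are the image states; keys and values are the condition states.
    """
    d = len(cond_states[0])
    out = []
    for q in image_states:
        weights = softmax([dot(q, k) / math.sqrt(d) for k in cond_states])
        mixed = [sum(w * v[j] for w, v in zip(weights, cond_states)) for j in range(d)]
        out.append([a + b for a, b in zip(q, mixed)])   # residual connection
    return out


image_states = [[1.0, 0.0], [0.0, 1.0]]
cond_states = [[0.5, 0.5]]
conditioned = cross_attention(image_states, cond_states)
```

With prefix conditioning the sequence gets longer; with cross-attention the sequence length stays the same and the condition is mixed into each position.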

Jul 23 '24 08:07 daiyixiang666