
Can LlamaGen predict an [EOS] token during inference?

Open Xiaoyuan-Isaac-Wang opened this issue 1 year ago • 6 comments

Xiaoyuan-Isaac-Wang avatar Jul 17 '24 07:07 Xiaoyuan-Isaac-Wang

No, we do not need to.

daiyixiang666 avatar Jul 17 '24 07:07 daiyixiang666

If I simply append an [EOS] token after tokenizing an image for training, and at inference the model stops generating once [EOS] is predicted, will that work? What are your thoughts on this? Thanks!
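A minimal sketch of the idea in this question, not LlamaGen's actual code. The codebook size, the choice of `EOS_ID`, and the helper names `add_eos`, `generate`, and `toy_model` are all assumptions made for illustration; `toy_model` is a stand-in for a trained network.

```python
CODEBOOK_SIZE = 16384      # assumed number of image codes
EOS_ID = CODEBOOK_SIZE     # assumed: [EOS] takes the next free id

def add_eos(image_tokens):
    """Training side: append [EOS] after the image tokens."""
    return list(image_tokens) + [EOS_ID]

def generate(next_token_fn, max_tokens):
    """Inference side: stop as soon as [EOS] is predicted."""
    tokens = []
    for _ in range(max_tokens):
        tok = next_token_fn(tokens)
        if tok == EOS_ID:
            break
        tokens.append(tok)
    return tokens

def toy_model(prefix, image_len=256):
    """Hypothetical stand-in: emits [EOS] once image_len tokens exist."""
    return EOS_ID if len(prefix) >= image_len else len(prefix) % CODEBOOK_SIZE

sample = generate(toy_model, max_tokens=1000)
print(len(sample))  # 256
```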

Xiaoyuan-Isaac-Wang avatar Jul 17 '24 07:07 Xiaoyuan-Isaac-Wang

Hmm, it will work, but it would not be useful: because the image has a fixed latent size, the model runs exactly the same number of steps at inference either way.
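To illustrate why the step count is fixed, here is a small sketch assuming a square image and a tokenizer with a fixed spatial downsampling factor. The function name and the example values are illustrative, not taken from the repository.

```python
def num_image_tokens(image_size, downsample):
    """Tokens in the latent grid of a square image (assumed layout)."""
    if image_size % downsample != 0:
        raise ValueError("image_size must be divisible by downsample")
    side = image_size // downsample
    return side * side

# The sampling loop can simply run for this many steps, so no [EOS] is needed.
print(num_image_tokens(256, 16))  # 256
print(num_image_tokens(384, 16))  # 576
```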

daiyixiang666 avatar Jul 17 '24 08:07 daiyixiang666

But it would be useful if you want to train on images with different aspect ratios and provide that information as the start token.
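One way this could look, as a rough sketch: each aspect-ratio bucket gets its own start token, and the grid shape is recovered from that token. The bucket list, token ids, and function names below are hypothetical and only show the bookkeeping.

```python
CODEBOOK_SIZE = 16384  # assumed number of image codes

# Hypothetical buckets: name -> (rows, cols) of the latent grid.
ASPECT_BUCKETS = {"1:1": (16, 16), "4:3": (12, 16), "3:4": (16, 12)}

# Start-token ids come after the image codes; id CODEBOOK_SIZE is
# left free here for an optional [EOS] token.
START_TOKEN = {name: CODEBOOK_SIZE + 1 + i
               for i, name in enumerate(ASPECT_BUCKETS)}
GRID_OF = {START_TOKEN[name]: grid for name, grid in ASPECT_BUCKETS.items()}

def build_sequence(ratio, image_tokens):
    """Training side: prepend the start token for this aspect ratio."""
    rows, cols = ASPECT_BUCKETS[ratio]
    if len(image_tokens) != rows * cols:
        raise ValueError("token count does not match the latent grid")
    return [START_TOKEN[ratio]] + list(image_tokens)

def tokens_to_generate(start_token):
    """Inference side: the start token determines the sequence length."""
    rows, cols = GRID_OF[start_token]
    return rows * cols

seq = build_sequence("4:3", [0] * (12 * 16))
print(tokens_to_generate(seq[0]))  # 192
```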

daiyixiang666 avatar Jul 17 '24 08:07 daiyixiang666

Hi~ Adding a special token to enable image generation at different aspect ratios is very promising. We will try this idea in the future if possible. Thanks!

PeizeSun avatar Jul 23 '24 06:07 PeizeSun

Yes. Another suggestion, based on my experience training many T2I models: adding cross-attention instead of prepending the token tends to produce more promising results.
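A rough NumPy sketch of the two options, to make the contrast concrete. It is a single-head toy with random weights, not a trained layer and not LlamaGen's architecture; all shapes and names are assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(image_states, cond_states, w_q, w_k, w_v):
    """Image tokens query the condition tokens (single head, residual add)."""
    q = image_states @ w_q                    # (n_img, d)
    k = cond_states @ w_k                     # (n_cond, d)
    v = cond_states @ w_v                     # (n_cond, d)
    weights = softmax(q @ k.T / np.sqrt(q.shape[-1]), axis=-1)
    return image_states + weights @ v, weights

rng = np.random.default_rng(0)
d = 8
image_states = rng.normal(size=(4, d))        # 4 image-token states
cond_states = rng.normal(size=(3, d))         # 3 condition-token states
w_q, w_k, w_v = (rng.normal(size=(d, d)) * 0.1 for _ in range(3))

# Option A: prepend the condition to the sequence (it becomes longer).
prefixed = np.concatenate([cond_states, image_states], axis=0)

# Option B: keep the sequence length and inject the condition via cross-attention.
out, weights = cross_attention(image_states, cond_states, w_q, w_k, w_v)

print(prefixed.shape, out.shape)  # (7, 8) (4, 8)
```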

daiyixiang666 avatar Jul 23 '24 08:07 daiyixiang666