LlamaGen icon indicating copy to clipboard operation
LlamaGen copied to clipboard

Why the result image is all black, when cfg-scale == 0 and temperature == 0 in Class-conditionalImageGeneration

Open Haodong-Lei-Ray opened this issue 1 year ago • 1 comments

Hi. When I test Class-conditionalImageGeneration function in LlamaGen's GPT-L model. When I use cfg-scale == 0 temperature == 0 in the model. sample_c2icfg0t0 When I set cfg-scale 0, when temperature == 0 sample_c2icfg0t0 When I set cfg-scale 2, when temperature == 0 sample_c2icfg2t0 When I set cfg-scale 3, when temperature == 0 sample_c2icfg3t0 When I set cfg-scale 4 when temperature == 0 sample_c2icfg4t0 When I set cfg-scale 10 when temperature == 0 sample_c2i So when cfg-scale to 10 could make 360 class otter(Also maybe sth wrong in #76 ) Also I find when I set cfg-scale 0, 2, 3, 4, 10 when temperature == 1 sample_c2icfg1 sample_c2icfg2 sample_c2icfg3 sample_c2icfg4 Question 1: Why the result image is all black, when cfg-scale == 0 and temperature == 0 in Class-conditionalImageGeneration? Could I understand that the model depend heavily on the temperature random? Is it only when setting temperature != 0 could working?(Maybe > 1.0) Also I notice your Experiments that model performance very bad when cfg-scale == 0. But I do not find temperature Experiments image

Haodong-Lei-Ray avatar Dec 26 '24 14:12 Haodong-Lei-Ray

It is my hyperparameter

python3 autoregressive/sample/sample_c2i.py \
  --vq-ckpt ./pretrained_models/vq_ds16_c2i.pt \
  --gpt-ckpt ./pretrained_models/c2i_L_384.pt \
  --gpt-model GPT-L \
  --image-size 384

Haodong-Lei-Ray avatar Dec 26 '24 14:12 Haodong-Lei-Ray