PuLID

Question about Fig. 2 in the paper

Open dingangui opened this issue 1 year ago • 2 comments

Hello, thanks for your incredible work!

In the 'Accurate ID Loss' section in the bottom-right corner of Figure 2 of the paper, there are two generated images, both denoted 'predict x_0'. Are both of these images produced by the Lightning T2I branch? I guessed that they represent T2I w/ ID and T2I w/o ID, respectively. However, on closer inspection, the ID appears well preserved in both images, which contradicts my guess. What do these two images actually represent, and why are they connected by a vertical line?

dingangui, May 09 '24 14:05

When calculating the ID loss, the two images involved are both generated by the Lightning T2I training branch, and both are generated under the T2I w/ ID setting. The contrastive pair composed of T2I w/ ID and T2I w/o ID is used only when calculating the alignment loss.

guozinan126, May 10 '24 06:05

> When calculating the ID loss, the two images involved are both generated by the Lightning T2I training branch, and both are generated under the T2I w/ ID setting. The contrastive pair composed of T2I w/ ID and T2I w/o ID is used only when calculating the alignment loss.

Is no ID loss calculated against the given identity images here? Is it computed only on the generated images? Also, how do you ensure that the ID loss computed with ArcFace is differentiable, so that gradients can be backpropagated?

Luh1124, Aug 25 '24 11:08
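
On the differentiability question, here is a minimal sketch, assuming the ID loss takes the form of a cosine distance between two face embeddings (the thread does not state the exact form). The names `id_loss` and `id_loss_grad` are hypothetical, and random vectors stand in for ArcFace embeddings. It only illustrates that such a loss is a smooth function of the embedding with a closed-form gradient. It does not cover the embedding network or any face detection and alignment step, which would also need to be differentiable for gradients to reach the generator, and the thread does not say how PuLID handles that.

```python
import numpy as np


def id_loss(emb_a, emb_b, eps=1e-8):
    """Cosine-distance loss, 1 - cos(emb_a, emb_b) (assumed form of the ID loss)."""
    a = np.asarray(emb_a, dtype=np.float64)
    b = np.asarray(emb_b, dtype=np.float64)
    cos = a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + eps)
    return 1.0 - cos


def id_loss_grad(emb_a, emb_b, eps=1e-8):
    """Analytic gradient of id_loss with respect to emb_a."""
    a = np.asarray(emb_a, dtype=np.float64)
    b = np.asarray(emb_b, dtype=np.float64)
    na, nb = np.linalg.norm(a), np.linalg.norm(b)
    cos = a @ b / (na * nb + eps)
    # d cos / d a = b / (|a||b|) - cos * a / |a|^2; the loss is 1 - cos.
    return -(b / (na * nb + eps) - cos * a / (na**2 + eps))


# Stand-in embeddings; a real pipeline would take these from a face encoder.
rng = np.random.default_rng(0)
emb_generated = rng.normal(size=512)
emb_reference = rng.normal(size=512)

loss = id_loss(emb_generated, emb_reference)
grad = id_loss_grad(emb_generated, emb_reference)
print(loss, grad.shape)
```

In an autodiff framework the gradient would not be written by hand. The closed form is shown only to make explicit that nothing in the loss itself blocks backpropagation.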