Question about defining the contrastive loss on zi’s rather than hi’s

Open garynlfd opened this issue 4 years ago • 0 comments

Hello,

I have a question after reading your paper. You mentioned that it's beneficial to define the contrastive loss on zi’s rather than hi’s, but I'm not sure what's the main reason of this. As you mentioned in section 4, "z is trained to be invariant to data transformation", is this the main reason? Could you give me more evidence about why z is better than h in contrastive loss? I would be grateful if you could give me some hints.

Thanks.

Mar 27 '22 17:03 garynlfd