Xiongkun Linghu

8 comments by Xiongkun Linghu

I mean the results in the arXiv paper. Were the results in Table 7 obtained by training with the auxiliary loss?

Thanks a lot! I will try it again.

Ok, I will check the paper for more details about this parameter.

Thanks for your reply. Now I understand the format clearly.

Thanks for the response. I am interested in LEO's baseline, especially the data preprocessing and the data loader for such a baseline.

Looking forward to seeing the baselines released soon.

Thanks for your quick reply. Another question: if there are multiple referent tokens in the prompt, how do you distinguish the different referent scene queries? In the above example, "Describe...

Thanks for the reply, I finally understand the mechanism.