张昱辉Zhang Yuhui

6 comments by 张昱辉Zhang Yuhui

When I tried this, I changed patch_nums in var.py's class VAR, patch_nums in class VARHF, and pn in arg_util.py.

@iFighting Thanks a lot! I am working on retraining VAR for inpainting research, and I hope I can get a good result :)

Still trying. (Reply by email to "Re: [FoundationVision/VAR] How can I change the scale for training?...")

Also, I find that using gumbel-softmax (hard=False) sometimes gives a better result in training, but is hard=False a bad setting? If the codebook is limited, will a mixed token show more...
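To make the hard=False question concrete, here is a minimal forward-pass sketch in plain Python (no gradients, no PyTorch). The names `gumbel_softmax`, `mix_token`, the toy `codebook`, and the `logits` are all hypothetical and only for illustration; they are not taken from the VAR code. With soft weights the resulting token is a weighted mix of codebook entries, while with hard weights it is exactly one entry. In PyTorch, `torch.nn.functional.gumbel_softmax(..., hard=True)` additionally returns a one-hot sample in the forward pass while letting gradients flow through the soft sample, which this sketch does not model.

```python
import math
import random


def gumbel_softmax(logits, tau=1.0, hard=False, rng=random):
    """Forward pass only: sample relaxed one-hot weights over codebook entries."""
    noisy = []
    for logit in logits:
        # Gumbel(0, 1) noise is -log(-log(U)) with U ~ Uniform(0, 1);
        # clamp U away from 0 and 1 to avoid log(0).
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        gumbel = -math.log(-math.log(u))
        noisy.append((logit + gumbel) / tau)
    # Numerically stable softmax over the perturbed logits.
    peak = max(noisy)
    exps = [math.exp(v - peak) for v in noisy]
    total = sum(exps)
    soft = [e / total for e in exps]
    if not hard:
        return soft
    # hard=True: keep only the arg-max entry as a one-hot vector.
    k = max(range(len(soft)), key=soft.__getitem__)
    return [1.0 if i == k else 0.0 for i in range(len(soft))]


def mix_token(weights, codebook):
    """Weighted sum of codebook vectors (a 'mixed token' when weights are soft)."""
    dim = len(codebook[0])
    return [sum(w * entry[d] for w, entry in zip(weights, codebook))
            for d in range(dim)]


# Toy 3-entry codebook with 2-dimensional embeddings (assumed values).
codebook = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
logits = [2.0, 1.5, -1.0]
rng = random.Random(0)

soft_w = gumbel_softmax(logits, tau=1.0, hard=False, rng=rng)
hard_w = gumbel_softmax(logits, tau=1.0, hard=True, rng=rng)
soft_token = mix_token(soft_w, codebook)  # lies between codebook entries
hard_token = mix_token(hard_w, codebook)  # equals exactly one codebook entry
```

With hard=False the token fed downstream is generally not a codebook entry, so a model trained this way may see inputs at training time that discrete arg-max sampling would never produce at inference.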

@jack111331 Thanks for your reply. I will try what you asked about in several weeks, since I am busy with work. The mixed token will be useful if the loss is not only cross-entropy...

@Longhzzz I use gumbel_softmax on the tokens of the different scales, which are mixed into the final 16*16 token map (this map then goes through the frozen decoder to produce an image, and the image loss is applied to that image). I...