张昱辉Zhang Yuhui
张昱辉Zhang Yuhui
when i tried i change var.py class VAR's patch_nums, class VARHF's patch_nums,and arg_util.py pn
@iFighting thanks a lot! I am working on retrain var for inpainting research, hope i can get a good result :)
still trying ------------------ Original ------------------ From: ***@***.***>; Date: Wed, Dec 18, 2024 03:47 PM To: ***@***.***>; Cc: "张昱辉Zhang ***@***.***>; ***@***.***>; Subject: Re: [FoundationVision/VAR] How can I change the scale for training?...
also i find sometimes used gumbel-softmax(hard=false) will show a better result for train,but it is a bad set for use hard=false? if codebook is limited, will mixed token show more...
@jack111331 thx for u reply , I will try what your questioned in serveral weeks due to busy work. the mix token will useful if loss is not only crossentropy...
@Longhzzz I use gumbel_softmax for different scales token that mix into the final 16*16 token map(which then through frozen decoder into img ,then use img to apply image loss. I...