Zheng Guang Cong
Zheng Guang Cong
 1. how many images are use in training? 40k or the updated full 118k? As shown in paper "High-Resolution Complex Scene Synthesis with Transformers",  the number of training...
Thanks for your impressive work! I have the following questions: 1. How to split coco-stuff val into 1024 val and 2048 test 2. when calculating FID, do you generate 2048x5...
A really exciting work! I wonder if it could be implemented in stable diffusion.
1. In CT, would it be acceptable to use Loss( f(x0+t_{n+1}*z), x0 ) in place of Loss( f(x0+t_{n+1}*z) , f(x0+t_{n}*z) ) ? 2. I would like to know if doing...
Thanks for your great work! SG2Im, Layout2Im, LostGAN, they train on the deprecated coco-stuff with 24,972 training images. Do you train on the deprecated coco-stuff 2017 segmentation challenge or the...
From the loss of mse and m_mse, it seems that the mask branch does not work in MDT-S-2. We also visualize the generation image and find that generated image with...
1. Could you share the hyperparameters or shell for sampling videos? 2. what is the precise setting of guidance_scale, flow_shift, num_inference_steps, sampler, checkpoint(diffusers or not), negative prompt? I tried to...
1. Lora finetuning 2. Full finetuning