Jiayan Teng
I guess that you first upscale the low-res image using SRGAN and then conduct image-to-image on it using MultiDiffusion? And for upscaling, is MultiDiffusion recommended over Mixture-of-Diffusers?
Thank you!
And it seems that the corresponding lines are here?
The first-stage model can be any diffusion model, and the second stage is the model trained with relay diffusion. For an example, see the "Performance Reproduction" section below.
CogVideoXDDIMSampler is equivalent to #7: it computes x_t without the "random noise" term. Just set std_dev_t = 0 so there is no randomness; it is simply a different way of writing the same formula.
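To illustrate, here is a minimal, framework-free sketch of a single DDIM update (NumPy, with hypothetical argument names that are not from the repo). With std_dev_t = 0 the noise term vanishes and the step is fully deterministic:

```python
import numpy as np

def ddim_step(x_t, eps, alpha_bar_t, alpha_bar_prev, std_dev_t=0.0, rng=None):
    # Predict x_0 from the model's noise estimate eps.
    x0_pred = (x_t - np.sqrt(1.0 - alpha_bar_t) * eps) / np.sqrt(alpha_bar_t)
    # Deterministic direction pointing back toward x_t.
    dir_xt = np.sqrt(1.0 - alpha_bar_prev - std_dev_t**2) * eps
    x_prev = np.sqrt(alpha_bar_prev) * x0_pred + dir_xt
    # With std_dev_t = 0 this branch is skipped and the update is deterministic.
    if std_dev_t > 0:
        rng = rng or np.random.default_rng()
        x_prev = x_prev + std_dev_t * rng.standard_normal(x_t.shape)
    return x_prev
```

Running the same step twice with std_dev_t = 0 always yields identical outputs, which is the "no randomness" behavior described above.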
Thanks for the reminder; we have added a quick-start instruction at the beginning of the README.
We use Python 3.11. Sorry for not specifying the exact Python version; we will clarify this in the README.
> Hello, will the results with the diffuser be better than those with the SAT? Besides, I find that the generated video of rigid objects will have a nice view,...
1. You can just set `train_iters = epochs * dataset_size`.
2. Just set `load` to the pre-finetuned model's directory path.
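As a concrete sketch of the first point, with made-up example numbers (not values from the repo):

```python
# Hypothetical example values; substitute your own dataset size and epoch count.
dataset_size = 10_000   # number of training samples
epochs = 3

# One sample per iteration, per the formula above.
train_iters = epochs * dataset_size
print(train_iters)  # 30000
```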
Setting the random seed only guarantees that each execution of the script reproduces the same result; it cannot make multiple sampling runs within the same script execution produce identical results, because the RNG state advances between them....
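A minimal illustration of this behavior using Python's standard RNG (the same pattern applies to `torch.manual_seed`):

```python
import random

random.seed(1234)                             # seed once at script start
first = [random.random() for _ in range(3)]   # first sampling run
second = [random.random() for _ in range(3)]  # RNG state has advanced: different values

random.seed(1234)                             # re-running the script resets the stream
rerun_first = [random.random() for _ in range(3)]

assert rerun_first == first  # reproducible across executions of the script
assert first != second       # but not across multiple runs within one execution
```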