TokenFlow icon indicating copy to clipboard operation
TokenFlow copied to clipboard

my test output(Reconstructed video lacks consistency) ?

Open zhanghongyong123456 opened this issue 2 years ago • 5 comments

test cmd: python preprocess.py

https://github.com/omerbt/TokenFlow/assets/48466610/3fee547d-f65c-4af0-bee7-5712229c582d

zhanghongyong123456 avatar Sep 05 '23 06:09 zhanghongyong123456

same result with yours

puppynull avatar Sep 05 '23 08:09 puppynull

Inaccurate reconstruction is due to: (i) inaccurate DDIM inversion, (ii) imperfect VAE latent space autoencoder.

Interestingly, our method may still overcome issues with the DDIM inversion thanks to our TokenFlow injection. For example, the editing result for this video does not exhibit these artifcats that occur in the DDIM inversion process.

omerbt avatar Sep 05 '23 09:09 omerbt

Yes, I also experienced this issue. In my experience, it happens because each frame is inverted independently (and becomes severe when fewer DDIM steps are used). However, if you use Cross-Frame attention and Tokenflow propagation during DDIM inversion and reconstruction, this issue gets resolved even for reconstructed video

anime26398 avatar Sep 06 '23 03:09 anime26398

Yes, I also experienced this issue. In my experience, it happens because each frame is inverted independently (and becomes severe when fewer DDIM steps are used). However, if you use Cross-Frame attention and Tokenflow propagation during DDIM inversion and reconstruction, this issue gets resolved even for reconstructed video

I am in total agreement.

G-U-N avatar Sep 06 '23 04:09 G-U-N

@anime26398 Then Tokenflow propagation is implemented in this repo? I can't find 'compute nn fields' and 'tokenflow propagation' it just looks using PnP instead.

hyoseok1223 avatar Dec 21 '23 09:12 hyoseok1223