DiffTalk
[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"
https://github.com/yxdydgithub/difftalk_preprocess — tested and working
Thanks for your great work. I am confused about one thing in the preprocessing stage. When we extract images, landmarks, and audio features from a video, do we need to have the...
I use deepspeech==0.9.3, but it fails with an error: graph_def.ParseFromString(f.read()) google.protobuf.message.DecodeError: Error parsing message with type 'tensorflow.GraphDef'
As I understand it, the driving-audio feature a and the landmark representation l are each a single vector, not a batch of vectors, so how can they be used in cross-attention...
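One common answer (a sketch of standard cross-attention, not necessarily the paper's exact implementation, and omitting the learned K/V projections for brevity) is to treat the conditioning vector as a length-1 token sequence: unsqueeze it to shape (B, 1, d) and let it serve as keys and values, while the image features supply the queries.

```python
import numpy as np

def cross_attention(q, kv):
    """q: (B, N, d) image tokens; kv: (B, M, d) condition tokens (M may be 1)."""
    d = q.shape[-1]
    scores = q @ kv.transpose(0, 2, 1) / np.sqrt(d)        # (B, N, M)
    weights = np.exp(scores - scores.max(-1, keepdims=True))
    weights /= weights.sum(-1, keepdims=True)              # softmax over the M condition tokens
    return weights @ kv                                    # (B, N, d)

B, N, d = 2, 16, 64
image_tokens = np.random.randn(B, N, d)
audio_vec = np.random.randn(B, d)           # one conditioning vector per sample
cond = audio_vec[:, None, :]                # -> (B, 1, d): a length-1 "sequence"
out = cross_attention(image_tokens, cond)   # (B, N, d)
```

With M = 1 the softmax over keys is trivially 1, so every query attends fully to the single condition token; in practice the learned query/key/value projections make this injection non-trivial.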
After preprocessing the HDTF dataset, I got 415 videos. 249 videos (60%) were randomly selected as the training set; the remaining videos (40%) formed the test set. The first 1500 frames of each...
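The split described above can be reproduced along these lines (a sketch; the seed and ID naming are assumptions, so a different seed yields a different partition of the same sizes):

```python
import random

def split_videos(video_ids, train_frac=0.6, seed=0):
    """Randomly split video IDs into train/test sets, ~60/40 as described."""
    rng = random.Random(seed)          # fixed seed for a reproducible split
    ids = list(video_ids)
    rng.shuffle(ids)
    n_train = round(len(ids) * train_frac)
    return ids[:n_train], ids[n_train:]

videos = [f"video{i:03d}" for i in range(415)]   # hypothetical IDs
train, test = split_videos(videos)
print(len(train), len(test))  # 249 166
```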
I encountered a problem with the package 'ldm'; my environment has ldm==0.1.3, python==3.7, pytorch==1.12.1
How do I run inference with my own reference image and audio to generate an audio-driven video?
What does each line in data_test.txt mean? I guess the part before the '_' is the video ID and the part after is the frame number within that video, but some of them don't have...
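Assuming each line has the form "<videoID>_<frameIndex>", splitting on the last underscore handles video IDs that themselves contain underscores (a sketch; the example ID is hypothetical, and lines without a trailing numeric field are reported rather than guessed at):

```python
def parse_line(line):
    """Split '<videoID>_<frameIndex>' on the LAST underscore."""
    line = line.strip()
    vid, _, frame = line.rpartition("_")
    if not vid or not frame.isdigit():
        return None  # line does not match the expected pattern
    return vid, int(frame)

print(parse_line("WDA_BarackObama_000_0153"))  # ('WDA_BarackObama_000', 153)
```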
Can anyone share a usable requirements.txt? The provided one has many conflicts and errors.