EAT_code issues

Details about CLIP fine-tuning and zero-shot text-guided editing

11

Hi, could you kindly provide more details on the **_setting for model fine-tuning with CLIP_** and the **_zero-shot text-guided expression editing procedure_**? For model fine-tuning with CLIP, my understanding is...

JamesLong199

No discriminator in the state-dict. Dicriminator will be randomly initialized load optimizer_a2kp failed, train it from scratch

5

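Hello author, thank you very much for your excellent work. When I use ./ckpt/qvt_img_pca_sync_4_01_11_304.pth.tar to train the second stage, I get the error "No discriminator in the state-dict. Dicriminator will be randomly initialized load optimizer_a2kp failed, train it from scratch". However, looking at the code, the first-stage training does have a pre-trained checkpoint for optimizer_a2kp, so why does this warning appear? Is my pre-trained checkpoint configured incorrectly?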

YZX-codesky

Can anyone share the TED-Talk dataset?

Does anyone have the TED-Talk dataset? I would be very grateful if you could share it.

zouzhekang

Inquiry Regarding Preprocessing VOX2 and MEAD Dataset for Training

8

Thanks to the authors for the wonderful work. As I am attempting to train the model myself and reproduce the results, I would be grateful if the authors could give me more...

Calmepro777

MEAD data compression issue

1

Is there a problem with how the data was compressed? The first file above can be extracted, but the rest cannot.

bronzeman2022

How to generate files for custom audio?

4

According to the instructions provided here: Note 2: To test with a custom audio, you need to replace the video_name/video_name.wav and deepspeech feature video_name/deepfeature32/video_name.npy. The output length will depend on...

subharya83

How long does it take to train?

1

I wonder how long it takes to train, and what hardware you used. Thank you! @yuangan

Sprinkle0

vox2 data processing

2

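Hello author, thank you very much for your work. Would it be possible to provide the poseimg data obtained after processing vox2? If so, I would be very grateful!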

Jiang-zhengyang

Can A2ET be trained with imgs + wavs?

5

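Hello author, I would like to reproduce the training of the A2ET model, but the vox2 dataset is too large and preprocessing would take a long time. I currently have cropped imgs from the vox2 data and the corresponding extracted wav audio. Can these be used to train the A2ET model with both loss modes?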

HustZhenhua

voxselect file

1

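![Image](https://github.com/user-attachments/assets/446a774f-8626-4117-8436-8381e51b5279) Hello, I would like to train your code on my own dataset. The dataset is already processed, but I still have a few questions: 1. If I use my own dataset, can this tar file be downloaded and used directly, or does it need to be replaced with a weight file for my own dataset? If it must be replaced, how do I generate it? 2. How should I replace this voxselect file?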

guo-king666
