syncnet_trainer
Disentangled Speech Embeddings using Cross-Modal Self-Supervision
Hi, I'm a little confused about the meaning of "offset" in the txt file. Could anyone please explain what it means? Thank you.
https://www.robots.ox.ac.uk/~vgg/software/lipsync/data/voxsrc2020_baseline.model
Hi, I'd like to know how I can add the disentanglement loss to the training process, so as to see the true effect of disentangling. It seems adding the disentanglement loss into...
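As a point of reference for this question, below is a minimal sketch of one common way to express a disentanglement ("confusion") term: cross-entropy between the identity classifier's prediction on the content embedding and a uniform distribution. The function name, the loss weight `lam`, and the way it is combined with the sync and identity losses are assumptions for illustration, not necessarily how this repo implements it.

```python
import torch
import torch.nn.functional as F

def confusion_loss(identity_logits):
    """Cross-entropy against a uniform target over the identity classes.

    Minimised when the identity classifier is maximally confused, i.e. the
    content embedding carries no speaker information.
    """
    log_probs = F.log_softmax(identity_logits, dim=1)
    return -log_probs.mean()

# Hypothetical combined objective (names and weight are placeholders):
# total_loss = sync_loss + identity_loss + lam * confusion_loss(content_id_logits)
logits = torch.randn(8, 100)  # batch of 8, 100 speaker classes
print(confusion_loss(logits).item())
```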
Hi, I am looking through this repo and I am confused about the choice of loss function. I am using SyncNet to measure lip-sync error, and considering that this...
Hi joonson, 1. When I run python makeFileList.py, files are always skipped with "audio and video lengths different". I extract the wav from the mp4 with ffmpeg; how do you get the wav from the m4a or from...
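For reference, here is one way to extract audio so that it matches the video clip; the exact paths, sample rate, and channel count are assumptions, not necessarily what makeFileList.py expects.

```python
import subprocess

# Hypothetical preprocessing step: extract 16 kHz mono wav from an .m4a
# (or directly from the .mp4) using ffmpeg; filenames are placeholders.
subprocess.run(
    ["ffmpeg", "-y", "-i", "clip.m4a",
     "-vn",              # drop any video stream
     "-ac", "1",         # mono
     "-ar", "16000",     # 16 kHz sample rate
     "clip.wav"],
    check=True,
)
```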
It is 00048.txt instead. Is there anything wrong with the dataset?
I am trying the repo for the first time. While preparing the data, I find that we need the text annotations for the VoxCeleb files, but the [dataset](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox2.html)...
Where are the negative audio samples generated for the M-way matching problem? I just see that the load_wav function samples the audio corresponding to the starting frame index of the video. I only...
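For context, a common SyncNet-style way to form the M-way candidates is to take one audio window aligned with the video and M-1 temporally shifted windows as negatives. The sketch below illustrates that idea only; the function name, window units, and shift range are assumptions and may not match how this repo builds its batches.

```python
import random

def sample_audio_starts(num_frames, pos_start, win_len, n_way=5, max_shift=15):
    """Hypothetical N-way candidate sampler.

    Index 0 is the audio window aligned with the video (positive); the rest
    are temporally shifted windows that act as negatives for the matching loss.
    Assumes enough frames exist around pos_start to draw n_way - 1 shifts.
    """
    starts = [pos_start]
    while len(starts) < n_way:
        shift = random.randint(-max_shift, max_shift)
        cand = pos_start + shift
        if shift != 0 and 0 <= cand <= num_frames - win_len:
            starts.append(cand)
    return starts
```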
Hello, I have a couple of questions regarding the 75.8% synchronization accuracy (perfect match) reported in https://ieeexplore.ieee.org/abstract/document/9067055/. Evaluation protocol: the task is to determine the correct synchronisation within a ±15...
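To make the question concrete, here is one plausible reading of a ±15-frame search: slide the audio features against the video features, pick the offset with the smallest mean distance, and count the clip as correct when that offset matches the ground truth. This is a sketch of my understanding, not the exact evaluation code used for the reported number.

```python
import numpy as np

def predict_offset(video_feats, audio_feats, max_offset=15):
    """Hypothetical ±15-frame search: return the offset whose mean L2
    distance between paired video/audio features is smallest."""
    best_offset, best_dist = 0, np.inf
    for off in range(-max_offset, max_offset + 1):
        dists = [np.linalg.norm(v - audio_feats[t + off])
                 for t, v in enumerate(video_feats)
                 if 0 <= t + off < len(audio_feats)]
        if dists and np.mean(dists) < best_dist:
            best_offset, best_dist = off, float(np.mean(dists))
    return best_offset

# accuracy = fraction of test clips where predict_offset(...) equals the true offset
```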