Panda-70M icon indicating copy to clipboard operation
Panda-70M copied to clipboard

About the caption the video with subtitles.

Open mutonix opened this issue 2 years ago • 1 comments

Great thanks to the great contribution of your work! I have some doubts about how you collect the subtitles. Do you directly download the subtitles from the youtube website or use some ASR models?

mutonix avatar Mar 30 '24 15:03 mutonix

Hi @mutonix, Thanks for your interest about this dataset! The subtitles are directly from youtube and we don't use another ASR model to get them. If you use the script in this repo to download the dataset, you can also get the youtube subtitles.

tsaishien-chen avatar Apr 01 '24 21:04 tsaishien-chen