visually-grounded-speech topic

List visually-grounded-speech repositories

SpeechCLIP

108
Stars
6
Forks
Watchers

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022