visually-grounded-speech topic
List
visually-grounded-speech repositories
SpeechCLIP
108
Stars
6
Forks
Watchers
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022