BindDiffusion
BindDiffusion copied to clipboard
Reverse: image-guided audio generation?
Hi there! I really like this repo-- thank you for creating it. How would I attempt the reverse process? i.e. creating an audio snippet based on an image?