TempoTokens
TempoTokens copied to clipboard
This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
When I run the training code on the landscape dataset, I encounter an error. How should I solve it? LoRA rank 16 is too large. setting to: 4 Traceback (most...
Hello, excellent work! In the training phase (line 99 in the dataset.py file), you set the audio clip length to n_frames/24. If n_samples is 24, then the audio clip length...
Hello @guyyariv and team, Congratulations on this amazing work—the paper looks incredible! 🎉 You should consider adding the model (inference at least) to the Hugging Face Hub 🤗. Doing so...