SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
First, thank you for sharing this code. I trained on the VCTK dataset but unfortunately did not get good results. These are the main problems I found: 1 The...
Hi, the project is missing any information about its license. What license applies to this code?
 "each quantizer uses a codebook of size N = 2r/Nq = 280/8 = 1024" but i find each quantizer uses a codebook is 1024x512, so N=1024x512? and 8x10x9=720bit, not...
Hi, first of all thank you for sharing the code! Having been working on this code for a while, I am wondering how to run the code - is the...
It is mentioned in the paper that:  In the code, there is no relevant content. Can you show it?
Hello, I've been following your project with great interest. I was wondering if you have any pre-trained models available for this project? Thank you for your time and effort in...
Hi, thanks for your great work, but I am confused about the length computation in the code below, https://github.com/wesbz/SoundStream/blob/e9dac26a7364656c5768cbc5dc38f41801d27831/main.py#L119 I wonder if the length should be: ``` lengths_s_x = torch.div(lengths_x, 256,...
Hi, this is great work. Are there any samples available right now?
I want to try out this model but could not find the NSynth dataset anywhere. The link on the [official website](https://magenta.tensorflow.org/datasets/nsynth) seems to be broken. Can anyone kindly share this...
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [1, 1, 1, 7]], which is output 0 of AsStridedBackward0, is at version...