SoundStream issues

Problems about this project.

6

Firstly, thank you for sharing this code. And i trained with vctk data set. But unfortunately, i didn't get good result. These are the main problems i found: 1 The...

linan06kuaishou

License?

Hi, The project is missing any kind of information on the license. What license applies to this code?

fumminger

![image](https://github.com/wesbz/SoundStream/assets/28752526/4ec71a6d-1deb-43d0-ad76-a18f477ebcd4) "each quantizer uses a codebook of size N = 2r/Nq = 280/8 = 1024" but i find each quantizer uses a codebook is 1024x512, so N=1024x512? and 8x10x9=720bit, not...

a897456

Is the code runnable without changing parameters?

10

Hi, first of all thank you for sharing the code! Having been working on this code for a while, I am wondering how to run the code - is the...

asdf2adsfad

Denoise and Enhancement

1

Mentioned in the paper that: ![image](https://github.com/wesbz/SoundStream/assets/28752526/cc3d85fe-0983-4883-9e7a-921a87274f1d) In the code, there is no relevant content, can you show it

a897456

Inquiry about Pre-trained Models

Hello, I've been following your project with great interest. I was wondering if you have any pre-trained models available for this project? Thank you for your time and effort in...

UkiTenzai

question about the length

hi, thanks for your great work, but I have some confuses with the length of below code, https://github.com/wesbz/SoundStream/blob/e9dac26a7364656c5768cbc5dc38f41801d27831/main.py#L119 I wonder if the length should be: ``` lengths_s_x = torch.div(lengths_x, 256,...

lonelyJOJO-s

Is here any samples?

4

Hi, this is a great works, and is there any samples right now?

xuerq

Problem downloading dataset

1

i want to try out this model but could not find the Nsynth dataset anywhere. the link on the [official website](https://magenta.tensorflow.org/datasets/nsynth) seems to be broken. can anyone kindly share this...

thivux

gradient computation has been modified by an inplace operation

15

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [1, 1, 1, 7]], which is output 0 of AsStridedBackward0, is at version...

MasterEndless

SoundStream
SoundStream copied to clipboard

Metadata

Problems about this project.

License?

Question about bit rate

Is the code runnable without changing parameters?

Denoise and Enhancement

Inquiry about Pre-trained Models

question about the length

Is here any samples?

Problem downloading dataset

gradient computation has been modified by an inplace operation

← Metadata

Owner

Metadata

SoundStream SoundStream copied to clipboard

Metadata

← Metadata

Owner

Metadata

SoundStream
SoundStream copied to clipboard