SoundNet-tensorflow issues

The 8 layer model link gives 403 error.

1

It seems I don't have permission to open the link page. May you upload it again and give the access to download it? Please.

AatroxMercer

audio length

Thanks for your hard work on audio embedding. when I extract feature on my own dataset, I am wondering if the pretrained model can only accept the audio of which...

gancx

numpy tile parameter bug

Line 59 in util.py ` raw_audio = np.tile(raw_audio, length/raw_audio.shape[0] + 1) ` Should be ` raw_audio = np.tile(raw_audio, length//raw_audio.shape[0] + 1) `

Khaled1337

Extracting features in pool5

I have read in the paper that the best layer for feature extraction is 'pool5'. However, the feature sizes in that layer are h x w x 256. Any idea...

janaal1

Thanks for your efforts! The kl loss is implemented as: tf.reduce_mean(-tf.nn.softmax_cross_entropy_with_logits(logits=dist_a, labels=dist_b)) I wonder whether there should be a negative indicator. The logits and labels definitions seem to be different...

Zoeyki