SoundNet-tensorflow
SoundNet-tensorflow copied to clipboard
TensorFlow implementation of "SoundNet".
It seems I don't have permission to open the link page. May you upload it again and give the access to download it? Please.
Thanks for your hard work on audio embedding. when I extract feature on my own dataset, I am wondering if the pretrained model can only accept the audio of which...
Line 59 in util.py ` raw_audio = np.tile(raw_audio, length/raw_audio.shape[0] + 1) ` Should be ` raw_audio = np.tile(raw_audio, length//raw_audio.shape[0] + 1) `
I have read in the paper that the best layer for feature extraction is 'pool5'. However, the feature sizes in that layer are h x w x 256. Any idea...
Thanks for your efforts! The kl loss is implemented as: tf.reduce_mean(-tf.nn.softmax_cross_entropy_with_logits(logits=dist_a, labels=dist_b)) I wonder whether there should be a negative indicator. The logits and labels definitions seem to be different...
When I run torchfile.load('./soundnet8_final.t7') on pycharm, I got this error, do you have any idea about it? Thank you~
The published soundnet5.t7 has `cudnn.SpatialBatchNormalization` layer, not `nn.SpatialBatchNormalization`. Add a condition to handle this.