spleeter icon indicating copy to clipboard operation
spleeter copied to clipboard

[Discussion] Why is it useful to train just masks of stft?

Open justStarG opened this issue 4 years ago • 1 comments

Spleeter trains many masks of stft to split songs. But why does it work? Is is possible to get a better model if I just input stft feature and output stft of each instrument?

justStarG avatar Jun 18 '21 05:06 justStarG

If you're referring to this fn: https://github.com/deezer/spleeter/blob/9629dcca9f846efbd32699d40201b97017211957/spleeter/utils/tensor.py#L117

Zero-padding / Masking just helps make the plot easier to visualize.

It does not add extra information in the F or T domain. Instead, there are more F bins, in tighter clusters in the F domain.

and it creates a cleaner Sinc interpolation.

ashbeats avatar Aug 19 '21 20:08 ashbeats