FullSubNet-plus icon indicating copy to clipboard operation
FullSubNet-plus copied to clipboard

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Results 18 FullSubNet-plus issues
Sort by recently updated
recently updated
newest added

Hey, first of all I'd like to thank you for this great model and for sharing it on github! a small bug i found: as we know, cIRM isn't bounded...

I tried to use your pre-trained checkpoints, data.pkl, to inference noisy signals, but found out that there is problem in torch.load() function, indicating failure to load the .pkl file. I...

If I want to train my own dataset, what should the structure of the dataset be, and should the names of the files in the clean and noise folders be...

I find a problem when training my model: soundfile.LibsndfileError: Error opening 'xx/xx/xx.wav': File contains data in an unknown format. I run this in Ubuntu I have tried many methods. What...

I follot the steps of the part Quick Start, the commands as follows: ![image](https://user-images.githubusercontent.com/48951434/197444325-051db2f6-e7e2-4565-9a4d-c267b603fb15.png) and the program runs with no bug, but the output is weird, there is only a...

I am trying to reproduce the FullSubNet+ on some speech enhancement datasets. The results are amazing, the noise suppression ability of this method is so good, and **very impressive**! :star_struck::star_struck::star_struck:...

Can we control the strength of enhancement of Fullsubnet by config this? ![image](https://user-images.githubusercontent.com/36092323/196156214-77728be3-f9a0-4cb6-959d-42785a1154d9.png)

你好呀,我在复现你的工作的时候发现,train.toml出错了,我发现是因为snr_range = [-5,20]和metrics = ["WB_PESQ", "NB_PESQ", "STOI", "SI_SDR"]这里有问题,把这两行注释掉就没问题。我在网上没找到解决方法,请问您遇到过这样的问题吗?期待您的回复

Hi, I encountered an error like that when training FullSubNet-Plus about 100 epoch on DNS dataset. What is the reason and how to solve it. error: File "/venv/py365/lib/python3.6/site-packages/librosa/util/utils.py", line 310,...

学长,我有个问题,num_groups_in_drop_band要是不为1,输出的掩膜维度的F不就变了嘛,后续还原语音的时候,就和原来的幅度谱大小对不上了