MelNet

Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"

Results 10 MelNet issues

Hi, I wanted to train MelNet with my own dataset. There are some audio settings that I still don't understand, since I'm very new to the signal processing/speech field....

I have the following problem when doing inference :(
```
Traceback (most recent call last):
  File "inference.py", line 36, in <module>
    model.load_tiers()
  File "/content/MelNet/model/model.py", line 96, in load_tiers
    self.tiers[idx+1].load_state_dict(checkpoint['model'])
  File "/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py", line...
```

Hi, thanks for this clean and great implementation of MelNet. I'm a beginner in speech synthesis, so kindly guide me through the steps for training MelNet for TTS: What I...

I read the code, and I don't see Multiscale Modelling and Conditioning...

I have a problem figuring out what a valid timestep number is, because there is almost no information about it. A general guide on what all the different variables mean...

> Both unconditional generation and conditional generation currently does not support primed generation (extrapolating from provided data). It seems we cannot feed mel spectrograms directly to output wav files just...

1. I added epsilon to both std and distrib. 2. The way model.py splits along the t and f directions seems slightly different from how I understood it. self.f_div = f_div[hp.n_tiers], self.t_div = t_div[hp.n_tiers] to self.f_div = f_div[hp.n_tiers+1], self.t_div =...

I implemented bucket_by_sequence_length the way I thought it should work, but loading every wav file just to measure its length once takes far too long. This looks like it needs to be revised.
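One possible way to avoid decoding every file is to read only the WAV header. The sketch below is not the repository's `bucket_by_sequence_length`; the helper names `wav_duration_seconds` and `bucket_by_length` and the `bucket_width` parameter are hypothetical, and it assumes plain PCM WAV files that Python's standard `wave` module can open.

```python
import wave
from collections import defaultdict


def wav_duration_seconds(path):
    """Return the duration in seconds, read from the WAV header only.

    Assumes a plain PCM WAV file; the audio samples are never decoded.
    """
    with wave.open(path, "rb") as wf:
        return wf.getnframes() / wf.getframerate()


def bucket_by_length(paths, bucket_width=1.0):
    """Group file paths by duration (hypothetical helper).

    Files whose durations fall in the same `bucket_width`-second
    interval end up in the same bucket.
    """
    buckets = defaultdict(list)
    for path in paths:
        index = int(wav_duration_seconds(path) // bucket_width)
        buckets[index].append(path)
    return dict(buckets)
```

Caching the measured durations (for example in a small metadata file) would also mean the scan only has to run once per dataset.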

Bumps [torch](https://github.com/pytorch/pytorch) from 1.2.0 to 2.2.0. Release notes Sourced from torch's releases. PyTorch 2.2: FlashAttention-v2, AOTInductor PyTorch 2.2 Release Notes Highlights Backwards Incompatible Changes Deprecations New Features Improvements Bug fixes...

dependencies