MelNet issues

config parameter

Hi, I wanted to train the MelNet with my own dataset. There are some audio setting that I still not understand since I'm very new to this signal processing/speech field....

vinson2233

RuntimeError: Error(s) in loading state_dict for DataParallel

I have the following problem when doing inference :( ```Traceback (most recent call last): File "inference.py", line 36, in model.load_tiers() File "/content/MelNet/model/model.py", line 96, in load_tiers self.tiers[idx+1].load_state_dict(checkpoint['model']) File "/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py", line...

ElJaviLuki

Training Pipeline + Steps for training TTS

Hi, Thanks for this clean and great implementation for MelNet. I'm a beginner in Speech Synthesis so kindly guide me through the steps for training MelNet for TTS: What I...

m-hamza-mughal

Can you please provide model weights for inference?

harveenchadha

where is the Multiscale Modelling？

I read the code，and don`t see Multiscale Modelling and Conditioning……

shazhongcheng

inference. py: error: argument -t/--timestep: invalid int value

I have a problem with fiiguring out whats a valid timestep number because there are like no information about it. A general guide on what all the different variable means...

frosta95

MelNet cannot work as an independent vocoder currently, right?

> Both unconditional generation and conditional generation currently does not support primed generation (extrapolating from provided data). It seems we cannot feed mel spectrograms directly to output wav files just...

begeekmyfriend

Yhlee

1. epsilon을 std와 distrib 둘다 더해주었습니다. 2. model.py에서 t, f방향으로 나눠주는 방법이 제가 이해한 바랑 살짝 다른 거 같아요. self.f_div = f_div[hp.n_tiers], self.t_div = t_div[hp.n_tiers]를 self.f_div = f_div[hp.n_tiers+1], self.t_div =...

LEEYOONHYUNG

initial

bucket_by_sequence_length는 제 생각대로 구현은 해봤는데 wav 파일들을 전부 로드해서 한번 길이를 구하는 과정도 너무 오래 걸리네요. 수정이 필요해 보입니다.

LEEYOONHYUNG

Bump torch from 1.2.0 to 2.2.0

Bumps [torch](https://github.com/pytorch/pytorch) from 1.2.0 to 2.2.0. Release notes Sourced from torch's releases. PyTorch 2.2: FlashAttention-v2, AOTInductor PyTorch 2.2 Release Notes Highlights Backwards Incompatible Changes Deprecations New Features Improvements Bug fixes...

dependabot[bot]

dependencies

MelNet
MelNet copied to clipboard

Metadata

config parameter

RuntimeError: Error(s) in loading state_dict for DataParallel

Training Pipeline + Steps for training TTS

Can you please provide model weights for inference?

where is the Multiscale Modelling？

inference. py: error: argument -t/--timestep: invalid int value

MelNet cannot work as an independent vocoder currently, right?

Yhlee

initial

Bump torch from 1.2.0 to 2.2.0

← Metadata

Owner

Metadata

MelNet MelNet copied to clipboard

Metadata

← Metadata

Owner

Metadata

MelNet
MelNet copied to clipboard