
config parameter

Open vinson2233 opened this issue 4 years ago • 0 comments

Hi, I want to train MelNet on my own dataset.
There are some audio settings that I still don't understand, since I'm very new to the signal processing/speech field. Can someone explain them, or point me to a reference that explains what these settings mean?

audio:
  sr: 16000
  duration: 6.0
  n_mels: 180
  hop_length: 180
  win_length: 1080
  n_fft: 1080
  num_freq: 541
  ref_level_db: 20.0
  min_level_db: -80.0
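
For context, here is a minimal sketch of how these numbers appear to relate to each other arithmetically. It is only a reading of the values above, not taken from the MelNet code. The helper names `derived_audio_values` and `normalize_db` are hypothetical, the frame count assumes a centered STFT, and the dB normalization is an assumed convention that this repo may or may not follow.

```python
import math

# Values copied from the config above.
AUDIO = {
    "sr": 16000, "duration": 6.0, "n_mels": 180, "hop_length": 180,
    "win_length": 1080, "n_fft": 1080, "num_freq": 541,
    "ref_level_db": 20.0, "min_level_db": -80.0,
}

def derived_audio_values(cfg):
    """Quantities that follow from the config by plain arithmetic."""
    num_samples = int(cfg["sr"] * cfg["duration"])
    return {
        # A real-valued FFT of size n_fft yields n_fft // 2 + 1 frequency bins.
        "num_freq": cfg["n_fft"] // 2 + 1,
        # Step between consecutive frames, in milliseconds.
        "hop_ms": 1000.0 * cfg["hop_length"] / cfg["sr"],
        # Length of each analysis window, in milliseconds.
        "win_ms": 1000.0 * cfg["win_length"] / cfg["sr"],
        # Samples in one clip of `duration` seconds.
        "num_samples": num_samples,
        # ASSUMPTION: centered STFT, which gives 1 + num_samples // hop frames.
        "num_frames": 1 + num_samples // cfg["hop_length"],
    }

def normalize_db(amplitude, ref_level_db=20.0, min_level_db=-80.0):
    """ASSUMED convention: convert to dB, subtract the reference level,
    then map [min_level_db, 0] dB onto [0, 1] with clipping."""
    db = 20.0 * math.log10(max(amplitude, 1e-5)) - ref_level_db
    return min(max((db - min_level_db) / -min_level_db, 0.0), 1.0)

values = derived_audio_values(AUDIO)
print(values["num_freq"])     # 541, matches num_freq in the config
print(values["hop_ms"])       # 11.25
print(values["win_ms"])       # 67.5
print(values["num_samples"])  # 96000
```

Under this reading, `num_freq` is determined by `n_fft` rather than being a free choice, while `n_mels` is the number of mel bands the 541 linear-frequency bins are pooled into.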

Thanks in advance

vinson2233, Oct 27 '21 04:10