check-777
check-777
好的,谢谢
> 感谢指正,之前传上来的代码版本有问题,现在修正过来了 spk_pred = self.timbre_predictor(timbre)[0] 这个地方应该去掉[0],要不和标签的维度对不上
还有一个地方有些疑问,在meldatasets处理数据的时候, `to_mel = torchaudio.transforms.MelSpectrogram( n_mels=MEL_PARAMS['n_mels'], **SPECT_PARAMS) mean, std = -4, 4 def preprocess(wave): # wave = wave.unsqueeze(0) wave_tensor = torch.from_numpy(wave).float() mel_tensor = to_mel(wave_tensor) mel_tensor = (torch.log(1e-5 + mel_tensor.unsqueeze(0)) - mean)...