gandolfxu
gandolfxu
Is the following example right? ``` import librosa import numpy as np import torch import torchaudio from torch.utils.data import Dataset from tqdm import tqdm from loguru import logger from audio_diffusion_pytorch...
Can you share the source code of how to preprocess the data?
I meets the same problem. Have you resolved it?
I run successfully following this instruction. Thanks!
@trestad Another mistake of traing. fairseq-train data/lmd_processed/ \ --arch transformer_lm --task language_modeling \ --decoder-attention-heads 4 --decoder-embed-dim 256 \ --decoder-input-dim 256 --decoder-output-dim 256 \ --decoder-layers 4 --update-freq 1 --optimizer adam \...
train: 21332/108590 (19.64%) midi files successfully processed valid: 93/1109 (8.39%) midi files successfully processed test: 74/1114 (6.64%) midi files successfully processed I find that the number of midi data for...
@Stardust-minus 预计什么时候出正式版?
> 我训得也不快啊,如果调参的话,可以加载没调的部分网络参数来加速训练 大概3000句,3个小时,使用两款V100训练,使用率大概30%。前面几次Epoch的log如下: 2023-06-19 11:38:06,489 oprah_ms_istft_vits INFO ====> Epoch: 1 2023-06-19 11:39:16,343 oprah_ms_istft_vits INFO ====> Epoch: 2 2023-06-19 11:40:11,469 oprah_ms_istft_vits INFO ====> Epoch: 3 2023-06-19 11:41:02,040 oprah_ms_istft_vits INFO ====> Epoch:...
@deepanwayx No caption is available for our music data. How should I prepare the dataset?