audio-diffusion

teticio

Metadata

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Issues

Results: 3 audio-diffusion issues

Autoregressive models for Audio generation

Do you have any future plans for training flow-based diffusion transformer models like [Make-An-Audio 3](https://github.com/Text-to-Audio/Make-An-Audio-3) or simple VALL-E type models?

darshanmakwana412

expand the dataset

I would like to use this project to generate audio data similar to the training samples, in order to expand the dataset. The data is divided into 5 s segments...

hahahahaha20250325
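
For readers following the issue above, here is a minimal sketch of cutting a waveform into fixed 5 s segments before training. It is not taken from the audio-diffusion code base: the function name `split_into_segments`, the `drop_last` flag, and the 22050 Hz sample rate in the usage note are all illustrative assumptions.

```python
from typing import List, Sequence


def split_into_segments(
    samples: Sequence[float],
    sample_rate: int,
    segment_seconds: float = 5.0,
    drop_last: bool = True,
) -> List[Sequence[float]]:
    """Cut a mono waveform into consecutive fixed-length segments.

    Hypothetical helper for illustration only; not part of audio-diffusion.
    """
    # Number of samples in one segment.
    seg_len = int(round(sample_rate * segment_seconds))
    if seg_len <= 0:
        raise ValueError("segment length must be at least one sample")

    # Slice the waveform at multiples of the segment length.
    segments = [samples[i:i + seg_len] for i in range(0, len(samples), seg_len)]

    # Optionally discard a trailing segment shorter than the requested length.
    if drop_last and segments and len(segments[-1]) < seg_len:
        segments.pop()
    return segments
```

With an assumed sample rate of 22050 Hz, a 12 s recording yields two full 5 s segments, plus a 2 s remainder if `drop_last=False`. Dropping the short tail keeps every training sample the same length.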

Issues Training VAE

I am trying to train an autoencoder for latent diffusion. I run: `python scripts/train_vae.py --dataset_name data/physio22/mel_res_64 --batch_size 1 --gradient_accumulation_steps 1 --hop_length 1024 --max_epochs 5` and get the following error...

Adrian-Makcimus

About

Topics: huggingface, music-generation, diffusion-models, latent-diffusion

Stars: 662
Forks: 69
Owner: teticio
