audacity icon indicating copy to clipboard operation
audacity copied to clipboard

Formant preservation in realtime pitch shifting

Open saintmatthieu opened this issue 2 years ago • 0 comments

Your idea

Improve quality of non-destructive pitch shifting with formant preservation. This requires algorithmic modifications in the time-and-pitch library.

Problem to be solved

This is particularly desired for the pitch shifting of voice, which otherwise gets a chipmunk-like quality already at moderate ratios.

Spectrum of an "O": image

Low-frequency-resolution spectrum of that same "O": image

Low-frequency-resolution spectrum of that "O" after pitch-shifted one octave up: image

Frequency resolution is on purpose set low so that harmonics do not stand out individually. Instead, what we see are "spectral envelopes".

Observe the resonances (areas of strong energy) in the original spectrum, e.g. that around 3.6kHz. After pitch shifting, these resonances are also shifted up, and now it is around 7kHz. Formant preservation aims at keeping these resonances intact, meanwhile still moving the harmonic series alongside the frequency axis.

Note that this could pave the way to offering a more flexible effect, where pitch and timbre can be shifted independently, as does Elastique Pitch: image

Prior art

No response

Additional context

No response

saintmatthieu avatar Jan 31 '24 09:01 saintmatthieu