DSAlign issues

Phoneme-level alignment

According to #6, word-level alignment (instead of utterance-level alignment) should be possible, but is not yet implemented. Are there any plans for phoneme-level alignment, so that DSAlign could be used...

DanielSWolf

TaskCluster Download Issue

On a linux system, the `bin/lm-dependencies.sh` script fails. To reproduce the issue only four commands are needed: >> git clone https://github.com/mozilla/DSAlign.git >> cd DSAlign/ >> bin/createenv.sh >> bin/lm-dependencies.sh lm-dependencies.sh compiles...

kaleko

Quick and dirty #36 fix

Quick and dirty #36 fix, but it smells a bit. The code currently is in an inconsistent state and seems to need refactoring, ideally into classes. This only makes it...

comodoro

Could not generate example data from bin/gettestdata.sh

Could not generate example data from bin/gettestdata.sh. Hope to know whether that's my network problem ?

OswaldoBornemann

Is there any way to speed the alignment process?

Thanks for supporting such an excellent tool to align sentence and audio! I have a question that is there is a way to speed the alignment process? Now i have...

SenYan1999

Alphabet is not defined despite being loaded.

1

Ran a small example it doesn't produce the align.json file. ```multiprocessing.pool.RemoteTraceback: """ Traceback (most recent call last): File "/usr/local/Cellar/[email protected]/3.8.5/Frameworks/Python.framework/Versions/3.8/lib/python3.8/multiprocessing/pool.py", line 125, in worker result = (True, func(*args, **kwds)) File "/Users/raisintoastllc/MachineLearning/Projects/SpeechSynthesis/DataProcessing/DSAlign/align/align.py",...

ArEnSc

Part of aligned text gets shifted to the next segment

1

I've seen this in DS 0.6.x and the current DS 0.7.x fork I'm working on but I cannot figure out why it's happening. The first aligned audio segment contains "av...

BoneGoat

Duplicating audio/text pairs

1

Hello @tilmankamp, I'm using the transcribe.py with the catalog filetype tool to align my audios, but it is generating duplicated transcriptions. **Audio type** Very long audios with about 1h of...

carlfm01

Do not load model if it's already loaded

1

Not only is this slow, this will break on GPUs because TensorFlow does not release its allocated memory. So if you try to run a catalog with multiple files on...

gibiansky

DSAlign
DSAlign copied to clipboard

Metadata

Phoneme-level alignment

TaskCluster Download Issue

Quick and dirty #36 fix

Could not generate example data from bin/gettestdata.sh

Is there any way to speed the alignment process?

Alphabet is not defined despite being loaded.

Part of aligned text gets shifted to the next segment

Duplicating audio/text pairs

Do not load model if it's already loaded

← Metadata

Owner

Metadata

DSAlign DSAlign copied to clipboard

Metadata

← Metadata

Owner

Metadata

DSAlign
DSAlign copied to clipboard