DSAlign
DeepSpeech-based forced alignment tool
According to #6, word-level alignment (instead of utterance-level alignment) should be possible, but is not yet implemented. Are there any plans for phoneme-level alignment, so that DSAlign could be used...
On a Linux system, the `bin/lm-dependencies.sh` script fails. To reproduce the issue, only four commands are needed:
>> git clone https://github.com/mozilla/DSAlign.git
>> cd DSAlign/
>> bin/createenv.sh
>> bin/lm-dependencies.sh
lm-dependencies.sh compiles...
Quick and dirty #36 fix, but it smells a bit. The code currently is in an inconsistent state and seems to need refactoring, ideally into classes. This only makes it...
Could not generate example data with bin/gettestdata.sh. Could this be a problem with my network?
Thanks for supporting such an excellent tool for aligning sentences and audio! I have a question: is there a way to speed up the alignment process? Now I have...
I ran a small example, but it doesn't produce the align.json file. ```multiprocessing.pool.RemoteTraceback: """ Traceback (most recent call last): File "/usr/local/Cellar/python@3.8/3.8.5/Frameworks/Python.framework/Versions/3.8/lib/python3.8/multiprocessing/pool.py", line 125, in worker result = (True, func(*args, **kwds)) File "/Users/raisintoastllc/MachineLearning/Projects/SpeechSynthesis/DataProcessing/DSAlign/align/align.py",...
I've seen this in DS 0.6.x and the current DS 0.7.x fork I'm working on but I cannot figure out why it's happening. The first aligned audio segment contains "av...
Hello @tilmankamp, I'm using transcribe.py with the catalog file type to align my audio files, but it is generating duplicated transcriptions. **Audio type** Very long audio files with about 1h of...
Not only is this slow, this will break on GPUs because TensorFlow does not release its allocated memory. So if you try to run a catalog with multiple files on...
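One common way to work around a framework that holds on to GPU memory is to do the work for each file in a short-lived child process, so the operating system reclaims everything when that process exits. The following is a minimal sketch of that general pattern only, not DSAlign's actual code or the change proposed here: `CHILD_SCRIPT`, `process_in_fresh_process`, and `process_catalog` are hypothetical names, and the child does placeholder work instead of loading a model.

```python
import json
import subprocess
import sys

# Hypothetical stand-in for the per-file work. In a real setup the child
# would load the model and transcribe the file named in sys.argv[1];
# here it only echoes the path back as JSON.
CHILD_SCRIPT = r"""
import json, sys
audio_path = sys.argv[1]
print(json.dumps({"audio": audio_path, "status": "done"}))
"""


def process_in_fresh_process(audio_path):
    """Run the per-file work in a new interpreter and return its JSON result.

    Because the child exits after one file, any memory it allocated
    (including GPU memory) is released before the next file starts.
    """
    completed = subprocess.run(
        [sys.executable, "-c", CHILD_SCRIPT, audio_path],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the child fails
    )
    return json.loads(completed.stdout)


def process_catalog(audio_paths):
    """Process each catalog entry sequentially, one child process per file."""
    return [process_in_fresh_process(path) for path in audio_paths]


if __name__ == "__main__":
    for result in process_catalog(["first.wav", "second.wav"]):
        print(result)
```

The trade-off is that every file pays the process start-up and model loading cost again, so this pattern addresses the memory problem but not the slowness.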