Jiayu DU comments

Results 21 comments of


                                            Jiayu DU

[Chinese text normalization]PR for chinese TN part in text_normalization

@BuyuanCui @ekmb This PR continues https://github.com/NVIDIA/NeMo/issues/4543 & https://github.com/NVIDIA/NeMo/pull/4638 , ready for another round of reviews.

[Chinese text normalization]PR for chinese TN part in text_normalization

@mzxcpp please give clear signals here once you have addressed all issues from last review, so that we know when to move forward, and everyone won't get distracted by intermediate...

can't process the date-time

Thanks for the report. Current code doesn't cover these cases, no plan to add rules to handle them in near future. Feel free to open a PR to fix it.

Module not found 'sklearn.semi_supervised.label_propagation'

same here

How can i get the keep the number of 1-ngrams consistent with number of words contained in the vocab?

@sf9218 I guess you want to specify kenlm's vocabulary exactly as your vocabulary even though some of the words are not presented in your training text, this is a common...

Construction of LM model on the fly.

@JRMeyer This is a typical speech recognition feature. If I understand you correctly, basically you want to up-weight or down-weight a list of "phrases", which may be a brand name,...

Clean the original dataset that collected from different resources YouTube , Podcast, and Audiobook.

The pipeline was developed based on existing Kaldi scripts as you mentioned above, but with a lot of bug fixes and ad-hoc modifications. However we have no near plan to...

Can you provide "text_raw" information?

The dataset generation pipeline contains some steps that are not 100% reversible, so currently I'm afraid the answer is no.

Adding scoring scripts.

As Kaldi recipe develpment is converging, it's time to think about how we organize this text normalization as a post processing before WER calculation. The processing is pretty simple, containing:...

I just added a simple scoring tool via https://github.com/SpeechColab/GigaSpeech/pull/35 , it uses sclite to evaluate REF and HYP. Before evaluation, the tool applies very simple text processing that we discussed...