Shayne Mei comments

Results 37 comments of


                                            Shayne Mei

Support for VAD / endpointing

Counting silence frames sounds like a good idea. Maybe we can make it more robust by adding a state machine with two states (one for silence, one for non-silence). Where...

Support for VAD / endpointing

Thanks for the suggestion. I have two other questions regarding using the endpoint information: 1. What is the plan for sending endpoint to the client? e.g. how will it be...

Return alignment info

this seems to be obtaining the alignment in batches. Is it possible to obtain this alignment info in real-time while streaming?

Return alignment info

@csukuangfj Just following up on this question: is it possible to obtain this alignment info in real-time, i.e. get alignment for each decoded segment while streaming?

Return alignment info

Preferably in seconds. Either relative to the start of the segment or utterance should be fine

Performance gap between icefall local streaming decoding and sherpa streaming decoding

local batch decoding command: ```bash ./pruned_transducer_stateless5/decode.py \ --epoch 4 \ --avg 1 \ --simulate-streaming False \ --causal-convolution True \ --use-averaged-model False ``` local streaming decoding command: ```bash ./pruned_transducer_stateless5/decode.py \ --epoch...