Zengwei Yao

[email protected]

Xiaomi Corporation Beijing

Results 25 comments of


                                            Zengwei Yao

Added zipformer ctc streaming support for librispeech at egs/librispeech/ASR/zipformer_ctc_streaming

Thanks for your contribution! I have some questions. It looks like the streaming zipformer code `zipformer.py` is copied from pruned7_steaming recipe, which should be a soft link instead. I am...

Streaming Zipformer2. extra words at the end.

Which decoding script did you use, `decode.py` or `streaming_decode.py`? I think the issue might be caused by the tail padding. You could try to reduce the tail padding length: https://github.com/k2-fsa/icefall/blob/b87ed26c09e9f5bb29174dd01f13670fb6124583/egs/librispeech/ASR/zipformer/decode.py#L439...

Streaming Zipformer2. extra words at the end.

You could also try setting `length_norm` to `False` for `modified_beam_search` when using `streaming_decode.py`, (just for debugging) https://github.com/k2-fsa/icefall/blob/b87ed26c09e9f5bb29174dd01f13670fb6124583/egs/librispeech/ASR/zipformer/decode_stream.py#L144

Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network

We did try using context-size of 3 or 4 (at least in Zipformer), but could not get improvements.

Question about replicating normalization: decode.py vs streaming_decode.py (streaming zipformer)

@AdolfVonKleist Did you get difference between `decode.py` and `streaming_decode.py` for a same audio?

Question about replicating normalization: decode.py vs streaming_decode.py (streaming zipformer)

I suggest to use the lastest recipe `zipformer` instead. In the old recipe `pruned_transducer_stateless7_streaming`, there might be some issues when doing the chunk-wise forward for the first chunks, since we...

Question about replicating normalization: decode.py vs streaming_decode.py (streaming zipformer)

> > > > I suggest to use the lastest recipe `zipformer` instead. In the old recipe `pruned_transducer_stateless7_streaming`, there might be some issues when doing the chunk-wise forward for the...

FYI: Zipformer paper is online available

> A minor comment: In Figure 3, the color of the dashed lines representing the E-branchformer and conformer is not easily distinguishable for people with color blindness. Thanks for your...

Update DoubleSwish

> could you also update https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/RESULTS-100hours.md OK. I will do that.

[WIP] RNN-T + MBR training.

Sure. I will have a look.

1
2
3
›