flashlight icon indicating copy to clipboard operation
flashlight copied to clipboard

Performance Enhancement Opportunity

Open mtmd opened this issue 4 years ago • 2 comments

Feature Description

Flashlight ASR decoder, fl_asr_decode can benefit from reduced precision computing and batch processing. In addition, the positional embedding function appears to be a bottleneck for both training and decoding.

Use Case

Most models should benefit from some or all of the aforementioned performance optimization opportunities. The Transformer model is an example that can benefit from all.

mtmd avatar Aug 04 '21 03:08 mtmd

I have been running this for a week, seems to work fine for me. Transformer decoding speed ~x4, training speed on small 27m conformer about 5% faster.

joazoa avatar Sep 14 '21 07:09 joazoa

In the mean time i found an issue, LER comparison will include UNK tokens that were added for padding which will lead to unexpected results where the WER is 0 but the LER is not.

joazoa avatar Sep 16 '21 14:09 joazoa