wav2vec2_stt_python

provide API for returning output from intermediate layers

Open zhouyong64 opened this issue 4 years ago • 1 comment

It would be very helpful to have an API for returning the output of intermediate layers, for example the layer just before the final layers. This output could be used for speech tasks other than speech recognition.

zhouyong64 · Aug 16 '21 03:08

Interesting idea. This would be a bit complicated: I think we would need to build the TorchScript model differently to allow for it, in addition to modifying the library code. My plan was to keep this library fairly simple, but I am not entirely opposed to adding this. I may not get around to adding it myself, though. Leaving open.

daanzu · Aug 16 '21 13:08
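
To illustrate what "building the TorchScript model differently" could involve, here is a minimal sketch, not the library's actual export code. It assumes PyTorch is installed and uses a hypothetical `ToyEncoder` (with made-up layer sizes) as a stand-in for a wav2vec2-style model. The idea is that `forward` returns the per-layer outputs alongside the final logits, with a fixed return type that TorchScript can compile.

```python
from typing import List, Tuple

import torch
from torch import nn


class ToyEncoder(nn.Module):
    """Hypothetical stand-in for an encoder stack plus an output head.

    Not part of wav2vec2_stt_python; sizes are arbitrary.
    """

    def __init__(self, dim: int = 8, num_layers: int = 3, vocab: int = 5) -> None:
        super().__init__()
        self.layers = nn.ModuleList([nn.Linear(dim, dim) for _ in range(num_layers)])
        self.head = nn.Linear(dim, vocab)

    def forward(self, x: torch.Tensor) -> Tuple[torch.Tensor, List[torch.Tensor]]:
        # Collect every layer's output so callers can pick the one they need.
        hidden: List[torch.Tensor] = []
        for layer in self.layers:
            x = torch.relu(layer(x))
            hidden.append(x)
        # Final logits plus all intermediate outputs, in layer order.
        return self.head(x), hidden


# Compile to TorchScript; the scripted module keeps the tuple return type.
scripted = torch.jit.script(ToyEncoder().eval())

features = torch.randn(1, 20, 8)  # (batch, frames, dim)
with torch.no_grad():
    logits, hidden = scripted(features)

# hidden[-1] is the output of the last layer before the head.
penultimate = hidden[-1]
```

With an export like this, the library side would still need a way to hand `hidden` back to the caller, which is the second half of the work mentioned in the comment above.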