Ning

7 comments by Ning

Will land a fix soon. For now, you can comment out the latent_var-related imports in train.py as a workaround.

Sorry, the README is out of date. We already have the BeamSearch class fully scripted in ensemble_export.py. Also, the PyTorch->ONNX->Caffe2 export path mentioned in the README is no longer supported. We are switching to TorchScript...

Hey @barinov274 - Unfortunately, it's not trivial to get ASR timestamps from our model. Since the model is shared with translation tasks, the decoding process is not "monotonic" like other ASR approaches (e.g....

Hey @Azam2107 - We suggest cutting the audio to

For English ASR, that could be a general quality issue that we are investigating.

Could you compare your implementation with the HF demo around https://huggingface.co/spaces/facebook/seamless_m4t/blob/main/app.py#L78? Note that resampling might be needed.
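
To illustrate the resampling note, here is a minimal sketch of bringing a mono signal to a different sample rate. It is not taken from the demo: the `resample_linear` helper is hypothetical, the 16 kHz target is an assumption that should be checked against the demo code, and plain linear interpolation (with no anti-aliasing filter) is used only to keep the example dependency-free.

```python
def resample_linear(samples, orig_rate, target_rate):
    """Resample a mono signal by linear interpolation (no anti-alias filter).

    Hypothetical helper for illustration only.
    """
    if orig_rate <= 0 or target_rate <= 0:
        raise ValueError("sample rates must be positive")
    if orig_rate == target_rate or len(samples) < 2:
        return list(samples)

    # Number of output samples covering the same duration as the input.
    n_out = max(1, round(len(samples) * target_rate / orig_rate))
    step = orig_rate / target_rate  # input samples advanced per output sample
    last = len(samples) - 1

    out = []
    for i in range(n_out):
        pos = min(i * step, last)   # fractional position in the input
        lo = int(pos)               # sample at or before that position
        hi = min(lo + 1, last)      # next sample, clamped at the end
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


TARGET_RATE = 16_000  # assumed model input rate; verify against the demo

one_second_at_48k = [0.0] * 48_000
resampled = resample_linear(one_second_at_48k, 48_000, TARGET_RATE)
print(len(resampled))  # number of samples after resampling
```

For real audio, a filtered resampler such as `torchaudio.functional.resample(waveform, orig_freq, new_freq)` is likely the better choice, since linear interpolation can introduce aliasing when downsampling.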

Thanks @ggerganov for the detailed comments! Regarding 1 and 2, that makes sense; I will keep unity.cpp under seamless_communication for now. So keeping the KNF library there would be easier without bloating ggml...