Ning
Will land a fix soon. As a workaround for now, you can comment out the latent_var-related imports in train.py.
Sorry, the README is out of date. We already have the BeamSearch class fully scripted in ensemble_export.py. Also, the PyTorch->ONNX->Caffe2 export path mentioned in the README is no longer supported. We are switching to TorchScript...
Hey @barinov274 - Unfortunately, it's not trivial to get ASR timestamps from our model. Since it is shared with translation tasks, the decoding process is not "monotonic" as in other ASR approaches (e.g....
Hey @Azam2107 - We suggest cutting the audio to
For English ASR, that could be a general quality issue that we are investigating.
Could you compare your implementation with the HF demo around https://huggingface.co/spaces/facebook/seamless_m4t/blob/main/app.py#L78? Note that resampling might be needed.
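To illustrate the resampling point, here is a minimal, dependency-free sketch of converting a mono signal from one sample rate to another. The helper name `resample_linear` and the 16 kHz target are assumptions for illustration, not taken from the demo code, so check the linked file for the rate it actually uses. Plain linear interpolation has no anti-aliasing filter; in a real pipeline a proper resampler such as `torchaudio.functional.resample` is the better choice.

```python
TARGET_RATE = 16_000  # assumption: sample rate the model expects


def resample_linear(samples, orig_rate, target_rate=TARGET_RATE):
    """Resample a mono signal by linear interpolation (illustrative only)."""
    if orig_rate <= 0 or target_rate <= 0:
        raise ValueError("sample rates must be positive")
    # Nothing to do if the rates match or there is nothing to interpolate.
    if orig_rate == target_rate or len(samples) < 2:
        return list(samples)

    n_out = max(1, round(len(samples) * target_rate / orig_rate))
    step = orig_rate / target_rate  # input samples advanced per output sample
    last = len(samples) - 1

    out = []
    for i in range(n_out):
        pos = i * step
        lo = min(int(pos), last)   # left neighbour, clamped to the signal
        hi = min(lo + 1, last)     # right neighbour, clamped to the signal
        frac = pos - lo if lo < last else 0.0
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out
```

For example, `resample_linear(audio, 48_000)` turns one second of 48 kHz audio into 16,000 samples. If the input already matches the target rate, the samples are returned unchanged.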
Thanks @ggerganov for the detailed comments! Regarding 1 and 2, that makes sense; I will keep unity.cpp under seamless_communication for now. So keeping the KNF library there would be easier, without bloating ggml...