asusdisciple
I stumbled upon this thread after benchmarking insanely-fast-whisper, seamless-m4t-v2, faster-whisper, and the Hugging Face implementation of Whisper based on the Transformers pipeline with BetterTransformer. I found a bug related to this...
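For context, my benchmarks all used a harness of roughly this shape. The backend here is a stand-in lambda, not any of the actual model calls; in practice the `transcribe` argument would wrap e.g. faster-whisper's `WhisperModel.transcribe` or a Transformers ASR pipeline call:

```python
import time

def benchmark(name, transcribe, audio, runs=3):
    """Time a transcription backend over several runs (hypothetical harness)."""
    # Warm-up call so model loading / kernel compilation is not timed.
    transcribe(audio)
    start = time.perf_counter()
    for _ in range(runs):
        transcribe(audio)
    elapsed = (time.perf_counter() - start) / runs
    return {"backend": name, "mean_seconds": elapsed}

# Stand-in backend so the harness itself is runnable without any models.
result = benchmark("dummy", lambda audio: audio.upper(), "some audio")
print(result)
```

The warm-up run matters: the first call is often dominated by one-time setup and would otherwise skew the mean.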
Yeah, I found a high-level explanation, but to my knowledge it is still necessary to train all the models, because the codebook apparently does not exist anymore. I do...
Ah, I see, I found the repo, I think. I have access to a few A100s and will try to train and/or fine-tune the Tortoise model on my native...
Got the same error when trying to compute a confusion matrix in a callback, when I call `metric.plot()`:
```
def on_test_epoch_end(self) -> None:
    metric = MulticlassConfusionMatrix(num_classes=self.num_classes).to("cpu")
    outputs = torch.cat(self.x_test, dim=0).to("cpu")
    ...
```
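For anyone debugging this: what `MulticlassConfusionMatrix` computes is just counts of (true, predicted) class pairs, so it is easy to sanity-check the inputs against a dependency-free version before blaming the plot call. A minimal sketch (plain Python, not the torchmetrics API):

```python
def confusion_matrix(targets, preds, num_classes):
    """matrix[i][j] = number of samples with true class i predicted as class j."""
    m = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(targets, preds):
        m[t][p] += 1
    return m

# Diagonal entries are correct predictions; off-diagonal entries are confusions.
cm = confusion_matrix([0, 1, 1, 2], [0, 1, 2, 2], num_classes=3)
```

If this disagrees with what the metric reports, the problem is in how the predictions/targets are gathered across batches, not in the plotting.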
Oh, I see, thanks for the clarification. However, is there any way to set the target language to the source language for ASR? For example, when I do not want...
Also, if I may add: unfortunately, none of the MMS versions include all of the languages in m4t-v2. These languages are not supported by MMS, which makes it kinda hard...
Would also be interested.
I would like to know this as well. How can we set the target language to the source language in M4Tv2? For audio you often don't know the language. Is there...
Thanks for your fast answer! This makes sense in a way. If character length influences the result, my question would be: how does the model behave if the chunk is...
Not really; I use Ubuntu with an A100 and 80 GB of RAM.