jacob.tang comments

Results: 5 comments by jacob.tang

input pcm buffer_size issue

Thanks! Can feed_audio accept 16 kHz stereo data? The raw data is 48 kHz stereo, and after I call decode_and_resample the audio data is 16 kHz stereo. In my realtime server, I cannot get recorder.text() in a...
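
For readers with the same question, here is a minimal sketch of downmixing interleaved stereo PCM to mono before feeding it on. It assumes the data is signed 16-bit little-endian and that the consumer expects mono input; neither is confirmed by the comment above. The helper name `stereo_to_mono_s16le` is hypothetical and not part of RealtimeSTT.

```python
import struct


def stereo_to_mono_s16le(pcm: bytes) -> bytes:
    """Average the left/right channels of interleaved 16-bit LE stereo PCM.

    Assumption: each frame is 4 bytes (2 bytes left + 2 bytes right).
    A trailing partial frame, if any, is dropped.
    """
    frame_bytes = 4
    usable = len(pcm) - (len(pcm) % frame_bytes)
    samples = struct.unpack(f"<{usable // 2}h", pcm[:usable])
    # Samples alternate L, R, L, R, ...; average each pair into one sample.
    mono = [
        (samples[i] + samples[i + 1]) // 2
        for i in range(0, len(samples), 2)
    ]
    return struct.pack(f"<{len(mono)}h", *mono)
```

Under those assumptions, the returned bytes hold half as many samples at the same sample rate, so a 16 kHz stereo chunk becomes a 16 kHz mono chunk that could then be passed on to the recorder.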

Use coqui engine play_async Invalid output device error

Thanks a lot. I tried modifying RealtimeTTS/stream_player.py, and that avoids the problem: #self.stream = self.pyaudio_instance.open(format=pyFormat, channels=pyChannels, rate=pySampleRate, output=True) Another question: with the coqui engine, the callback data size is 512, but openai engine...

Use coqui engine play_async Invalid output device error

I haven't tested the Coqui engine before. Does it need to run in a GPU environment to ensure real-time performance for speech data? Currently, using callback chunk data as...

The accuracy issue of real-time Speech-to-Text (STT) transcription

Thanks! Should I use version 0.6.0 of faster-whisper instead of the latest, [v1.0.1](https://github.com/SYSTRAN/faster-whisper/releases/tag/v1.0.1)? Or should I just update to the latest faster-whisper / RealtimeSTT version?

The accuracy issue of real-time Speech-to-Text (STT) transcription

Great! Another question: the latest RealtimeSTT release, v0.1.15, has the parameter beam_size. Can it be used to reduce the delay?