jacob.tang
jacob.tang
Thanks! Can feed_audio with 16k stereo data?the raw data is 48k,stereo data,when i call decode_and_resample,the audio data is 16k stereo. In my realtime server,I can not get recorder.text() in a...
Thanks a lot. I try to modify RealtimeTTS/stream_player.py, can avoid this problem. #self.stream = self.pyaudio_instance.open(format=pyFormat, channels=pyChannels, rate=pySampleRate, output=True) Another question,use the coqui engine,the callback data size is 512,but openai engine...
I haven't tested the Coqui engine before. Does it need to run on a GPU environment to ensure the real-time performance of speech data? Currently, using callback chunk data as...
Thanks! Should I use the 0.6.0 version of the faster-whisper instead of the latetest [v1.0.1]?(https://github.com/SYSTRAN/faster-whisper/releases/tag/v1.0.1) Or just update the latest faster-whisper / RealtimeSTT version?
great! Another question is the latest v0.1.15 of RealtimeSTT has the parameter beam_size, it can be use to reduce the delay?