You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
it thinks I'm saying "thank you" apparently when I talk it transcribes what I say but when I don't talk it transcribes endless "thank you"
import Speech_rec as sr
from RealtimeSTT import AudioToTextRecorder # has to be implemented in the main script
if __name__ == '__main__':
record = AudioToTextRecorder()
wakeup = AudioToTextRecorder(wake_words="jarvis")
listener = sr.listener(record, wakeup)
listener.run()
Use only one recorder. Creating two recorders is overkill because you load the transciption models 2x into your VRAM.
Leave out self.recorder.start(). Just use recorder.text(), it will detect when you start speaking. You get "thank you" etc. because recorder.start() initiates recording immediately, then everything you say and also what you NOT say gets transmitted to whisper. And for the parts where you say nothing whisper tends to hallucinate ("thank you" is a common whisper hallucination).
So you should call only the recorder.text() method, then voice activity will detect when to start recording and it will hallucinate way less. If you really want to call recorder.start() you want to make sure you start talking immediately after that.
it thinks I'm saying "thank you" apparently when I talk it transcribes what I say but when I don't talk it transcribes endless "thank you"
this is my whole code (oh it also wakes up on it's own)
The text was updated successfully, but these errors were encountered: