Add speech input support

Open NullLabTests opened this issue 11 months ago • 1 comments

This PR adds speech input support to the Goose CLI using Vosk for offline speech recognition.

Features:

  • Toggle voice input mode with /voice command
  • Start/stop recording with Ctrl+V
  • Uses Vosk for offline speech recognition
  • Cross-platform audio capture with cpal
  • Seamless integration with existing input system

Testing:

  • Tested on Linux
  • Voice commands are processed as regular text input
  • Works with all existing commands
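The toggle and recording behaviour described above could be modelled as a small state machine. This is only a sketch and not the code from this PR: it makes no cpal or Vosk calls, and every name in it (`InputEvent`, `Action`, `VoiceInput`, `handle`) is hypothetical. It assumes that Ctrl+V does nothing outside voice mode and that a finished transcript is handed to the same path as typed text.

```rust
/// Events the CLI input loop might receive (hypothetical names).
#[derive(Debug, Clone, PartialEq)]
pub enum InputEvent {
    /// A line typed by the user, e.g. "/voice" or an ordinary prompt.
    Line(String),
    /// The Ctrl+V key chord.
    CtrlV,
    /// Text returned by the speech recognizer once a recording ends.
    Transcript(String),
}

/// What the input loop should do in response (hypothetical names).
#[derive(Debug, Clone, PartialEq)]
pub enum Action {
    /// Forward text to the existing input handling.
    Submit(String),
    /// Begin capturing audio.
    StartRecording,
    /// Stop capturing audio and run recognition.
    StopRecording,
    /// Nothing to do.
    None,
}

/// Tracks whether voice mode is on and whether a recording is in progress.
#[derive(Debug, Default)]
pub struct VoiceInput {
    voice_mode: bool,
    recording: bool,
}

impl VoiceInput {
    pub fn new() -> Self {
        Self::default()
    }

    pub fn is_voice_mode(&self) -> bool {
        self.voice_mode
    }

    pub fn is_recording(&self) -> bool {
        self.recording
    }

    /// Map one input event to an action, updating internal state.
    pub fn handle(&mut self, event: InputEvent) -> Action {
        match event {
            // "/voice" toggles voice mode; assumption: leaving voice mode
            // while recording also stops the recording.
            InputEvent::Line(line) if line.trim() == "/voice" => {
                self.voice_mode = !self.voice_mode;
                if !self.voice_mode && self.recording {
                    self.recording = false;
                    Action::StopRecording
                } else {
                    Action::None
                }
            }
            // Any other typed line goes through the normal input path.
            InputEvent::Line(line) => Action::Submit(line),
            // Ctrl+V starts or stops recording, but only in voice mode.
            InputEvent::CtrlV if self.voice_mode => {
                self.recording = !self.recording;
                if self.recording {
                    Action::StartRecording
                } else {
                    Action::StopRecording
                }
            }
            InputEvent::CtrlV => Action::None,
            // Recognized speech is treated exactly like typed text.
            InputEvent::Transcript(text) => Action::Submit(text),
        }
    }
}
```

Keeping the state machine free of audio and recognition code means the toggle logic can be unit-tested without a microphone or a Vosk model installed.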

Dependencies added:

  • cpal = "0.15.2"
  • whisper-rs = "0.8.0"
  • vosk = "0.2.0"

Note: This feature was developed and tested on Linux. Additional platform-specific installation instructions for speech recognition dependencies may be needed.

NullLabTests avatar Mar 02 '25 00:03 NullLabTests

cool! would it be possible to have this on desktop as well? (in some ways perhaps more common to be used there?)

michaelneale avatar Mar 03 '25 01:03 michaelneale

psst https://github.com/Kvadratni/speech-mcp

Kvadratni avatar Mar 05 '25 23:03 Kvadratni

Nice, converted to draft as it may be redundant?

NullLabTests avatar Mar 06 '25 15:03 NullLabTests

Wanted to follow up on this! Since this is a bit out of date and we have a few implementations floating around, I'm going to try to consolidate on an issue first, and then we can bring things back to a PR?

#2952

baxen avatar Jun 16 '25 20:06 baxen