claudecodeui icon indicating copy to clipboard operation
claudecodeui copied to clipboard

Add dependencies and read me information for whisper transcription feature, need to add openai key

Open shanelindsay opened this issue 7 months ago • 1 comments

Microphone Voice Dictation Not Working - Issue & Resolution Guide

The Problem Microphone voice dictation feature appeared to work (recording started/stopped) but failed to produce transcriptions, with server errors about missing modules.

Root Cause The issue was missing server-side dependencies for the transcription functionality. The client-side audio recording code was working correctly, but the server couldn't process the audio files. Investigation Process Frontend was fine - Audio recording, WebRTC, and file upload worked properly Server endpoint existed - /api/transcribe endpoint was implemented Dependencies were missing - Required packages not installed despite being used in code Missing Components Found multer

  • For handling multipart file uploads from the frontend form-data
  • For formatting audio data to send to OpenAI's Whisper API OPENAI_API_KEY
  • Environment variable for API authentication Resolution Steps

1. Install missing dependencies

npm install multer form-data

2. Add OpenAI API key to .env file

echo "OPENAI_API_KEY=your_openai_api_key_here" >> .env

Suggestion to mention the need to put OPENAI key in the readme file as well as fixing dependencies

shanelindsay avatar Jul 05 '25 07:07 shanelindsay

Also it should advise in README for transcription that only works on https for remote clients

shanelindsay avatar Jul 06 '25 09:07 shanelindsay