Web application for recording audio, transcribing speech (STT) and translating to Spanish.
- 🎤 Audio recording from microphone
- 🗣️ Speech-to-text using OpenAI Whisper
- 🌍 Translation to Spanish using GPT
- ⚡ Real-time processing
- 🎨 Modern React interface
- Frontend: React + TypeScript + Vite
- Backend: Node.js + Express + TypeScript
- AI: OpenAI API (Whisper + GPT)
- Node.js 18+
- OpenAI API key
git clone <repository-url>
cd llm-stt
cd backend
npm install
cp .env.example .env
# Edit .env and add your OPENAI_API_KEY
cd ../frontend
npm install
cd backend
npm run dev
# Server runs on http://localhost:5000
cd frontend
npm run dev
# App opens on http://localhost:5173
- Open http://localhost:5173 in browser
- Click "Start Recording"
- Speak in Russian
- Click "Stop Recording"
- Wait for transcription and translation
- View results: original text and Spanish translation
POST /api/transcribe
- Process audio file- Input: audio file (multipart/form-data)
- Output:
{ originalText, translatedText }
GET /health
- Health check
llm-stt/
├── frontend/ # React application
│ ├── src/
│ │ ├── components/
│ │ │ ├── AudioRecorder.tsx
│ │ │ └── TranslationDisplay.tsx
│ │ └── App.tsx
├── backend/ # Node.js API
│ ├── src/
│ │ └── index.ts
│ └── .env.example
└── README.md
Backend (.env):
OPENAI_API_KEY
- Your OpenAI API keyPORT
- Server port (default: 5000)
MIT