Voice Chat Demo - React AI Chatbot

A modern React chatbot application featuring AI-powered conversations with Web Speech-to-Text (STT) and Text-to-Speech (TTS) capabilities, built with Vite and styled with Tailwind CSS.

🚀 Features

Voice Recognition: Web Speech API integration for speech-to-text conversion
Text-to-Speech: Natural voice synthesis for AI responses
AI Integration: Support for multiple AI providers (OpenAI, Anthropic, Groq, Grok)
Visual AI: Image analysis and visual understanding capabilities
Modern UI: Responsive design with Tailwind CSS
Real-time Chat: Seamless conversation flow with message history
Thinking Mode: Advanced reasoning display for complex queries

🛠️ Tech Stack

Frontend: React 18 + Vite
Styling: Tailwind CSS
Speech APIs: Web Speech API (STT/TTS)
AI Services: OpenAI, Anthropic, Groq, Grok APIs
Build Tool: Vite with Hot Module Replacement (HMR)

📋 Prerequisites

Node.js (version 16 or higher)
npm or yarn package manager
API keys for your preferred AI service(s)

⚡ Quick Start

1. Clone the repository

git clone https://github.com/onigetoc/voice-chat-react.git
cd voice-chat-demo

2. Install dependencies

npm install

3. Environment Configuration

Important: Rename the .env.example file to .env:

Then edit the .env file and add your API keys:

VITE_OPENAI_API_KEY=your_openai_api_key_here
VITE_ANTHROPIC_API_KEY=your_anthropic_api_key_here
VITE_GROQ_API_KEY=your_groq_api_key_here
VITE_GROK_API_KEY=your_grok_api_key_here

Note:

You only need to configure the API keys for the services you plan to use.
Google Chrome offers the most natural and expressive voices available through the Web Speech API.

4. Start the development server

npm run dev

The application will be available at http://localhost:5173

🔧 Available Scripts

npm run dev - Start development server with HMR
npm run build - Build for production
npm run preview - Preview production build locally
npm run lint - Run ESLint for code quality

🎤 Voice Features

Speech-to-Text (STT)

Click the microphone button to start voice recognition
Speak clearly for best recognition accuracy
The system automatically converts speech to text

Text-to-Speech (TTS)

AI responses are automatically spoken aloud
Toggle voice output on/off as needed
Supports multiple voice options

🔧 Troubleshooting - Overlapping Voice Issues

Symptoms

Speech synthesis restarts while already speaking
Text rewrites in loops
Conflicts between voice recognition and speech synthesis

Technical Solutions

1. Speech Synthesis State Management

// Stop all ongoing speech before starting new synthesis
const stopAllSpeech = () => {
  window.speechSynthesis.cancel();
};

// Check if synthesis is currently active
const isSpeaking = () => {
  return window.speechSynthesis.speaking;
};

2. Voice Recognition Management

// Pause listening during speech synthesis
const pauseListeningDuringSpeech = () => {
  if (recognition && recognition.continuous) {
    recognition.stop();
  }
};

// Resume listening after synthesis ends
speechUtterance.onend = () => {
  if (recognition && isListening) {
    recognition.start();
  }
};

3. Mutex Pattern (Lock State)

const [isProcessing, setIsProcessing] = useState(false);

const handleSpeechToText = async () => {
  if (isProcessing) return; // Prevent multiple calls
  setIsProcessing(true);
  
  // ... processing logic ...
  
  setIsProcessing(false);
};

4. Debouncing to Prevent Multiple Triggers

import { useCallback } from 'react';
import { debounce } from 'lodash';

const debouncedProcessSpeech = useCallback(
  debounce((text) => {
    // Text processing logic
  }, 300),
  []
);

Recommended Checks

Verify speechSynthesis.cancel() is called before each new synthesis
Ensure voice recognition stops during speech synthesis
Implement state system to prevent concurrent calls
Add logging to debug event sequences

🔌 API Configuration

The application supports multiple AI providers. Configure your preferred service(s) in the .env file:

OpenAI: GPT models for general conversation
Anthropic: Claude models for advanced reasoning
Groq: Fast inference for real-time responses
Grok: Alternative AI provider option

📱 Browser Compatibility

Chrome: Full support for all features
Firefox: STT/TTS support may vary
Safari: Limited Web Speech API support
Edge: Good compatibility with modern versions

🤝 Contributing

Fork the repository
Create a feature branch (git checkout -b feature/new-feature)
Commit your changes (git commit -am 'Add new feature')
Push to the branch (git push origin feature/new-feature)
Create a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🆘 Support

If you encounter any issues or have questions:

Check the troubleshooting section above
Review browser console for error messages
Ensure API keys are correctly configured
Verify microphone permissions are granted

🚧 Development Notes

Built with React 18 and modern JavaScript features
Uses Vite for fast development and building
Implements best practices for voice application development
Modular component architecture for easy maintenance

Happy coding! 🎉

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.github		.github
public		public
src		src
.clinerules		.clinerules
.env.example		.env.example
.gitignore		.gitignore
.roorules		.roorules
CORRECTIONS_HISTORIQUE_IMPLEMENTEES.md		CORRECTIONS_HISTORIQUE_IMPLEMENTEES.md
CORRECTIONS_IMPLEMENTEES.md		CORRECTIONS_IMPLEMENTEES.md
FONCTIONNALITE_THINKING.md		FONCTIONNALITE_THINKING.md
PLAN_CORRECTION_HISTORIQUE.md		PLAN_CORRECTION_HISTORIQUE.md
PROBLEME_HISTORIQUE_CONVERSATION.md		PROBLEME_HISTORIQUE_CONVERSATION.md
README.md		README.md
TEST_THINKING_EXAMPLE.md		TEST_THINKING_EXAMPLE.md
app-FROM-JS-VERSION.js		app-FROM-JS-VERSION.js
eslint.config.js		eslint.config.js
index.html		index.html
indexDEMO.html		indexDEMO.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
vite.config.js		vite.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Voice Chat Demo - React AI Chatbot

🚀 Features

🛠️ Tech Stack

📋 Prerequisites

⚡ Quick Start

1. Clone the repository

2. Install dependencies

3. Environment Configuration

4. Start the development server

🔧 Available Scripts

🎤 Voice Features

Speech-to-Text (STT)

Text-to-Speech (TTS)

🔧 Troubleshooting - Overlapping Voice Issues

Symptoms

Technical Solutions

1. Speech Synthesis State Management

2. Voice Recognition Management

3. Mutex Pattern (Lock State)

4. Debouncing to Prevent Multiple Triggers

Recommended Checks

🔌 API Configuration

📱 Browser Compatibility

🤝 Contributing

📄 License

🆘 Support

🚧 Development Notes

About

Uh oh!

Releases

Packages

Languages

onigetoc/voice-chat-react

Folders and files

Latest commit

History

Repository files navigation

Voice Chat Demo - React AI Chatbot

🚀 Features

🛠️ Tech Stack

📋 Prerequisites

⚡ Quick Start

1. Clone the repository

2. Install dependencies

3. Environment Configuration

4. Start the development server

🔧 Available Scripts

🎤 Voice Features

Speech-to-Text (STT)

Text-to-Speech (TTS)

🔧 Troubleshooting - Overlapping Voice Issues

Symptoms

Technical Solutions

1. Speech Synthesis State Management

2. Voice Recognition Management

3. Mutex Pattern (Lock State)

4. Debouncing to Prevent Multiple Triggers

Recommended Checks

🔌 API Configuration

📱 Browser Compatibility

🤝 Contributing

📄 License

🆘 Support

🚧 Development Notes

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages