Babylon Tower is a comprehensive solution for speech recognition and synthesis, offering various features such as audio file translation, speech recognition via microphone, and even real-time translation during audio calls.
- Real-time Audio Call Translation: Make audio calls with on-the-fly speech translation.
- Speech Recognition via Microphone: Recognize speech directly through your microphone.
- Text Translation and Voice Synthesization: Quick text translation.
- Audio File Translation: Upload an audio file and receive an audio file with the translation.
- Python 3.8 or higher
- Node.js 20 or higher
- npm 10 or higher
- Navigate to the backend directory:
cd backend
- a.
Ubuntu
:
sudo apt-get install portaudio19-dev python3-pyaudio
- Install the required Python packages:
pip install -r requirements.txt
- Start the Flask server:
python app.py
- Navigate to the frontend directory:
cd frontend
- Install the required npm packages:
npm install
- Start the Next.js development server:
npm run dev
The backend will be running on http://127.0.0.1:5000 The frontend will be running on http://127.0.0.1:3000
This project is licensed under the MIT License.
If you have any questions or need further assistance, please contact [artuar1990@gmail.com].