Teacher.ai

This project records audio until silence is detected, transcribes it with a speech-to-text API, and plays back a response as synthesized speech. The application is designed to run on a Jetson device with ALSA and Flask set up.

Prerequisites

Before you start, ensure the following dependencies are installed on your system:

Operating System: Linux-based system (e.g., Ubuntu) on Jetson devices.
Python: Python 3.8 or higher installed.
Packages:
- Flask
- Flask-SocketIO
- pyaudio
- requests
- pydub
ALSA Utilities:
- alsa-utils (provides the arecord and aplay commands)

Install ALSA utilities using:

sudo apt update
sudo apt install alsa-utils

Microphone and Speaker: Ensure your microphone and speaker are properly connected and recognized by the system.

Installation

Clone the repository:

git clone https://github.com/dextboy/treehacks.git
cd treehacks/backend

Set up a Python virtual environment (recommended):

python3 -m venv venv
source venv/bin/activate

Install Python dependencies:

pip install -r requirements.txt

Install additional required libraries:
- pyaudio may require the portaudio library. Install it using:
```
sudo apt install libportaudio2 libportaudiocpp0 portaudio19-dev
```
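Before moving on, it can help to confirm that the packages listed under Prerequisites are importable from the active virtual environment. The snippet below is a minimal sketch and not part of the repository: the helper name missing_modules is hypothetical, and the import names assume the usual mapping from package names (for example, Flask-SocketIO imports as flask_socketio).

```python
import importlib.util

# Import names for the packages listed under Prerequisites.
# Assumption: the standard import name for each package.
REQUIRED = ["flask", "flask_socketio", "pyaudio", "requests", "pydub"]


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

If pyaudio shows up as missing, installing the portaudio packages above and re-running pip install is the first thing to try.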
Configure ALSA to use the correct audio device:
- Edit the ~/.asoundrc file:
```
pcm.!default {
    type hw
    card 2
    device 0
}

ctl.!default {
    type hw
    card 2
}
```
  Replace card and device with the appropriate values for your system. Use arecord -l to identify them.
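To make the card and device lookup less error-prone, the output of arecord -l can be parsed programmatically. This is a minimal sketch rather than code from the repository: the function names are hypothetical, the sample listing is illustrative and was not captured from a real Jetson, and the pattern assumes the English-language output format of arecord.

```python
import re
import subprocess

# Matches lines such as:
#   card 2: Device [USB Audio Device], device 0: USB Audio [USB Audio]
CARD_LINE = re.compile(r"^card (\d+): .*?, device (\d+):", re.MULTILINE)


def parse_capture_devices(listing):
    """Return (card, device) integer pairs found in `arecord -l` output."""
    return [(int(card), int(dev)) for card, dev in CARD_LINE.findall(listing)]


def list_capture_devices():
    """Run `arecord -l` and parse its output (requires alsa-utils)."""
    result = subprocess.run(
        ["arecord", "-l"], capture_output=True, text=True, check=True
    )
    return parse_capture_devices(result.stdout)


# Illustrative sample only, not real output from this project's hardware.
SAMPLE = """**** List of CAPTURE Hardware Devices ****
card 2: Device [USB Audio Device], device 0: USB Audio [USB Audio]
  Subdevices: 1/1
  Subdevice #0: subdevice #0
"""

print(parse_capture_devices(SAMPLE))  # [(2, 0)]
```

Each pair maps directly onto the card and device fields in ~/.asoundrc.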

Configuration

Set Up Flask Application:
- Run the application by navigating to the project directory and starting the Flask server:
```
python app.py
```
ALSA Setup:
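- Reinitialize the sound card to its default state. The ~/.asoundrc file itself is read when an application opens an audio device, so edits to it need no separate reload: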
```
sudo alsactl init
```

Usage

Start Recording Until Silence: Run the application, and it will start recording audio until silence is detected. The recorded file will be saved as output.wav.
Process Recorded Audio: The output.wav file will automatically be sent to the /process/speech API endpoint for transcription and speech synthesis.
Play Synthesized Speech: The synthesized audio file will be played on the device.
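The "record until silence" step can be illustrated with a small, hardware-free sketch. This is not the repository's implementation, which is not shown here: the threshold, the number of quiet chunks, and the function names are all assumptions chosen for illustration. It treats audio as chunks of signed 16-bit PCM in native byte order and stops once several consecutive chunks fall below an RMS threshold.

```python
import array
import math

SILENCE_RMS = 500          # assumed amplitude threshold for 16-bit samples
SILENT_CHUNKS_TO_STOP = 3  # assumed number of consecutive quiet chunks


def rms(chunk):
    """Root-mean-square amplitude of a chunk of signed 16-bit PCM bytes."""
    samples = array.array("h", chunk)  # native byte order
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def record_until_silence(chunks, threshold=SILENCE_RMS,
                         patience=SILENT_CHUNKS_TO_STOP):
    """Collect chunks until `patience` consecutive chunks are below threshold.

    Leading silence is skipped, so recording only stops after sound has begun.
    """
    kept, quiet, started = [], 0, False
    for chunk in chunks:
        loud = rms(chunk) >= threshold
        if not started:
            if not loud:
                continue  # still waiting for the speaker to start
            started = True
        kept.append(chunk)
        quiet = 0 if loud else quiet + 1
        if quiet >= patience:
            break
    return b"".join(kept)


# Synthetic chunks standing in for microphone input (4 samples = 8 bytes each).
loud = array.array("h", [2000] * 4).tobytes()
quiet = array.array("h", [0] * 4).tobytes()
audio = record_until_silence([quiet, loud, loud, quiet, quiet, quiet, loud])
print(len(audio))  # 40: two loud chunks plus three quiet chunks
```

In a real setup the chunks would come from the microphone (for example, repeated reads from a PyAudio input stream), and the threshold would need tuning for the room and microphone in use.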

Troubleshooting

ALSA Errors:
- If you encounter errors like Unknown PCM or Cannot open device, verify your .asoundrc configuration and check connected audio devices using:
```
aplay -l
arecord -l
```
PyAudio Installation:
- If pyaudio fails to install, ensure the portaudio library is installed:
```
sudo apt install libportaudio2 libportaudiocpp0 portaudio19-dev
```
Permissions Issues:
- Add your user to the audio group, then log out and back in so the new membership takes effect:
```
sudo usermod -aG audio $USER
```
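To confirm that the group change is active in the current session, a short check like the following can be used. It is a sketch with a hypothetical helper name, in_group, and it relies only on the standard grp and os modules available on Linux.

```python
import grp
import os


def in_group(group_name):
    """Return True if the current process belongs to the named group."""
    try:
        gid = grp.getgrnam(group_name).gr_gid
    except KeyError:
        return False  # the group does not exist on this system
    # Check both the effective group and the supplementary groups.
    return gid == os.getegid() or gid in os.getgroups()


if __name__ == "__main__":
    print("audio group active:", in_group("audio"))
```

If this reports False right after running usermod, the session most likely predates the change and a fresh login is needed.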
Device Not Recognized:
- Check the microphone and speaker connections, then reinitialize the sound card with:
```
sudo alsactl init
```
