"Speak your language, they hear theirs."
FluentMeet is a state-of-the-art, real-time voice translation video conferencing platform. It eliminates language barriers in global professional collaborations by providing instantaneous, natural-sounding voice translation, allowing participants to communicate naturally in their native tongues.
## Key Features
- Near-Instantaneous Translation: Targeted glass-to-glass latency of under 1.5 seconds.
- Natural Voice Synthesis: High-quality TTS that preserves the natural flow of conversation.
- Zero-Friction Client: Participants join via a secure link with no mandatory account creation for guests.
- Intelligent Audio Routing: An "SFU-Lite" routing layer that forwards raw or translated audio according to each participant's language preference.
- Dual-Language Captions: Real-time transcripts showing both original and translated text concurrently.
- Professional Vocabulary: Optimised for business, technical, and domain-specific context.
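The "Intelligent Audio Routing" feature boils down to a per-listener decision: same-language listeners get the untouched raw stream, everyone else gets a synthesized translation. A minimal sketch of that decision (the `Participant` model, `select_stream` name, and stream labels are illustrative, not the actual implementation):

```python
from dataclasses import dataclass

@dataclass
class Participant:
    name: str
    language: str  # e.g. "en", "de", "zh"

def select_stream(speaker: Participant, listener: Participant) -> str:
    """Decide which audio stream a listener should receive.

    Listeners who share the speaker's language get the raw stream
    (lowest latency); everyone else gets the synthesized translation
    in their own language.
    """
    if listener.language == speaker.language:
        return "raw"
    return f"translated:{listener.language}"

speaker = Participant("Alice", "en")
listeners = [Participant("Bob", "en"), Participant("Chen", "zh")]
routes = {p.name: select_stream(speaker, p) for p in listeners}
# routes == {"Bob": "raw", "Chen": "translated:zh"}
```

In a real SFU this mapping would be consulted on every egress frame, so keeping it a pure function of (speaker, listener) makes it cheap to evaluate and easy to test.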
## Tech Stack
- Framework: FastAPI (Asynchronous, high-concurrency architecture).
- Data Persistence: PostgreSQL with SQLAlchemy 2.0 (Async).
- Migration Management: Alembic (Configured for asynchronous migrations).
- Event Streaming: Apache Kafka (Decoupled, event-driven audio processing pipeline).
- Real-time Communication: WebSockets for media signaling and caption streaming.
- In-Memory Store: Redis for live room state, participant sessions, and rate-limiting.
- STT (Speech-to-Text): Deepgram / OpenAI Whisper (High-accuracy streaming).
- Machine Translation: DeepL API / GPT-4o (Context-aware translation).
- TTS (Text-to-Speech): Voice.ai (Natural audio synthesis).
## Architecture
FluentMeet uses an event-driven pipeline to keep latency minimal and scale horizontally:
- Ingest: Speaker's audio is captured via WebRTC and streamed over WebSockets to the Backend.
- STT: Raw audio chunks are pushed to Kafka (`audio.raw`), consumed by STT workers, and converted to text.
- Translation: Original text is pushed to Kafka (`text.original`), consumed by Translation workers, and translated into the target language.
- TTS: Translated text is pushed to Kafka (`text.translated`), consumed by TTS workers, and synthesized into target-language audio.
- Egress: Synthesized audio is pushed back to Kafka (`audio.synthesized`) and routed via WebSockets to the listeners who require that language.
```mermaid
graph TD
    UserA[Speaker] -->|WebRTC/WS| Backend[Signaling & Routing Server]
    Backend -->|Raw Audio| K1((Kafka: audio.raw))
    K1 --> STT[STT Worker]
    STT -->|Original Text| K2((Kafka: text.original))
    K2 --> TL[Translation Worker]
    TL -->|Translated Text| K3((Kafka: text.translated))
    K3 --> TTS[TTS Worker]
    TTS -->|Synthesized Audio| K4((Kafka: audio.synthesized))
    K4 --> Backend
    Backend -->|Translated Audio| UserB[Listener]
```
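The pipeline above can be traced end-to-end with a toy in-process version, using a plain dict as a stand-in for the Kafka topics and trivial stubs for the STT/MT/TTS workers (all function names and payloads here are illustrative, not the real workers):

```python
from collections import defaultdict

# In-memory stand-in for Kafka: topic name -> list of messages.
topics: dict[str, list] = defaultdict(list)

def produce(topic: str, message) -> None:
    topics[topic].append(message)

def stt_worker(audio_chunk: bytes) -> None:
    # A real worker would call Deepgram/Whisper; we fake a transcript.
    produce("text.original", f"transcript({len(audio_chunk)} bytes)")

def translation_worker(text: str, target_lang: str) -> None:
    # Stand-in for DeepL / GPT-4o translation.
    produce("text.translated", f"[{target_lang}] {text}")

def tts_worker(text: str) -> None:
    # Stand-in for Voice.ai synthesis: text in, audio bytes out.
    produce("audio.synthesized", text.encode())

# Drive one utterance through all four stages.
produce("audio.raw", b"\x00" * 320)
for chunk in topics["audio.raw"]:
    stt_worker(chunk)
for text in topics["text.original"]:
    translation_worker(text, "de")
for text in topics["text.translated"]:
    tts_worker(text)

# topics["audio.synthesized"] now holds audio ready for egress routing.
```

Because each stage only talks to its input and output topics, the real workers can be scaled and deployed independently — that decoupling is the point of running the pipeline over Kafka rather than in-process.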
## Getting Started
### Prerequisites
- Python 3.11+
- Docker and Docker Compose
- Access to Deepgram, DeepL, and Voice.ai APIs (API Keys needed).
### Installation
Clone the repository:
```bash
git clone <repository-url>
cd FluentMeet
```
Copy the example environment file and fill in your credentials:
```bash
cp .env.example .env
```
Generate a secure SECRET_KEY for JWT:
```bash
python -c "import secrets; print(secrets.token_hex(32))"
```
It is highly recommended to use a virtual environment:
```bash
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt
```
Start the required infrastructure (PostgreSQL, Redis, Kafka, Zookeeper):
```bash
docker-compose up -d
```
Initialize the database schema using Alembic:
```bash
alembic upgrade head
```
Start the development server:
```bash
uvicorn app.main:app --reload
```
The API will be available at http://localhost:8000, with interactive API documentation (Swagger UI) at http://localhost:8000/docs.
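The SECRET_KEY generated during setup is what the server uses to sign JWTs. In practice you would use a maintained library such as PyJWT rather than rolling your own, but a stdlib-only sketch shows the mechanics of HS256 signing (the payload fields and secret below are illustrative):

```python
import base64
import hashlib
import hmac
import json
import time

def b64url(data: bytes) -> str:
    # JWT uses unpadded URL-safe base64.
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()

def make_jwt(payload: dict, secret: str) -> str:
    """Sign a compact HS256 JWT: header.payload.signature."""
    header = b64url(json.dumps({"alg": "HS256", "typ": "JWT"}).encode())
    body = b64url(json.dumps(payload).encode())
    signing_input = f"{header}.{body}".encode()
    sig = hmac.new(secret.encode(), signing_input, hashlib.sha256).digest()
    return f"{header}.{body}.{b64url(sig)}"

# Illustrative secret; in the app this comes from the .env SECRET_KEY.
secret = "replace-with-your-generated-SECRET_KEY"
token = make_jwt({"sub": "user-123", "exp": int(time.time()) + 900}, secret)
# token is the familiar three dot-separated base64url segments
```

Anyone holding the SECRET_KEY can mint valid tokens, which is why the setup step insists on generating it with `secrets` rather than hard-coding a guessable value.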
## Development
### Database Migrations
Create your SQLAlchemy models in `app/models.py` using the async syntax. Ensure your models are imported in `app/models/__init__.py` for Alembic to detect them during migrations:
```bash
python -m alembic revision --autogenerate -m "Add Meeting model"
python -m alembic upgrade head
```
### Testing
Run the test suite:
```bash
pytest
```
Generate and view a coverage report:
```bash
pytest tests/ -v --cov=app --cov-report=html --cov-report=term
# Open htmlcov/index.html in your browser
```
## Security
- Authentication: JWT-based authentication with `HttpOnly`, `Secure`, `SameSite=Strict` cookies for Refresh Tokens.
- Data Privacy: Ephemeral audio/text processing; no data is persisted after the meeting ends.
- Rate Limiting: Redis-backed throttling to manage API costs and prevent abuse.
- Soft-Delete: Strict account deletion policies preventing reactivation via login.
## Code Quality
- Black: Enforce consistent code formatting.
```bash
black .
```
- isort: Sort imports for readability.
```bash
isort .
```
- ruff: Linting for code quality and style.
```bash
python -m ruff check .
```
## Contributing
We welcome contributions! Please follow these steps:
- Fork the repository.
- Create your feature branch (`git checkout -b feature/AmazingFeature`).
- Ensure your code follows Black and isort formatting.
- Commit your changes (`git commit -m 'Add some AmazingFeature'`).
- Push to the branch (`git push origin feature/AmazingFeature`).
- Open a Pull Request.
## License
This project is licensed under the MIT License; see the LICENSE file for details.