A native Python desktop application that simulates realistic job interviews using AI.
- AI-Powered Interviews: Claude LLM drives realistic interview conversations with dynamic follow-up questions
- Human-like Voice: ElevenLabs TTS with emotion modulation for natural-sounding speech
- Real-time Voice Analysis: Pitch, intensity, and tempo analysis based on the Juslin & Laukka (2003) framework (see the sketch after this list)
- Visual Analysis: MediaPipe-based face/pose detection (100% local, free)
  - Eye contact tracking
  - Blink detection
  - Posture analysis
  - Emotion indicators from facial expressions
- Intelligent Interjections: AI interjects when responses drift off-topic, stall in long pauses, or become unclear
- Post-Interview Feedback: Comprehensive grading and actionable improvement tips
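
The analyzer's exact feature set isn't documented here, but a minimal sketch of Juslin & Laukka-style vocal cues (pitch level and variability, intensity, speech rate) might look like the following. librosa is an assumption; the project's real implementation lives in src/audio/analyzer.py.

```python
# Illustrative only: frame-level voice features in the spirit of src/audio/analyzer.py.
import librosa
import numpy as np

def voice_features(wav_path: str) -> dict:
    y, sr = librosa.load(wav_path, sr=16000, mono=True)

    # Pitch via probabilistic YIN, bounded like voice_analysis.pitch_range_hz
    f0, _, _ = librosa.pyin(y, fmin=75.0, fmax=500.0, sr=sr)
    # Intensity as RMS energy in dB
    rms = librosa.feature.rms(y=y)[0]
    # Crude tempo proxy: acoustic onsets per second
    onsets = librosa.onset.onset_detect(y=y, sr=sr)

    return {
        "pitch_mean_hz": float(np.nanmean(f0)),  # NaNs mark unvoiced frames
        "pitch_std_hz": float(np.nanstd(f0)),
        "intensity_db": float(np.mean(librosa.amplitude_to_db(rms))),
        "onsets_per_sec": len(onsets) / (len(y) / sr),
    }
```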
```
┌─────────────────────────────────────────────────────────────┐
│                       UI Layer (PyQt6)                       │
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────────────┐  │
│  │   Camera    │  │ Transcript  │  │   Modulation Viz    │  │
│  │   Widget    │  │   Widget    │  │       Widget        │  │
│  └─────────────┘  └─────────────┘  └─────────────────────┘  │
└─────────────────────────────────────────────────────────────┘
                              │
┌─────────────────────────────────────────────────────────────┐
│                         Core Engine                          │
│  ┌───────────────┐   ┌───────────────┐   ┌───────────────┐  │
│  │    Session    │   │ Interjection  │   │    Grading    │  │
│  │    Manager    │   │    Engine     │   │    Engine     │  │
│  └───────────────┘   └───────────────┘   └───────────────┘  │
└─────────────────────────────────────────────────────────────┘
                              │
┌───────────────────┬─────────┴─────────┬───────────────────┐
│  Audio Pipeline   │  AI Integration   │   Vision Module   │
│  ┌─────────────┐  │  ┌─────────────┐  │  ┌─────────────┐  │
│  │ Recorder    │  │  │ Claude      │  │  │ Face        │  │
│  │ Analyzer    │  │  │ Client      │  │  │ Analyzer    │  │
│  │ Transcriber │  │  │ ElevenLabs  │  │  │ Gaze        │  │
│  │ Player      │  │  │ TTS         │  │  │ Tracker     │  │
│  └─────────────┘  │  └─────────────┘  │  └─────────────┘  │
└───────────────────┴───────────────────┴───────────────────┘
```
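
The Gaze Tracker's eye-contact logic isn't shown here; one common approach with MediaPipe Face Mesh is to check where the iris sits between the eye corners. A rough sketch follows; the landmark indices are the commonly cited Face Mesh ones and should be treated as illustrative, since the real logic lives in src/vision/gaze_tracker.py.

```python
# Illustrative eye-contact heuristic using MediaPipe Face Mesh iris landmarks.
import cv2
import mediapipe as mp

EYE_OUTER, EYE_INNER, IRIS_CENTER = 33, 133, 468  # commonly cited indices for one eye

face_mesh = mp.solutions.face_mesh.FaceMesh(refine_landmarks=True)  # iris needs refine_landmarks

def looking_at_camera(frame_bgr, tolerance: float = 0.12) -> bool:
    """True if the iris is roughly centered between the eye corners."""
    results = face_mesh.process(cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2RGB))
    if not results.multi_face_landmarks:
        return False  # no face detected this frame
    lm = results.multi_face_landmarks[0].landmark
    outer, inner, iris = lm[EYE_OUTER], lm[EYE_INNER], lm[IRIS_CENTER]
    ratio = (iris.x - outer.x) / (inner.x - outer.x + 1e-6)  # ~0.5 means centered gaze
    return abs(ratio - 0.5) < tolerance
```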
- Python 3.11+
- Microphone
- Camera (optional, for visual analysis)
- Anthropic API key (Claude)
- ElevenLabs API key
- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/AudioInterviewer.git
  cd AudioInterviewer
  ```

- Create a virtual environment:

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Set API keys:

  ```bash
  export ANTHROPIC_API_KEY=your_claude_api_key
  export ELEVENLABS_API_KEY=your_elevenlabs_api_key
  ```

  Or create a .env file:

  ```bash
  cp .env.example .env
  # Edit .env with your API keys
  ```
Run the application with the graphical interface:

```bash
python -m src.main
```

For testing without the GUI:

```bash
python -m src.main --cli
```

Command-line options:

```
--cli           Run in CLI mode (no GUI)
--debug         Enable debug logging
--config PATH   Path to custom config file
--help          Show help message
```
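
For example, to run headless with debug logging and a custom config (the file name is a placeholder):

```bash
python -m src.main --cli --debug --config my_settings.yaml
```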
Edit config/settings.yaml to customize:

```yaml
interview:
  default_duration_minutes: 30
  question_count: 5

interjection:
  off_topic_threshold: 0.5
  pause_duration_threshold: 3.0
  clarity_threshold: 0.6
  cooldown_seconds: 10.0

grading:
  weights:
    confidence: 0.2
    clarity: 0.2
    content: 0.3
    engagement: 0.15
    eye_contact: 0.15

audio:
  sample_rate: 16000
  channels: 1
  buffer_size: 1024

voice_analysis:
  pitch_range_hz: [75.0, 500.0]
  silence_threshold_db: -40.0
```
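
To make the grading weights concrete: the overall grade is presumably a weighted sum of the per-dimension scores. A minimal sketch that reads the file with PyYAML (an assumption; the project's loader lives in src/utils/config.py) and applies the weights:

```python
# Illustrative: load settings.yaml and combine hypothetical per-dimension scores.
import yaml

with open("config/settings.yaml") as f:
    settings = yaml.safe_load(f)

weights = settings["grading"]["weights"]
scores = {  # hypothetical 0-100 scores for one session
    "confidence": 72, "clarity": 80, "content": 65,
    "engagement": 90, "eye_contact": 55,
}

overall = sum(weights[k] * scores[k] for k in weights)
print(f"Overall: {overall:.2f}")  # 71.65 with the numbers above
```

Since the default weights sum to 1.0, this is a plain weighted average.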
Project structure:

```
AudioInterviewer/
├── config/                    # Configuration files
│   ├── settings.yaml          # Main settings
│   ├── prompts.yaml           # Interview prompts
│   └── emotions.yaml          # Emotion thresholds
├── data/                      # Data storage
│   ├── sessions/              # Session recordings
│   ├── database/              # SQLite database
│   └── exports/               # Exported reports
├── src/
│   ├── ai/                    # AI integration
│   │   ├── claude_client.py
│   │   ├── elevenlabs_client.py
│   │   ├── context.py
│   │   └── evaluator.py
│   ├── audio/                 # Audio processing
│   │   ├── recorder.py
│   │   ├── analyzer.py
│   │   ├── transcriber.py
│   │   ├── player.py
│   │   └── vad.py
│   ├── core/                  # Core engine
│   │   ├── session_manager.py
│   │   ├── interjection.py
│   │   ├── grading.py
│   │   ├── feedback.py
│   │   ├── events.py
│   │   └── models.py
│   ├── data/                  # Data layer
│   │   ├── database.py
│   │   └── models.py
│   ├── ui/                    # User interface
│   │   ├── main_window.py
│   │   ├── camera_widget.py
│   │   ├── transcript_widget.py
│   │   ├── modulation_widget.py
│   │   ├── feedback_widget.py
│   │   ├── control_panel.py
│   │   ├── settings_dialog.py
│   │   └── styles.py
│   ├── utils/                 # Utilities
│   │   ├── config.py
│   │   ├── constants.py
│   │   └── logger.py
│   ├── vision/                # Visual analysis
│   │   ├── camera.py
│   │   ├── emotion_detector.py
│   │   ├── face_analyzer.py
│   │   ├── gaze_tracker.py
│   │   ├── posture_analyzer.py
│   │   └── models.py
│   └── main.py                # Entry point
├── tests/                     # Test suite
│   ├── test_audio/
│   ├── test_core/
│   ├── test_vision/
│   ├── integration/
│   └── conftest.py
├── scripts/                   # Helper scripts
│   ├── build.sh               # Build script
│   └── setup_dev.sh           # Dev setup
├── plans/                     # Architecture docs
├── pyproject.toml             # Project metadata
├── requirements.txt           # Dependencies
├── .env.example               # Environment template
└── README.md                  # This file
```
Run tests with pytest:

```bash
# Run all tests
pytest

# Run with coverage
pytest --cov=src

# Run a specific test module
pytest tests/test_audio/

# Run integration tests
pytest tests/integration/ -m integration
```

Build a standalone executable:

```bash
./scripts/build.sh
```

The executable will be in dist/AIInterviewer.
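
The contents of build.sh aren't shown here, but an output at dist/AIInterviewer is consistent with a PyInstaller-style build; a hypothetical equivalent command:

```bash
pyinstaller --onefile --name AIInterviewer src/main.py
```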
For development setup, run:

```bash
./scripts/setup_dev.sh
```

This will:

- Create a virtual environment
- Install dependencies
- Install development tools (pytest, black, ruff, mypy)
- Create .env from the template
- Create the necessary data directories
- Format with Black:

  ```bash
  black src tests
  ```

- Lint with Ruff:

  ```bash
  ruff check src tests
  ```

- Type check with mypy:

  ```bash
  mypy src
  ```
- Get your Claude API key from the Anthropic Console
- Get your ElevenLabs API key from ElevenLabs
- Voice analysis based on Juslin & Laukka (2003) - "Communication of emotions in vocal expression and music performance"
- Facial analysis using MediaPipe
- LLM powered by Claude
- TTS powered by ElevenLabs
MIT License - See LICENSE for details.
- Fork the repository
- Create a feature branch:

  ```bash
  git checkout -b feature/my-feature
  ```

- Commit your changes:

  ```bash
  git commit -am 'Add my feature'
  ```

- Push to the branch:

  ```bash
  git push origin feature/my-feature
  ```

- Submit a Pull Request