TranscripterAI

TranscripterAI is a Python pipeline that converts audio recordings of meetings or conversations into text and analyzes the result with AI. The script runs in two stages:

Transcription: the audio file is converted into text locally using OpenAI Whisper.
Analysis: the transcript is sent to the Gemini API, which identifies key topics, the candidate's technical skills, strengths and weaknesses, and evaluates the answers to technical questions.
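
The two stages can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual main.py: the function names, the injected stage callables, and the Gemini model name gemini-2.5-flash are hypothetical choices. The library calls themselves (whisper.load_model, model.transcribe, genai.Client and client.models.generate_content) are the documented entry points of openai-whisper and google-genai.

```python
from typing import Callable


def transcribe_with_whisper(audio_path: str, model_name: str = "tiny") -> str:
    """Stage 1: local speech-to-text with openai-whisper."""
    import whisper  # imported lazily so this file loads without Whisper installed

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return result["text"].strip()


def analyze_with_gemini(transcript: str, prompt: str, api_key: str) -> str:
    """Stage 2: send the prompt followed by the transcript to the Gemini API."""
    from google import genai  # lazy import, same reason as above

    client = genai.Client(api_key=api_key)
    response = client.models.generate_content(
        model="gemini-2.5-flash",  # assumed model name; use any model your key can access
        contents=f"{prompt}\n\n{transcript}",
    )
    return response.text or ""


def run_pipeline(
    audio_path: str,
    transcribe: Callable[[str], str],
    analyze: Callable[[str], str],
) -> dict:
    """Run both stages in order and return their outputs."""
    transcript = transcribe(audio_path)
    analysis = analyze(transcript)
    return {"transcript": transcript, "analysis": analysis}
```

Passing the stage functions in as arguments keeps the orchestration independent of Whisper and Gemini, so either stage can be swapped or stubbed out while testing.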

Features

Local audio transcription using Whisper.
Interview analysis via the Gemini API: key topics, technical skills, strengths and weaknesses.
Flexible configuration of analysis prompts.
Support for various audio formats via ffmpeg.

Installation

All installation steps below are for Windows and are run from the command line (press Win+S, type cmd).

1. Install Python

Download the latest version of Python from the official website: https://www.python.org/downloads/.
During installation, make sure to check Add Python to PATH.
Verify the installation:

python --version
pip --version

2. Install ffmpeg

Download an ffmpeg build for Windows: https://ffmpeg.org/download.html
Extract the archive, for example to C:\ffmpeg.
Add the path C:\ffmpeg\bin to the system PATH variable:
1. Open Control Panel: press Win + S, type Control Panel, and open it.
2. Go to System and Security → System → Advanced system settings (on the right).
3. In the System Properties window, click Environment Variables…
4. In System variables, find the variable Path and click Edit…
5. Click New and add the path to the bin folder of ffmpeg:
```
C:\ffmpeg\bin
```
6. Click OK in all windows to save the changes.
Verify the installation:

ffmpeg -version
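
The same check can be done from Python, which is handy as a preflight step before a long transcription. The sketch below is an assumption about how such a check might look, not part of the project; the helper names find_tool and tool_version_line are hypothetical. It only uses shutil.which and subprocess.run from the standard library.

```python
import shutil
import subprocess
from typing import Optional


def find_tool(name: str = "ffmpeg") -> Optional[str]:
    """Return the full path of an executable found on PATH, or None."""
    return shutil.which(name)


def tool_version_line(name: str = "ffmpeg") -> Optional[str]:
    """Return the first line printed by `<name> -version`, or None if unavailable."""
    path = find_tool(name)
    if path is None:
        return None
    try:
        completed = subprocess.run(
            [path, "-version"], capture_output=True, text=True, timeout=10
        )
    except (OSError, subprocess.TimeoutExpired):
        return None
    lines = completed.stdout.splitlines()
    return lines[0] if lines else None


if __name__ == "__main__":
    line = tool_version_line()
    print(line if line else "ffmpeg was not found on PATH")
```

If the script reports that ffmpeg was not found, reopen the command prompt so that the updated PATH is picked up, then try again.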

3. Install OpenAI Whisper

Install Whisper and PyTorch via pip:

pip install openai-whisper
pip install torch

Verify the installation:

whisper --help
pip show torch
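
As a complement to the commands above, the following sketch checks from Python whether the packages are importable. It is an illustration only; module_available and missing_modules are hypothetical helper names. The import names whisper and torch are the ones the openai-whisper and torch packages install.

```python
import importlib.util


def module_available(name: str) -> bool:
    """True if `name` can be located; top-level names are not imported by this check."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # Dotted names raise ModuleNotFoundError when the parent package is missing.
        return False


def missing_modules(names=("whisper", "torch")) -> list:
    """Return the subset of `names` that cannot be found."""
    return [n for n in names if not module_available(n)]


if __name__ == "__main__":
    missing = missing_modules()
    print("All set" if not missing else f"Missing: {', '.join(missing)}")
```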

⚠️ Note: the PyTorch package is large, so pip install can take a long time. Be patient.

4. Set up Gemini API

Obtain an API key via Google AI Studio:
1. Go to the API key creation page: https://aistudio.google.com/apikey
2. Log in with your Google account.
3. Create a new API key.
4. Select an existing project or create a new one.
5. Confirm the key creation.
6. Copy and save the key securely, as it will be shown only once.
Install the Python SDK for Gemini API:

pip install -q -U google-genai

Verify the installation:

pip show google-genai
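
Step 5 stores the key directly in config.py. If you would rather keep it out of the source tree, one option is to read it from an environment variable. The sketch below is an assumption, not something the project does: the helper name load_api_key and the variable name GEMINI_API_KEY are hypothetical choices.

```python
import os
from typing import Mapping, Optional


def load_api_key(
    env: Optional[Mapping[str, str]] = None,
    var_name: str = "GEMINI_API_KEY",  # assumed variable name
) -> str:
    """Read the API key from the environment and reject empty or placeholder values."""
    env = os.environ if env is None else env
    key = env.get(var_name, "").strip()
    if not key or key == "YOUR_KEY":
        raise RuntimeError(
            f"Set the {var_name} environment variable to your Gemini API key"
        )
    return key
```

With this helper available, config.py could contain GEMINI_API_KEY = load_api_key() instead of a hardcoded string.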

5. Install and Configure TranscripterAI

Clone this repository (or download it as a ZIP).
Open the project in IntelliJ IDEA or another IDE. A plain text editor such as Notepad++ also works.

Edit the config.py file with the following parameters:

Your Gemini API key:

GEMINI_API_KEY = "YOUR_KEY"

Feature flags:

transcription_flag = True  # activates local transcription via Whisper
analysis_flag = True       # activates analysis via Gemini API

Choose the Whisper transcription model. Larger models are more accurate but slower and need more memory. Model names are listed in the table inside config.py:

MODEL_NAME = "tiny"

Set the prompt for AI analysis:

PROMPT_ANALYSIS = """YOUR PROMPT"""
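
Before a long run it can help to check that the placeholders above were actually replaced. The sketch below is a hypothetical validator, not part of the project: the Settings class and validate function are illustrative names that mirror the parameters in config.py, and KNOWN_MODELS lists only the basic Whisper size names.

```python
from dataclasses import dataclass

# Basic multilingual size names; openai-whisper also ships variants
# such as tiny.en or large-v3, so extend this tuple as needed.
KNOWN_MODELS = ("tiny", "base", "small", "medium", "large")


@dataclass
class Settings:
    gemini_api_key: str
    transcription_flag: bool = True
    analysis_flag: bool = True
    model_name: str = "tiny"
    prompt_analysis: str = ""


def validate(settings: Settings) -> list:
    """Return a list of problems; an empty list means the config looks usable."""
    problems = []
    if not (settings.transcription_flag or settings.analysis_flag):
        problems.append("both stages are disabled, nothing to do")
    if settings.transcription_flag and settings.model_name not in KNOWN_MODELS:
        problems.append(f"unknown Whisper model: {settings.model_name!r}")
    if settings.analysis_flag:
        if settings.gemini_api_key in ("", "YOUR_KEY"):
            problems.append("GEMINI_API_KEY is still a placeholder")
        if settings.prompt_analysis.strip() in ("", "YOUR PROMPT"):
            problems.append("PROMPT_ANALYSIS is still a placeholder")
    return problems
```

The API key and prompt are only checked when analysis_flag is on, and the model name only when transcription_flag is on, so a transcription-only run does not need a Gemini key.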

Running and Output

1. Run `main.py`

Run it from the IDE or from the command line with python main.py.

2. Select an audio file

Any audio format that ffmpeg can decode is supported.

3. Wait for the process to finish

The output files are saved in the same folder as your audio file. After the run, that folder contains three files:

Original audio file
Transcript file
Transcript analysis file (labeled as Gemini)
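
To make the layout concrete, here is a sketch of how output paths next to the audio file could be derived. The exact file names the script produces are not documented here, so the _transcript.txt and _Gemini.txt suffixes and the output_paths helper are assumptions for illustration only.

```python
from pathlib import Path


def output_paths(audio_file: str) -> dict:
    """Build hypothetical output file paths in the same folder as the audio file."""
    audio = Path(audio_file)
    return {
        "audio": audio,
        # Suffixes below are assumed, not taken from the project.
        "transcript": audio.with_name(f"{audio.stem}_transcript.txt"),
        "analysis": audio.with_name(f"{audio.stem}_Gemini.txt"),
    }
```

For an input such as recordings/interview.mp3, both derived paths stay inside the recordings folder.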
