*See also: [README.ja.md](README.ja.md) (Japanese).*
# teamsTranscribe

teamsTranscribe is a Python desktop utility that captures live speech from your microphone and/or system audio output and renders captions in a floating overlay window. It combines PyAudio for audio capture, faster-whisper for high-quality speech recognition, and a PyQt-based overlay so you can keep real-time subtitles on top of applications such as Microsoft Teams, Zoom, or browser-based calls.
> **Proof of concept:** This project is an experimental prototype intended for evaluation; it is not yet production-hardened.
## Features

- Real-time transcription with low-latency streaming chunks
- Capture sources: microphone only, system loopback only, or a mix of both
- Movable, always-on-top overlay window with a toggleable status panel for live captions and session details
- Automatic language detection, with optional overrides via the config file or environment variables
- Simple CLI flags to list available devices and choose the capture mode
## Requirements

- Windows 10/11 (WASAPI loopback works out of the box); macOS and Linux are possible but require PortAudio-compatible loopback devices
- Python 3.9 or newer
- Git (optional, for cloning)
- Visual C++ 14+ runtime on Windows (needed for the PyAudio wheel)
- PortAudio development headers on Linux/macOS if you need to compile PyAudio from source
- Optional: a virtual loopback device (e.g., VB-Audio Virtual Cable) if your system does not expose a loopback capture device by default
## Installation

- Clone the repository or download the source zip, then open a terminal in the project root:

  ```
  git clone https://github.com/yourusername/teamsTranscribe.git
  cd teamsTranscribe
  ```

- Create and activate a virtual environment (recommended; the PowerShell activation is shown, on macOS/Linux use `source .venv/bin/activate`):

  ```
  python -m venv .venv
  .\.venv\Scripts\Activate.ps1
  ```

- Install the Python dependencies:

  ```
  pip install --upgrade pip
  pip install -r requirements.txt
  ```

- For GPU acceleration, install a CUDA-enabled build by following the official faster-whisper instructions; a hedged sketch follows this list.
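The extra NVIDIA libraries can typically be installed from PyPI. This is only a sketch assuming CUDA 12 on a machine with an NVIDIA GPU; the faster-whisper README is the authoritative reference for the exact packages your CUDA version needs:

```
# Assumption: CUDA 12; package names as used in common faster-whisper setups
pip install nvidia-cublas-cu12 nvidia-cudnn-cu12
```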
## Configuration

Review the configuration file at `config/settings.json` and adjust the defaults before launching:

```json
{
  "whisper_model_path": "base",
  "whisper_compute_type": "int8",
  "whisper_language": "auto",
  "whisper_window_seconds": 5,
  "whisper_overlap_seconds": 1,
  "whisper_beam_size": 1
}
```

- `whisper_model_path`: faster-whisper model name or local path (e.g., `base`, `medium`, `large-v3`).
- `whisper_compute_type`: `int8`, `float16`, etc., depending on CPU/GPU support.
- `whisper_language`: `auto` for automatic detection, or a language code (e.g., `en`, `ja`).
- `whisper_window_seconds` / `whisper_overlap_seconds`: control the streaming window size and overlap for smoother captions.
- `whisper_beam_size`: larger values improve accuracy at the cost of speed.

Environment variables with the uppercase key names (e.g., `WHISPER_MODEL_PATH`) still override the config when set. Set the `TEAMS_TRANSCRIBE_CONFIG` environment variable to point to an alternate config file if you maintain multiple profiles. The legacy `.env` file is still loaded automatically, so existing overrides continue to work.
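For example, a one-session override from PowerShell, assuming the uppercase mapping described above (the profile path here is purely illustrative):

```
# Uppercase variable names mirror the config keys, per the mapping above
$env:WHISPER_MODEL_PATH = "medium"
$env:WHISPER_LANGUAGE = "en"
# Hypothetical profile path, for illustration only
$env:TEAMS_TRANSCRIBE_CONFIG = "C:\profiles\meetings.json"
python -m src.main
```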
Use the `config` subcommand to inspect or update settings without editing JSON manually:

```
python -m src.main config --list
python -m src.main config --set whisper_model_path=medium --set whisper_language=en
```

Key names are case-insensitive and accept either `WHISPER_MODEL_PATH` or `whisper_model_path`. Provide `--config-path` to point at a different JSON file, for both the `config` command and the main run mode:

```
python -m src.main config --config-path C:\path\to\custom.json --list
python -m src.main --config-path C:\path\to\custom.json
```
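These flags compose; for instance, updating a value inside an alternate profile (assuming `--set` behaves the same when `--config-path` is supplied):

```
python -m src.main config --config-path C:\path\to\custom.json --set whisper_beam_size=5
```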
## Usage

- List available audio devices before launching, so you know which inputs are exposed; look for loopback or virtual devices if you plan to capture system audio:

  ```
  python -m src.main --list-devices
  ```

- Start live transcription (the default mixes microphone and system audio when available):

  ```
  python -m src.main
  ```

- Microphone-only capture:

  ```
  python -m src.main --mic-only
  ```

- System-only capture (requires a loopback device; otherwise it falls back to the microphone):

  ```
  python -m src.main --system-only
  ```
While running, the overlay window stays on top of other apps. Drag it to reposition, use the toggle arrow to reveal per-session status (model, language, compute type), and click the `X` button to close. The terminal logs show which devices were selected and whether voice activity detection had to fall back due to missing optional dependencies (e.g., `onnxruntime`).
## Packaging

- Editable install in the active environment (handy for local development):

  ```
  pip install -e .
  ```

- Build a wheel/sdist for distribution (requires `pip install build` once):

  ```
  python -m build
  ```

  The artifacts are created under `dist/`, and you can install them on another machine with `pip install dist/teamsTranscribe-<version>-py3-none-any.whl`.
## Troubleshooting

- PyAudio install failures: install the Visual C++ Build Tools on Windows or the PortAudio headers on Linux/macOS, then retry `pip install pyaudio` (example commands after this list).
- No system audio captured: ensure your audio driver exposes a WASAPI loopback device, or install a virtual audio cable.
- Slow transcription: switch to a smaller model (`tiny`, `base`) or reduce the beam size. For better performance, use a GPU build of faster-whisper.
- Overlay does not appear: PyQt needs access to a desktop session; ensure you are not running headless and that `QT_QPA_PLATFORM` is unset or set to `windows`/`xcb` as appropriate.
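As a hedged example of the PortAudio prerequisite (package names assume Debian/Ubuntu and Homebrew; adjust for your distribution):

```
# Debian/Ubuntu: PortAudio headers needed to build the PyAudio wheel
sudo apt-get install portaudio19-dev
# macOS with Homebrew
brew install portaudio
# then retry the install
pip install pyaudio
```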
## License

This project is distributed under the GNU Affero General Public License v3.0 (AGPL-3.0), which allows free use, modification, and redistribution, provided that network-accessible deployments also make their source available under the same terms.