Speech Service

A speech service built on current AI models such as Whisper (speech recognition) and NLLB (translation).

Testing is done in a Docker container, which also works under the Windows Subsystem for Linux (WSL). An NVIDIA graphics card with at least 4 GB of VRAM is recommended; the actual requirement depends on the models used. CUDA is part of the Docker image, so only the NVIDIA graphics driver needs to be installed on the host.

Docker must have GPU (CUDA) support enabled. For WSL, see https://docs.nvidia.com/cuda/wsl-user-guide/index.html.
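
One way to confirm that containers can see the GPU is to run `nvidia-smi` inside a throwaway container. This is only a sketch: it assumes the NVIDIA Container Toolkit (or Docker Desktop with WSL 2 GPU support) is already set up, and the `check_docker_gpu` helper name and the `ubuntu` image are illustrative choices, not part of this repository.

```shell
# Hypothetical helper (not part of this repository):
# checks whether Docker containers can access the NVIDIA GPU.
check_docker_gpu() {
  if ! command -v docker >/dev/null 2>&1; then
    echo "docker not found" >&2
    return 1
  fi
  # --gpus all exposes all GPUs to the container;
  # nvidia-smi is provided inside the container by the NVIDIA runtime.
  docker run --rm --gpus all ubuntu nvidia-smi
}
```

Calling `check_docker_gpu` should print the usual `nvidia-smi` table if GPU access works; an error from Docker usually points to a missing driver or toolkit setup.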

Start as local service with Test-UI

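Clone https://github.com/andrePankraz/speech_service and start the containers:
```
$ git clone https://github.com/andrePankraz/speech_service
$ cd speech_service
$ export DOCKER_BUILDKIT=1
$ docker compose up
```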
- The first start takes some time, because images and packages are downloaded (>10 GB).
- Wait and check that the containers are up and running.

Go to URL: http://localhost:8200/
- The first start takes some time, because models are downloaded (several GB).
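
The "wait and check" step above can be scripted by polling the URL until it answers. The following is a minimal sketch, assuming `curl` is installed; the `wait_for_url` helper and its default timeout are hypothetical and not part of this repository.

```shell
# Hypothetical helper (not part of this repository):
# poll a URL until it answers or a timeout (in seconds) expires.
wait_for_url() {
  url="$1"
  timeout="${2:-600}"
  elapsed=0
  # -s silences progress output, -f treats HTTP errors (>= 400) as failure,
  # -o /dev/null discards the response body.
  until curl -sf -o /dev/null "$url"; do
    if [ "$elapsed" -ge "$timeout" ]; then
      echo "timed out waiting for $url" >&2
      return 1
    fi
    sleep 2
    elapsed=$((elapsed + 2))
  done
  echo "$url is up"
}
```

For example, `wait_for_url http://localhost:8200/ 900` waits up to 15 minutes for the Test-UI to respond.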

Start for Development

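Clone https://github.com/andrePankraz/speech_service and start the containers with the development environment file:
```
$ git clone https://github.com/andrePankraz/speech_service
$ cd speech_service
$ export DOCKER_BUILDKIT=1
$ docker compose --env-file docker/.envs/dev.env up
```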
- The first start takes some time, because images and packages are downloaded (>10 GB).
- Wait and check that the containers are up and running.

Install VS Code
- Install the extensions:
  - Dev Containers
  - Docker
  - Markdown All in One

Attach VS Code to the Docker container
- Attach to running containers... (lower left corner in VS Code)
  - Select speech_service-python-1
- In the Explorer, open the folder /opt/speech_service
- Run / Start Debugging
  - The VS Code Python extension is installed the first time; wait for it to finish, then start debugging again
  - Select the Python interpreter

Go to URL: http://localhost:8200/
- The first start takes some time, because models are downloaded (several GB).
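
Before attaching VS Code, it can help to confirm that the container named in the steps above is actually running. This is a small sketch; the `container_running` helper is hypothetical and not part of this repository.

```shell
# Hypothetical helper (not part of this repository):
# succeeds if a container with exactly this name is running.
container_running() {
  # docker ps lists running containers; --filter narrows by name,
  # --format prints only the container names, one per line.
  # grep -qxF requires an exact, literal, whole-line match.
  docker ps --filter "name=$1" --format '{{.Names}}' | grep -qxF "$1"
}
```

For example, `container_running speech_service-python-1 && echo "ready to attach"`.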
