Speaker Recognition and Speech Transcription Capstone Project

The goal of this project is a speaker recognition and speech transcription system that runs entirely locally.

Group Members

Adam Strom
Aidan Horan
Hannah Berthiaume
Parker Rowe

Project Structure

|
├── /backend
|   └── files for running our Python Flask backend server
|
├── /saved_models
|   └── trained models -- NOTE: git does not track the models to save on storage space
|
├── /scripts
|   └── shell scripts that are useful to us
|
├── /testing
|   └── for experimenting with our models
|
├── /training
|   |
|   ├── /testing_data // audio files used to test the model (data the model has never seen before)
|   |    └── *
|   |
|   ├── /training_data_wav // .wav files organized into one folder per class
|   |    ├── /{class_name1}
|   |    |    ├── {audio1.wav}
|   |    |    ├── ....
|   |    |    └── {audioX.wav}
|   |    ├── ....
|   |    └── /{class_nameX}
|   |
|   └── any files used for training our model using TensorFlow
|
├── /ui
|   └── a SvelteKit-based web GUI for interacting with our voice transcription system
|
├── /whisper.cpp
|   └── a git submodule containing a high-performance C/C++ port of OpenAI's Whisper model (see https://github.com/ggerganov/whisper.cpp)
|
├── .gitignore  // files that git should ignore
├── .gitmodules // git submodule configuration
└── README.md
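
The folder-per-class layout of `/training/training_data_wav` means each file's label can be read straight from its parent folder name. The snippet below is a minimal sketch of that idea, not the project's actual training code; the function name `list_labeled_wavs` is hypothetical, and it assumes every class folder directly contains its `.wav` files.

```python
from pathlib import Path


def list_labeled_wavs(root):
    """Return (wav_path, class_name) pairs from a folder-per-class tree.

    Hypothetical helper: assumes `root` contains one subfolder per class
    and that each subfolder directly holds that class's .wav files.
    """
    root = Path(root)
    pairs = []
    # Sort so the result is stable across filesystems.
    for class_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        for wav in sorted(class_dir.glob("*.wav")):
            # The folder name doubles as the class label.
            pairs.append((wav, class_dir.name))
    return pairs


# Example call, run from the repository root:
# pairs = list_labeled_wavs("training/training_data_wav")
```

Files that sit directly in the root folder, and anything that is not a `.wav` file, are skipped.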

Installation Instructions

To clone the repo, make sure Git is installed on your computer, then authenticate with GitHub using one of the following:
- SSH keys (this one is easier)
  - Once your key is set up, run the following command in your terminal from the folder where you want to save the repository:
```
git clone git@github.com:parkuman/capstone.git
```
- GitHub personal access token
  - Once your token is set up, run the following command in your terminal from the folder where you want to save the repository:
```
git clone https://github.com/parkuman/capstone.git
```
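Once cloned, check the README in each subfolder for instructions on installing that component. Because whisper.cpp is a git submodule, a plain clone leaves its folder empty; running `git submodule update --init --recursive` inside the repository fetches it.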
