
Extend the length of silences in an audio track to help with transcribing


mynameisfiber/pregnantpause


PregnantPause

What is it?

Transcribing can be hard, especially when the speaker doesn't pause properly between phrases. This little script intends to fix that: it outputs a new audio file with the pauses extended!

How does it work?

We compute the standard deviation of the waveform over successive windows and identify the regions whose windowed standard deviation is below the mean of those values. These regions are considered to be "silence". This assumption only really holds for speech, but since the focus is transcribing, I think it's quite a good assumption.
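
The idea above can be sketched roughly as follows. This is only an illustration, not the script's actual code: the function names `find_silence` and `extend_silence`, the non-overlapping windows, the `window` size and the `factor` parameter are all assumptions made for the example.

```python
import statistics


def find_silence(samples, window=4):
    """Return merged (start, end) index pairs of windows judged silent.

    Hypothetical helper: a window counts as "silence" when its standard
    deviation is below the mean standard deviation of all windows.
    """
    n_windows = len(samples) // window
    # Standard deviation of each non-overlapping window (an assumption;
    # overlapping windows would work along the same lines).
    stds = [
        statistics.pstdev(samples[i * window:(i + 1) * window])
        for i in range(n_windows)
    ]
    if not stds:
        return []
    threshold = statistics.fmean(stds)

    regions = []
    for i, std in enumerate(stds):
        if std < threshold:
            start, end = i * window, (i + 1) * window
            if regions and regions[-1][1] == start:
                # Adjacent silent windows are merged into one region.
                regions[-1] = (regions[-1][0], end)
            else:
                regions.append((start, end))
    return regions


def extend_silence(samples, regions, factor=2):
    """Return a new sample list with each silent region repeated `factor` times."""
    out = []
    pos = 0
    for start, end in regions:
        out.extend(samples[pos:start])                  # speech, unchanged
        out.extend(list(samples[start:end]) * factor)   # pause, lengthened
        pos = end
    out.extend(samples[pos:])
    return out
```

Calling `find_silence` on a list of integer samples and passing the result to `extend_silence` yields a longer list in which only the quiet stretches have grown. Real recordings would need a much larger window than the toy default used here.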

Installing

Installation is quite simple! Either clone the repo and install using python:

$ git clone https://github.com/mynameisfiber/pregnantpause.git
$ cd pregnantpause
$ sudo python setup.py install
$ pregnantpause.py --help

Or install it with pip:

$ [sudo] pip install pregnantpause
$ pregnantpause.py --help

Usage

Invoking pregnantpause.py with --help shows the list of available parameters. One thing to note is that the script only operates on WAV files!
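
For readers curious what working with WAV data looks like, here is a minimal sketch that uses Python's standard `wave` module. It is not taken from the script itself: the helper names `read_wav_samples` and `write_wav_samples` are hypothetical, and the sketch assumes 16-bit mono PCM audio.

```python
import struct
import wave


def read_wav_samples(path):
    """Return (frame_rate, samples) for a 16-bit mono PCM WAV file."""
    with wave.open(path, "rb") as wav:
        # Assumption for this sketch: 2 bytes per sample, one channel.
        if wav.getsampwidth() != 2 or wav.getnchannels() != 1:
            raise ValueError("this sketch only handles 16-bit mono WAV")
        frame_rate = wav.getframerate()
        raw = wav.readframes(wav.getnframes())
    # WAV stores samples as little-endian signed 16-bit integers.
    samples = list(struct.unpack("<%dh" % (len(raw) // 2), raw))
    return frame_rate, samples


def write_wav_samples(path, frame_rate, samples):
    """Write a list of 16-bit integer samples as a mono PCM WAV file."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(frame_rate)
        wav.writeframes(struct.pack("<%dh" % len(samples), *samples))
```

Passing a file that is not a WAV to `wave.open` raises `wave.Error`, so audio in other formats has to be converted to WAV first.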
