GitHub - kmeranda/disfluency_remover: This program trains a model on labelled training data to remove speech disfluencies, such as "um," "like," "you know," etc to keep only the core part of the text.

Kelsey Meranda

Instructions to run: disfluency_remover.py trains a bigram model on data/train.txt and tests the model on data/test.txt and outputs to output.log by default. use the "-h" or "--help" flag to see what other options are available.

Note: the trigram model takes a long time to run, so it prints out percent complete as it runs

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
README.md		README.md
disfluency_remover.py		disfluency_remover.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

kmeranda/disfluency_remover

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages