Skip to content

JihyeMooon/Moon_Speech_Analysis_Software

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 

Repository files navigation

Real-Time Speech Analysis and Pathological Speech Detection Software

Introduction

image

In 2015, I developed a real-time pathological speech detection software using C++ and the Microsoft Foundation Class Library (MFC). This program provides real-time speech data recording, voice activity detection (also called speech end-point detection), pathological speech detection, and the ability to import external audio data.

This software detects speech endpoints using energy/zero crossing rate, computes various speech features, including Jitter, Shimmer, and high-order statistics, and detects pathological speeches based on a pre-trained decision tree model (a simple machine learning model!). The accuracy in detecting patholocial speeches was obtained as 83.11%.

The details for the software were published as a peer-reviwed paper at a Korean Journal in 2015.

For now, this GitHub repository releases C++ code (compatible with MFC) for only the real-time speech data recording and speech end-point detection parts. If you have any questions, please contact me at husky.jihye.moon@gmail.com.

Other Voice Activity Detection Codes

For additional voice activity detection algorithms, I also implemented three based on Autocorrelation Function (ACF), Average Magnitude Difference Function (AMDF), and Higher Order Differential Energy Operators (HODEO) respectively. Results for other voice activity detection methods are displayed below.

ACF, AMDF, and HODEO-based voice activity detection codes are avablible at Link!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages