GitHub - AtharvSabde/Cognitive-Stress-Detection-System

Methodology Overview

The pipeline consists of four main components:

Audio Preprocessing (preprocess.py):

Background noise removal using spectral gating and high-pass filtering
Leading/trailing silence removal
Pause detection with configurable duration thresholds
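The silence-removal and pause-detection steps can be sketched as below. This is a minimal illustration and not the code in preprocess.py: it assumes frame-level RMS energy as the silence criterion, and the function names, the 20 ms frame size, and the thresholds are all hypothetical. Spectral gating and high-pass filtering are not shown.

```python
import numpy as np

def silent_frames(signal, sr, frame_s=0.02, rms_threshold=0.01):
    """Return (boolean mask of silent frames, frame length in samples)."""
    frame_len = int(round(sr * frame_s))
    n_frames = len(signal) // frame_len
    frames = np.asarray(signal[: n_frames * frame_len], dtype=float)
    rms = np.sqrt((frames.reshape(n_frames, frame_len) ** 2).mean(axis=1))
    return rms < rms_threshold, frame_len

def trim_silence(signal, sr, frame_s=0.02, rms_threshold=0.01):
    """Drop leading and trailing frames whose RMS is below the threshold."""
    silent, frame_len = silent_frames(signal, sr, frame_s, rms_threshold)
    voiced = np.flatnonzero(~silent)
    if voiced.size == 0:
        return signal[:0]
    return signal[voiced[0] * frame_len : (voiced[-1] + 1) * frame_len]

def detect_pauses(signal, sr, min_pause_s=0.3, frame_s=0.02, rms_threshold=0.01):
    """Return (start_s, end_s) for each interior silent run of at least min_pause_s."""
    silent, _ = silent_frames(signal, sr, frame_s, rms_threshold)
    voiced = np.flatnonzero(~silent)
    if voiced.size == 0:
        return []
    min_frames = int(round(min_pause_s / frame_s))
    pauses, run_start = [], None
    # Only scan between the first and last voiced frame, so edge silence is ignored.
    for i in range(int(voiced[0]), int(voiced[-1]) + 1):
        if silent[i] and run_start is None:
            run_start = i
        elif not silent[i] and run_start is not None:
            if i - run_start >= min_frames:
                pauses.append((round(run_start * frame_s, 3), round(i * frame_s, 3)))
            run_start = None
    return pauses
```

The configurable duration threshold corresponds to min_pause_s here; raising it keeps only longer hesitations.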

Speech-to-Text Conversion (aud_to_text.py):

Audio transcription using OpenAI's Whisper model (via Hugging Face transformers)
Output in structured formats (text and CSV)
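A minimal sketch of this stage follows. It assumes the Hugging Face automatic-speech-recognition pipeline; the checkpoint name openai/whisper-small, the .wav glob, the output file names, and both function names are illustrative assumptions and are not taken from the repository's script.

```python
import csv
from pathlib import Path

def load_whisper(model_name="openai/whisper-small"):
    """Build a transcribe(path) -> str callable; the checkpoint name is an assumption."""
    from transformers import pipeline  # imported lazily because it is a heavy dependency
    asr = pipeline("automatic-speech-recognition", model=model_name)
    return lambda path: asr(str(path))["text"].strip()

def transcribe_folder(audio_dir, out_dir, transcribe):
    """Write one .txt per audio file plus a summary CSV; return the CSV rows."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    rows = []
    for audio_path in sorted(Path(audio_dir).glob("*.wav")):
        text = transcribe(audio_path)
        (out_dir / f"{audio_path.stem}.txt").write_text(text, encoding="utf-8")
        rows.append({"file": audio_path.name, "transcript": text})
    with open(out_dir / "transcripts.csv", "w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=["file", "transcript"])
        writer.writeheader()
        writer.writerows(rows)
    return rows
```

Passing the transcriber in as a callable keeps the file handling testable without downloading a model; in real use it would be called as transcribe_folder("audio_folder", "transcribe_1", load_whisper()).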

Feature Extraction (analysis.py):

Gemini API integration for sophisticated linguistic analysis
Extraction of 15 standardized features across four categories:

Fluency and hesitation markers
Prosodic and temporal characteristics
Lexical retrieval abilities
Sentence structure and completion metrics
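One way to get a fixed, standardized feature vector out of a language model is to ask for JSON with known keys and validate the reply. The sketch below assumes that approach. The feature names are hypothetical placeholders (the 15 actual features are not listed here), and the model call is passed in as a generate(prompt) -> str callable so the parsing can be checked without an API key.

```python
import json

# Hypothetical feature names, grouped by the four categories above.
FEATURES = {
    "fluency_hesitation": ["filler_word_count", "repetition_count"],
    "prosodic_temporal": ["speech_rate_wpm"],
    "lexical_retrieval": ["word_finding_difficulty"],
    "sentence_structure": ["incomplete_sentence_count"],
}
FEATURE_NAMES = [name for group in FEATURES.values() for name in group]

def build_prompt(transcript):
    """Ask for one JSON object with exactly the expected numeric keys."""
    return (
        "Analyze the transcript and reply with a single JSON object whose keys are "
        "exactly: " + ", ".join(FEATURE_NAMES) + ". All values must be numbers.\n\n"
        "Transcript:\n" + transcript
    )

def parse_features(reply):
    """Pull the JSON object out of a model reply and coerce the values to float."""
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end < start:
        raise ValueError("no JSON object found in reply")
    data = json.loads(reply[start : end + 1])
    missing = [name for name in FEATURE_NAMES if name not in data]
    if missing:
        raise ValueError(f"reply is missing features: {missing}")
    return {name: float(data[name]) for name in FEATURE_NAMES}

def extract_features(transcript, generate):
    """generate is any callable mapping a prompt string to the model's text reply."""
    return parse_features(generate(build_prompt(transcript)))
```

With the google-generativeai package, generate could be something like lambda p: model.generate_content(p).text for a configured GenerativeModel; the choice of model name is left open here.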

Machine Learning Analysis (ml.py):

Primary Method: Isolation Forest for anomaly detection

Unsupervised learning approach suitable for detecting deviations from normal speech patterns
Contamination parameter set to 0.3 to identify the most anomalous 30% of samples

Feature importance analysis using Spearman correlation
Dimensionality reduction using PCA for visualization
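The three analysis steps can be combined as in the sketch below, using scikit-learn and SciPy. Only the Isolation Forest and contamination=0.3 come from the description above. Standardizing the features first, measuring importance as the absolute Spearman correlation between each feature and the anomaly score, and projecting to two principal components are assumptions, and the function name analyze is hypothetical.

```python
import numpy as np
from scipy.stats import spearmanr
from sklearn.decomposition import PCA
from sklearn.ensemble import IsolationForest
from sklearn.preprocessing import StandardScaler

def analyze(features, contamination=0.3, seed=0):
    """Return (labels, anomaly scores, per-feature importance, 2-D PCA coordinates)."""
    X = StandardScaler().fit_transform(features)  # assumption: features are standardized

    forest = IsolationForest(contamination=contamination, random_state=seed).fit(X)
    labels = forest.predict(X)          # -1 = flagged as anomalous, 1 = normal
    scores = -forest.score_samples(X)   # negated so that higher means more anomalous

    # Assumed importance measure: |Spearman rho| between each feature and the score.
    importance = np.array(
        [abs(spearmanr(X[:, j], scores)[0]) for j in range(X.shape[1])]
    )

    coords = PCA(n_components=2).fit_transform(X)  # for a 2-D scatter plot
    return labels, scores, importance, coords
```

Because contamination fixes the decision threshold at the 30th percentile of the training scores, roughly 30% of the fitted samples are flagged however unusual they actually are; the flag marks the most atypical recordings in the set and is not a calibrated measure of stress.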

Folders and files (4 commits):

audio_folder
results
transcibe
transcribe_1
README.md
Report -MemoTag.docx
analysis.py
aud _to_text.py
ml.py
preprocess.py
