CCMH: Cross-Condition Mental Health Intelligent System

This repository contains the code for CCMH (Cross-Condition Mental Health), an intelligent decision support system for mental health text analysis using Blind Source Separation (BSS) methods.

📄 Paper

Title: CCMH: An Intelligent System for Cross-Condition Mental Health Text Analysis via Semantic Dictionary Learning

Status: Submitted to Expert Systems With Applications

Authors: Muhammad Usman Khalid, Shafiq ur Rehman, Malik Muhammad Nauman, Hatoon S. AlSagri, Sheikh Naeem Shafqat

📊 Dataset

This project uses the Reddit Mental Health Dataset by Low et al. (2020).

Download: https://zenodo.org/records/3941387

Citation:

Low, D. M., Rumker, L., Talkar, T., Torous, J., Cecchi, G., & Ghosh, S. S. (2020). 
Natural Language Processing Reveals Vulnerable Mental Health Support Groups and 
Heightened Health Anxiety on Reddit During COVID-19: Observational Study. 
Journal of Medical Internet Research, 22(10), e22635.

🚀 Getting Started

Prerequisites

MATLAB R2020a or later
Python 3.8+ (for sentence embeddings)
sentence-transformers library (pip install sentence-transformers)
Required MATLAB toolboxes:
- Statistics and Machine Learning Toolbox
- Signal Processing Toolbox

Installation

Clone this repository:

git clone https://github.com/usmankhalid06/CCMH.git
cd CCMH

Download the dataset from Zenodo
Extract the dataset to your working directory
Install Python dependencies:

pip install sentence-transformers pandas numpy

📁 File Structure

CCMH/
├── script_Sentence_Transformer_preCovid.m  # Main analysis pipeline
├── clean_reddit_post.m                      # Text preprocessing
├── get_sentence_embeddings.m                # Sentence transformer interface
├── find_K_multiple_criteria.m               # Dictionary size selection (AIC/BIC)
├── my_KSVD.m                               # K-SVD algorithm
├── my_ODL.m                                # Online Dictionary Learning
├── my_ACSD.m                               # Adaptive Consistent Sequential DL
├── SDL.m                                   # Shared Dictionary Learning (proposed)
├── my_sparse_encode.m                      # Sparse coding implementation
└── README.md

💻 Usage

Step 1: Preprocess Data

% Clean and preprocess Reddit posts
cleaned_text = clean_reddit_post(raw_posts);

Step 2: Generate Sentence Embeddings

% Generate 384-dimensional embeddings using all-MiniLM-L6-v2
embeddings = get_sentence_embeddings(cleaned_text);

Step 3: Run Main Analysis

% Execute complete pipeline
script_Sentence_Transformer_preCovid

This will:

Load preprocessed data
Determine optimal dictionary size (K) using AIC/BIC
Learn dictionaries using all four algorithms
Perform statistical validation
Generate figures

📖 Core Functions

Dictionary Learning Algorithms

my_KSVD.m - K-SVD dictionary learning
my_ODL.m - Online dictionary learning with LARS
my_ACSD.m - Adaptive consistent sequential dictionary learning
SDL.m - Shared dictionary learning (our proposed method)

For K-SVD and ODL you need to download SPAMS toolbox from here https://thoth.inrialpes.fr/people/mairal/spams/ to run mexOMP and mexLasso

Utilities

clean_reddit_post.m - Text preprocessing (remove HTML, URLs, formatting)
get_sentence_embeddings.m - Generate sentence transformer embeddings
find_K_multiple_criteria.m - Model selection (AIC, BIC, variance explained)
my_sparse_encode.m - Sparse coding with adaptive L1 regularization

🔧 Key Parameters

Dictionary size (K): Determined by 70% variance explained criterion
Sparsity (λ): 20 for cross-condition analysis, algorithm-specific for training
Iterations: 30 for all dictionary learning methods

📊 Outputs

The analysis generates:

Learned dictionary atoms for each algorithm
Activation matrices (11 conditions × K atoms)
Cross-algorithm validation metrics
Condition clustering visualizations
Discriminative atom analysis

🧪 Reproducing Results

To reproduce paper results:

% Ensure dataset is in path
addpath('path/to/reddit/data');

% Run main script
script_Sentence_Transformer_preCovid

% Results will be saved in figures/ directory

📧 Contact

For questions or issues, please contact:

Muhammad Usman Khalid: [mukhalid@imamu.edu.sa]
Corresponding Author: malik.nauman@ubd.edu.bn

🙏 Acknowledgments

This work was supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University (IMSIU) (grant number IMSIU-DDRSP2504).

📚 Citation

If you use this code, please cite:

@article{khalid2025ccmh,
  title={CCMH: An Intelligent System for Cross-Condition Mental Health Text Analysis via Semantic Dictionary Learning},
  author={Khalid, Muhammad Usman and Rehman, Shafiq ur and Nauman, Malik Muhammad and AlSagri, Hatoon S. and Shafqat, Sheikh Naeem},
  journal={Expert Systems With Applications},
  year={2025},
  note={Submitted}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CCMH: Cross-Condition Mental Health Intelligent System

📄 Paper

📊 Dataset

🚀 Getting Started

Prerequisites

Installation

📁 File Structure

💻 Usage

Step 1: Preprocess Data

Step 2: Generate Sentence Embeddings

Step 3: Run Main Analysis

📖 Core Functions

Dictionary Learning Algorithms

Utilities

🔧 Key Parameters

📊 Outputs

🧪 Reproducing Results

📧 Contact

🙏 Acknowledgments

📚 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
SDL.m		SDL.m
clean_reddit_post.m		clean_reddit_post.m
find_K_multiple_criteria.m		find_K_multiple_criteria.m
get_sentence_embeddings.m		get_sentence_embeddings.m
my_ACSD.m		my_ACSD.m
my_KSVD.m		my_KSVD.m
my_ODL.m		my_ODL.m
my_sparse_encode.m		my_sparse_encode.m
script_Sentence_Transformer_preCovid.m		script_Sentence_Transformer_preCovid.m

Folders and files

Latest commit

History

Repository files navigation

CCMH: Cross-Condition Mental Health Intelligent System

📄 Paper

📊 Dataset

🚀 Getting Started

Prerequisites

Installation

📁 File Structure

💻 Usage

Step 1: Preprocess Data

Step 2: Generate Sentence Embeddings

Step 3: Run Main Analysis

📖 Core Functions

Dictionary Learning Algorithms

Utilities

🔧 Key Parameters

📊 Outputs

🧪 Reproducing Results

📧 Contact

🙏 Acknowledgments

📚 Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages