GitHub - PoorvikaN/ECG-Federated-Learning.: Explainable Federated Learning for Secure and Transparent Medical Diagnosis in IoT-based Smart Hospitals

DAYANANDA SAGAR UNIVERSITY

School of Engineering
Department of Computer Science and Engineering (Cyber Security)

(A State Private University under the Karnataka Act No. 20 of 2013)
Approved by UGC & AICTE, New Delhi

Explainable Federated Learning for Secure and Transparent Medical Diagnosis in IoT-based Smart Hospitals

High-Fidelity ML-based ECG Classification using Federated Learning & Explainability

TTEH Lab

Overview

This project presents an ECG classification system using Federated Learning combined with Explainable AI (XAI) for secure and transparent medical diagnosis.

Electrocardiogram (ECG) signals are widely used for detecting cardiac abnormalities, but sharing such sensitive patient data across hospitals raises serious privacy concerns. To address this, we implement a Federated Learning framework where multiple simulated hospitals (clients) train models locally on their own ECG data without sharing raw data.

A global model is then constructed by aggregating the locally trained models using the Flower federated learning framework.

In addition to model training, this project integrates Explainability techniques (SHAP) to interpret model predictions, enabling better transparency and trust in clinical decision-making.

The system is evaluated in both centralized and federated settings, demonstrating that federated learning can achieve comparable performance while preserving data privacy.

This work aims to contribute toward privacy-preserving, interpretable AI solutions for smart healthcare systems.

Keywords: Federated Learning, ECG Classification, Explainable AI, Healthcare AI, Privacy-Preserving Machine Learning

Problem Statement

Cardiovascular diseases are one of the leading causes of mortality worldwide. Early detection using ECG signals is crucial for timely diagnosis. However, training machine learning models on ECG data requires access to large amounts of patient data, which raises serious privacy and security concerns.

Traditional centralized learning approaches require data to be collected and stored in a single location, increasing the risk of data breaches and violating healthcare data regulations.

Therefore, there is a need for a privacy-preserving, scalable, and interpretable system that can:

Train models without sharing sensitive medical data
Maintain high diagnostic accuracy
Provide transparency in model predictions

This project addresses these challenges using Federated Learning and Explainable AI.

Proposed Architecture

The system follows a distributed federated learning pipeline where clients collaboratively train a global model without sharing raw data.

Component	Description	Technology Used
ECG Dataset	Raw ECG signals from MIT-BIH Arrhythmia Dataset	PhysioNet
Data Preprocessing	Signal extraction, segmentation, normalization, labeling	NumPy, WFDB
Clients (Hospitals)	Simulated distributed nodes training on local ECG data	Flower Clients
Local Model	ECG classification model trained independently at each client	PyTorch
Server	Central aggregator coordinating federated learning	Flower Server
Aggregation	Combines model weights from all clients using Federated Averaging (FedAvg)	Flower Strategy
Global Model	Updated shared model distributed back to clients	PyTorch
Explainability	Interprets model predictions using SHAP	SHAP Library
Results Storage	Stores training results, logs, and plots	Local Storage
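
The aggregation row above names Federated Averaging (FedAvg). As a rough, framework-free illustration of that idea (not the Flower strategy code this project runs), the sketch below averages per-client parameter lists, weighting each client by its number of local samples. The function name `fedavg`, the toy weights, and the sample counts are all assumptions made for the example.

```python
import numpy as np

def fedavg(client_weights, client_sizes):
    """Weighted average of per-client parameter lists (FedAvg).

    client_weights: one list of np.ndarray per client, same shapes across clients.
    client_sizes:   number of local training samples held by each client.
    """
    total = float(sum(client_sizes))
    aggregated = []
    for layer in range(len(client_weights[0])):
        # Scale each client's layer by its share of the total data, then sum.
        weighted = sum(
            w[layer] * (n / total) for w, n in zip(client_weights, client_sizes)
        )
        aggregated.append(weighted)
    return aggregated

# Three hypothetical clients, each holding one 2x2 weight matrix and one bias vector.
clients = [
    [np.full((2, 2), 1.0), np.array([0.0, 0.0])],
    [np.full((2, 2), 2.0), np.array([1.0, 1.0])],
    [np.full((2, 2), 4.0), np.array([2.0, 2.0])],
]
sizes = [100, 100, 200]

global_weights = fedavg(clients, sizes)
print(global_weights[0][0, 0])  # 2.75
```

The third client holds half of the data, so its parameters contribute half of the average.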

System Architecture

The proposed system follows a federated learning architecture with explainability support.

Components:

Clients (Hospitals) → Local ECG training
Server → Model aggregation
Global Model → Shared knowledge
Explainability Module → SHAP-based interpretation

How It Works

ECG data is loaded from the MIT-BIH dataset
Signals are preprocessed and converted into training samples
Data is split into multiple clients (simulated hospitals)
Each client trains a local model independently
Flower framework coordinates training across clients
Model weights are sent to the server
Server aggregates updates to create a global model
Process repeats for multiple rounds
Final global model is evaluated
SHAP is used to explain model predictions
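
The client split in the steps above can be sketched as follows. This README does not say how samples are assigned to hospitals, so the example assumes a simple shuffled, near-equal (IID) split; `split_into_clients`, the seed, and the random stand-in data are hypothetical.

```python
import numpy as np

def split_into_clients(X, y, num_clients=3, seed=0):
    """Shuffle sample indices and cut them into near-equal shards, one per client."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(X))
    shards = np.array_split(order, num_clients)
    return [(X[idx], y[idx]) for idx in shards]

# Stand-in data: 10 'beats' of 180 samples each with binary labels.
X = np.random.default_rng(1).normal(size=(10, 180)).astype(np.float32)
y = np.arange(10) % 2

parts = split_into_clients(X, y)
for i, (Xc, yc) in enumerate(parts):
    print(f"client {i}: {Xc.shape[0]} samples")
```

A non-IID split (for example, giving each hospital a different mix of classes) would be a more demanding test of federated training, but it is outside what this sketch shows.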

Performance Evaluation

Centralized Model

Trained on full dataset
Achieves stable accuracy
Serves as baseline for comparison

Federated Model

3 simulated clients
Model trained without sharing raw data
Accuracy comparable to centralized approach

Key Findings:

Federated learning preserves privacy
Minimal drop in accuracy compared to centralized model
Model converges successfully across rounds

Metrics Used:

Accuracy
Loss
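
For reference, the two metrics can be computed from raw model outputs roughly as below. This is a NumPy sketch that assumes the loss is the usual multi-class cross-entropy, which this README does not state; the function names and toy logits are made up for the example.

```python
import numpy as np

def accuracy(logits, labels):
    """Fraction of samples whose highest-scoring class matches the label."""
    return float(np.mean(np.argmax(logits, axis=1) == labels))

def cross_entropy(logits, labels):
    """Mean negative log-likelihood, using a numerically stable log-softmax."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-np.mean(log_probs[np.arange(len(labels)), labels]))

logits = np.array([[2.0, 0.0], [0.0, 1.0], [3.0, 1.0], [0.5, 0.2]])
labels = np.array([0, 1, 1, 0])

print(accuracy(logits, labels))       # 0.75
print(cross_entropy(logits, labels))  # positive; lower is better
```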

Conclusion:

Federated learning is effective for ECG classification while maintaining data privacy.

Training Performance

Explainability (SHAP)

To improve model transparency, SHAP (SHapley Additive Explanations) is used to interpret predictions.

SHAP identifies which parts of the ECG signal contribute most to classification
Helps understand model decision-making
Important for clinical trust and validation

Observations:

Certain waveform regions (QRS complex) show higher importance
Model focuses on key ECG patterns for classification
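
To make the idea of per-region importance concrete, the sketch below computes exact Shapley values by brute force for a toy signal split into three regions. It is only an illustration of the quantity that SHAP approximates, not the code in explain.py: the toy model, the region boundaries, and the all-zero baseline are assumptions.

```python
from itertools import combinations
from math import factorial

import numpy as np

def shapley_values(model, x, baseline, segments):
    """Exact Shapley value of each segment for a single input.

    A segment that is absent from a coalition is replaced by the baseline.
    """
    n = len(segments)

    def value(coalition):
        masked = baseline.copy()
        for k in coalition:
            masked[segments[k]] = x[segments[k]]
        return model(masked)

    phi = np.zeros(n)
    for i in range(n):
        others = [k for k in range(n) if k != i]
        for r in range(n):
            weight = factorial(r) * factorial(n - r - 1) / factorial(n)
            for subset in combinations(others, r):
                phi[i] += weight * (value(subset + (i,)) - value(subset))
    return phi

# Toy 'beat' of 9 samples; the middle region stands in for a QRS-like peak.
segments = [slice(0, 3), slice(3, 6), slice(6, 9)]
x = np.array([0.1, 0.1, 0.1, 1.0, 2.0, 1.0, 0.2, 0.2, 0.2])
baseline = np.zeros_like(x)

def toy_model(signal):
    # Hypothetical score dominated by the peak in the middle region.
    return 3.0 * signal[3:6].max() + 0.5 * signal[:3].sum()

phi = shapley_values(toy_model, x, baseline, segments)
print(phi)  # the middle region receives by far the largest attribution
```

The attributions sum to the difference between the model output on the input and on the baseline. Brute force is only feasible for a handful of regions, which is why libraries such as SHAP rely on approximations.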

Output:

SHAP summary plots are generated and stored in the results/ folder

Code Architecture

ECG-Federated-Learning/
│
├── data/
├── notebooks/
├── results/
│   ├── .getkeep
│   ├── accuracy.png
│   ├── ecg_sample.png
│   ├── shap_plot.png
│   ├── system_architecture.png
├── src/
│   ├── model.py
│   ├── data_utils.py
│   ├── train_baseline.py
│   ├── federated_simulation.py
│   ├── explain.py
│   └── config.py
│
├── main.py
├── requirements.txt
└── README.md

Core Modules

1. data_utils.py

Handles:

Dataset loading
ECG signal extraction
Preprocessing
Label creation
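
A minimal sketch of the extraction and preprocessing steps, assuming beats are cut as fixed windows around R-peak positions and z-score normalized per beat. The window size, the function name `segment_beats`, and the random stand-in signal are hypothetical; in the real dataset the peak positions would come from the annotation files, and label creation is not covered here.

```python
import numpy as np

def segment_beats(signal, r_peaks, half_window=90):
    """Cut a fixed window around each R-peak and normalize each beat."""
    beats = []
    for r in r_peaks:
        start, end = r - half_window, r + half_window
        if start < 0 or end > len(signal):
            continue  # skip beats too close to the edges of the recording
        beat = signal[start:end]
        # Per-beat z-score; the small constant avoids division by zero.
        beats.append((beat - beat.mean()) / (beat.std() + 1e-8))
    if not beats:
        return np.empty((0, 2 * half_window), dtype=np.float32)
    return np.stack(beats).astype(np.float32)

# Stand-in for one ECG channel and four annotated peak positions.
signal = np.random.default_rng(0).normal(size=1000)
r_peaks = [50, 200, 500, 950]

beats = segment_beats(signal, r_peaks)
print(beats.shape)  # (2, 180): the first and last peaks are too close to the edges
```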

2. model.py

Defines:

ECG classification model (1D CNN / simple model)
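
For orientation, a small 1D CNN of the kind mentioned above might look like the PyTorch sketch below. The layer sizes, the class name `ECGNet`, the five output classes, and the 180-sample input length are assumptions, not the architecture defined in model.py.

```python
import torch
import torch.nn as nn

class ECGNet(nn.Module):
    """Small 1D CNN for single-lead beat classification (illustrative only)."""

    def __init__(self, num_classes=5):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=7, padding=3),
            nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(16, 32, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),  # makes the model independent of beat length
        )
        self.classifier = nn.Linear(32, num_classes)

    def forward(self, x):
        # x has shape (batch, 1, samples)
        x = self.features(x).squeeze(-1)
        return self.classifier(x)

model = ECGNet()
logits = model(torch.randn(4, 1, 180))
print(logits.shape)  # torch.Size([4, 5])
```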

3. train_baseline.py

Implements:

Centralized training
Performance comparison

4. federated_simulation.py

Handles:

Client creation
Flower simulation
Model aggregation

5. explain.py

Implements:

SHAP explainability
Visualization of feature importance

6. config.py

Contains:

Hyperparameters
Training settings
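
One possible shape for such a settings module is a frozen dataclass, sketched below. Only the client count (3) and the directory names come from this README; every other field name and default value is a placeholder, not the project's actual configuration.

```python
from dataclasses import asdict, dataclass

@dataclass(frozen=True)
class Config:
    # Paths taken from the repository layout described in this README.
    data_dir: str = "data/mit-bih-arrhythmia-database-1.0.0"
    results_dir: str = "results"
    # Federated settings: 3 simulated clients, as stated above.
    num_clients: int = 3
    num_rounds: int = 5          # placeholder value
    # Local training settings (all placeholder values).
    local_epochs: int = 1
    batch_size: int = 32
    learning_rate: float = 1e-3
    seed: int = 42

cfg = Config()
print(asdict(cfg))
```

Freezing the dataclass prevents a training script from silently changing a setting partway through a run.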

Setup & Usage

1. Install dependencies

pip install -r requirements.txt

2. Download dataset

We use the MIT-BIH Arrhythmia Dataset

🔗 Download here:
https://physionet.org/content/mitdb/1.0.0

Dataset is NOT included in this repository.

After downloading, place it inside:

data/
└── mit-bih-arrhythmia-database-1.0.0/
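
Before training, it can help to confirm that the dataset landed in the expected place. The sketch below is a hypothetical helper (not part of this repository) that lists records having all three of the header (.hea), signal (.dat), and annotation (.atr) files used by MIT-BIH records.

```python
from pathlib import Path

def find_records(data_dir):
    """Return the names of records that have .hea, .dat, and .atr files."""
    root = Path(data_dir)
    if not root.is_dir():
        return []
    records = []
    for header in sorted(root.glob("*.hea")):
        has_signal = header.with_suffix(".dat").exists()
        has_annotations = header.with_suffix(".atr").exists()
        if has_signal and has_annotations:
            records.append(header.stem)
    return records

records = find_records("data/mit-bih-arrhythmia-database-1.0.0")
print(f"Found {len(records)} complete records")
```

An empty result usually means the archive was extracted to a different folder or one level too deep.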

3. Run centralized training

python src/train_baseline.py

4. Run federated learning

python src/federated_simulation.py

5. Run explainability

python src/explain.py

Implementation Results

Successfully implemented federated learning with 3 simulated clients
Verified model training across distributed datasets
Achieved stable convergence across training rounds
Demonstrated privacy-preserving training
Generated SHAP plots for interpretability
Compared centralized vs federated performance

Training Accuracy

SHAP Feature Importance

Sample ECG Signal

Limitations

The simulation uses only a small number of clients (3 simulated hospitals)
The dataset is relatively small
The model architecture is relatively simple
Federated learning overhead increases computation time
SHAP explanations may be computationally expensive

Contributors

Name	USN	Email
Poorvika N	ENG23CY0030	poorvikan99@gmail.com
B.Tanusree reddy	ENG23CY0054	bojja104@gmail.com
D.Himaja Sri vyshnavi	ENG23CY0061	himaja210205@gmail.com
K N Navya	ENG23CY0019	knnavya27@gmail.com
Pooja N	ENG23CY0029	poojanarayan0906@gmail.com

Mentor

Dr. Prajwalasimha S N
Associate Professor, Department of Computer Science and Engineering (Cyber Security)
School of Engineering, Dayananda Sagar University

Email: prajwasimha.sn1@gmail.com
