This repository contains the code and supplementary materials for the paper titled Transformer-Based Transfer Learning on Self-Reported Voice Recordings for Parkinson’s Disease Diagnosis.
This project explores the use of advanced deep-learning techniques to diagnose Parkinson's Disease (PD) based on self-reported voice recordings. The methodology includes data collection, preprocessing, model training, and validation.
-
Data Collection:
- Voice recordings from participants using the mPower app.
-
Data Preprocessing:
- Resampling, padding, and converting audio waveforms to log Mel spectrograms.
-
Model Architectures:
- CNNs (ResNet50, DenseNet161) and Transformers (VIT, AST).
-
Training Regimens:
- Training from scratch and transfer learning with fine-tuning.
-
Validation and Evaluation:
- Subject-wise and record-wise data splits, evaluated by AUC and AUPRC.
- Python 3.10+
- PyTorch Lightning
- Other dependencies listed in
requirements.txt
- Clone the repository:
git clone https://github.com/Jetliqs/PD-VoiceDL.git
- Navigate to the project directory:
cd PD-VoiceDL - Install the dependencies:
pip install -r requirements.txt
- Prepare the dataset as described in the data preprocessing section.
- Open and run the Jupyter notebook:
jupyter notebook mmpd-voicedl.ipynb
We have already uploaded our model checkpoints here:
| Model | Checkpoint |
|---|---|
| AST (30 epochs) | download |
