Using DNA methylation (DNAm) data and regression models to develop epigenetic clocks that can estimate canine age. This was performed as part of a research training grant challenge, led by UCLA's Institute for Quantitative and Computational Biosciences (QCBio) and supported by NIH's Big Data to Knowledge (BD2K) initiative
- canineAge_dataMatrix.txt.gz Input file, with sample methylation levels at CpG sites
- canineAge_regressionModeling.ipynb Jupyter notebook with data analysis workflow
- Data Preparation
- PCA Feature Reduction
- Linear Regression
- Hyperparameter Tuning
- Elastic Net Regression
- Bootstrapping for Enhanced Variable Selection
- Distance-Weighted Prediction Interpolation
- Age Estimates Output