In this project we have compared the umbilical cord blood (UCB) methylome of neonates that were born after in vitro fertilisation (IVF) procedures using different IVF culture media, namely G5 (Vitrolife) and HTF (Lonza). Data were generated using Illumina’s Infinium MethylationEPIC bead chip that profiles around 850,000 CpG sites across the human genome. Furthermore, data from birth cohorts (ENVIRONAGE and FLEHS) were included to compare the methylomes of IVF and naturally conceived neonates.
The manifest_filter directory contains the manifest files used for the targeted analyses and the scripts directory contains the scripts used for data processing, analysis and visualization. A brief description of the contents of the scripts is given below:
The build files can be used to create singularity images containing all the necessary software and packages for the following purposes:
- Data processing using the RnBeads package
- Cell type deconvolution using reference data
- statistical testing with mixed effects linear models as implemented in the dream package
- Data were first pre-processed using the RnBeads package which applied the following functions:
- Normalisation using the SWAN method
- greedycut (p-value threshold 0.05) removal of poor quality sites and samples
- removal of probes on the sex chromosomes
- removal of probes containing SNPs
- removal of probes not in the CpG context
- removal of probes with missing values in 5% or more of the samples
- Sex estimation using the method from the sEst package:
- Cell composition estimates are calculated using the IDOL optimized probes from the package FlowSorted.Blood.Epic:
- Principal component analysis & determination of associations between PCs and sample characteristics
- Differential methylation analysis (sites) using eBayes moderated mixed effects linear models as implemented in the VariancePartition package
- Differential methylation analysis on a regional level using eBayes moderated mixed effects linear models as implemented in the VariancePartition package. Groups of interest: genes, promoters, CpG islands
- Visualisation of the differential methylation analyses as volcano plots
- Determination of outliers using a threshold-based approach
- Application of the iEVORA algorithm to identify differentially variable CpG sites
- Application of the Knight epigenetic gestational age predictor
- Calculation of epigenetic gestational age based on the Bohlin method
- Combination of the resultant plots from the scripts above for publication