MetaDICT is a method for the integration of microbiome datasets. This method is designed to remove batch effects and preserve biological variation while integrating heterogeneous datasets. MetaDICT consists of two stages: the first stage provides an initial estimation of batch effect via covariate balancing, while batch effects is defined as heterogeneous capturing efficiency in sequencing measurement; the second stage refines the estimation by shared dictionary learning. Compared with other existing methods, MetaDICT can better avoid overcorrection when unobserved confounding variables are present. The integrated data set can be applied to downstream analysis such as PCoA, taxa/sample community detection and differential abundance test.
The package can be downloaded from github:
devtools::install_github("BoYuan07/MetaDICT")
Load the package:
library(MetaDICT)
Bo Yuan, Shulei Wang, Microbiome Data Integration via Shared Dictionary Learning (2024). [link]