
This repository contains all the code and files needed for our Scientific Data project, which presents a comparative and functional study of 395 shotgun metagenomic samples collected over the past 10 years from hematopoietic cell transplant patients at Memorial Hospital.

Jinyuan1998/scientific_data_metagenome_shotgun

shotgun_scientific_data

Code accompanying the Scientific Data manuscript that publishes shotgun metagenomic data from allo-HCT patients. We also demonstrate functional analyses of these data, such as profiling antibiotic resistance genes.

- Figures

Figure1/scFigure1.m

  • Creates the t-SNE plot calculated from the Bray-Curtis dissimilarity matrix of the 16S data and plots the samples that have shotgun metagenome sequencing.
    • CAUTION: running this script takes >30 min. A saved result is available in 'savedMat' and can be loaded directly. Set the parameter 'calculateTSNE' to 1 to recalculate the t-SNE scores; otherwise the script loads the saved data.
  • Compares read counts across stool consistencies.
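For readers unfamiliar with the two ideas in the Figure 1 notes, here is a minimal Python sketch of a Bray-Curtis dissimilarity matrix combined with a recompute-or-load switch. It is an illustration only, not the repository's MATLAB code: the function names, the JSON cache file, and the toy abundance values are all assumptions, and the t-SNE embedding step itself is left out.

```python
import json
import os
import tempfile


def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two abundance vectors of equal length."""
    total = sum(a) + sum(b)
    if total == 0:
        return 0.0  # two empty samples are treated as identical
    return sum(abs(x - y) for x, y in zip(a, b)) / total


def dissimilarity_matrix(samples):
    """Pairwise Bray-Curtis matrix for a list of abundance vectors."""
    return [[bray_curtis(a, b) for b in samples] for a in samples]


def load_or_compute(samples, cache_path, recalculate):
    """Recompute when recalculate == 1 or no cache exists; otherwise load the saved result."""
    if recalculate != 1 and os.path.exists(cache_path):
        with open(cache_path) as fh:
            return json.load(fh)
    matrix = dissimilarity_matrix(samples)
    with open(cache_path, "w") as fh:
        json.dump(matrix, fh)
    return matrix


# Toy abundance profiles (made-up numbers, not data from this repository).
toy_samples = [[10, 0, 5], [10, 0, 5], [0, 8, 2]]
cache_file = os.path.join(tempfile.mkdtemp(), "bray_curtis.json")  # hypothetical cache location
first = load_or_compute(toy_samples, cache_file, recalculate=1)   # computes and saves
second = load_or_compute(toy_samples, cache_file, recalculate=0)  # loads the saved copy
```

The flag plays the same role as 'calculateTSNE': the expensive step runs only when explicitly requested, and every later run reuses the saved result.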

Figure2/scFigure2.m

  • Plots and compares the stool composition obtained from 16S and shotgun metagenome sequencing for the samples of a single patient.
    • CAUTION: This script loads the kraken2 output from PATRIC and creates a .mat file for each sample. This step is time-consuming and takes >1 hour. Once it has been done, set the variable 'rewriteShotgunAbundances' to a value other than 1 and the script will read from a saved csv file instead.
    • NOTE: This script does not include the (U)nclassified reads from the PATRIC output.
  • Compares the relative abundance of each taxon across stool consistencies.
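The note about (U)nclassified reads can be illustrated with a short Python sketch that turns a kraken2-style report into relative abundances. It assumes the standard six-column, tab-separated kraken2 report layout (percentage, clade reads, directly assigned reads, rank code, taxid, name) and summarizes at genus level; the files in PATRIC_output and the taxonomic level used by the MATLAB script may differ, and the function name and toy report are hypothetical.

```python
def genus_relative_abundance(report_lines):
    """Return {genus: fraction of genus-level reads}, ignoring unclassified (rank 'U') reads."""
    counts = {}
    for line in report_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 6:
            continue  # skip malformed or empty lines
        clade_reads = int(fields[1])
        rank = fields[3].strip()
        name = fields[5].strip()  # names are indented in the report
        if rank == "U":
            continue  # unclassified reads are left out of the denominator
        if rank == "G":
            counts[name] = counts.get(name, 0) + clade_reads
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()} if total else {}


# Toy report (made-up read counts, not data from this repository).
toy_report = [
    " 20.00\t200\t200\tU\t0\tunclassified",
    " 80.00\t800\t0\tR\t1\troot",
    " 60.00\t600\t10\tG\t1350\t      Enterococcus",
    " 20.00\t200\t5\tG\t561\t      Escherichia",
]
abundance = genus_relative_abundance(toy_report)
```

Because the unclassified reads are dropped before normalizing, the abundances sum to 1 over the classified genera rather than over all reads.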

Figure3/scFigure3.m

  • Calculates and plots the correlation between 16S and shotgun metagenomic sequencing.
  • Calculates the alpha diversity using the Shannon index and compares it among stool consistencies.
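As a reference for the two quantities named for Figure 3, the following Python sketch computes a Shannon index and a correlation coefficient. Two assumptions are made here that the README does not confirm: the Shannon index uses the natural logarithm, and the correlation is Pearson's. The function names and toy values are illustrative, and the repository's MATLAB script may be implemented differently.

```python
import math


def shannon_index(counts):
    """Shannon index H = -sum(p * ln p) over taxa with non-zero counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def pearson(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


# Toy values (made-up, not data from this repository).
even_community = shannon_index([5, 5, 5, 5])      # four equally abundant taxa
single_taxon = shannon_index([10, 0, 0])          # one taxon dominates completely
abundance_16s = [0.10, 0.30, 0.60]
abundance_shotgun = [0.12, 0.28, 0.60]
r = pearson(abundance_16s, abundance_shotgun)
```

A perfectly even community of four taxa gives ln(4), and a community with a single taxon gives 0, which bounds the values to expect when comparing stool consistencies.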

Figure4/scFigure4.m

  • Plots the vanA PCR results on the t-SNE plot (the same as in Figure 1) using the saved .mat file.
  • Compares the relative abundance of the vanA gene between the PCR(+) and PCR(-) groups, and examines the correlation between the vanA and vanB genes in the shotgun metagenomes.
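The group comparison described for Figure 4 can be sketched in Python as follows. The README does not say which summary statistic or test the script uses, so the median per group is an assumption made only for illustration; the record layout, function name, and toy values are hypothetical as well.

```python
from statistics import median


def group_medians(records):
    """Median vanA relative abundance per PCR group.

    records: iterable of (pcr_positive, relative_abundance) pairs.
    """
    groups = {}
    for pcr_positive, rel_abundance in records:
        groups.setdefault(pcr_positive, []).append(rel_abundance)
    return {key: median(values) for key, values in groups.items()}


# Toy records (made-up values, not data from this repository).
toy_records = [
    (True, 0.004),
    (True, 0.010),
    (True, 0.002),
    (False, 0.0),
    (False, 0.001),
]
medians = group_medians(toy_records)
```

In practice the comparison would be followed by a statistical test chosen for the data; this sketch only shows how the samples are split by PCR result.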

Figure5/scFigure5.m

  • Plots the phylogenetic tree built from Enterococcus isolates from the stool of an HCT patient and metagenome-assembled genomes from the same samples.

makeCARDtbl/scCreateCardTable2.m

  • Generates the output file cardTbl.csv.

makeVFDBtable/scProcessVFDB.m

  • Generates the output file vfdbTbl_2021.csv.

- Metagenome data

metagenome_data/

    Contains the .csv files used for metagenome analysis.

savedMat/

    Contains data saved during figure generation to avoid recalculation. This directory holds the t-SNE scores calculated from the Bray-Curtis dissimilarity matrix of the 16S data, the correlation between 16S and shotgun sequencing, and the Shannon index.

deidentified_data_tables/

    Files from our previous 16S data paper that are used in this study for comparison. The table tblASVsamples comes from that project and has one additional column, AccessionShotgun, which holds the SRA accession of the shotgun fastq files (only available for the 395 samples).
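Since AccessionShotgun is filled in only for the samples that have shotgun data, a simple filter on that column recovers them. The Python sketch below assumes the sample table is a CSV with a sample identifier column named 'SampleID'; that column name, the function name, and the placeholder accessions are assumptions for illustration, so check the actual header before reusing it.

```python
import csv
import io


def shotgun_samples(csv_text):
    """Map sample ID -> SRA accession for rows with a non-empty AccessionShotgun."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {
        row["SampleID"]: row["AccessionShotgun"]
        for row in reader
        if row["AccessionShotgun"].strip()
    }


# Toy table with placeholder accessions (not real SRA records).
toy_csv = (
    "SampleID,AccessionShotgun\n"
    "sampleA,SRR0000001\n"
    "sampleB,\n"
    "sampleC,SRR0000002\n"
)
with_shotgun = shotgun_samples(toy_csv)
```

To use it on a real file, read the file contents into a string first, for example with `open(path).read()`, and pass that string to `shotgun_samples`.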

PATRIC_output/

    Directories containing the kraken2, CARD and VFDB output from PATRIC for all shotgun samples. Only the files used for data analysis are included because of size limitations.

PATRIC_output_10samples/

    Directories with the full PATRIC output for 10 samples, including kraken2, CARD and VFDB, and a .txt file containing the names of the 10 samples. This folder provides a complete view of the PATRIC output and allows users to try the data analysis on a small dataset.

- utils

    Contains the helper functions that our scripts need to run.

- color-legends_16S.pdf

    Color legends used for the major taxa in the 16S gene and shotgun metagenome sequencing.
