Skip to content

R-Wright-1/PET-Plastisphere

Repository files navigation

A multi-OMIC characterisation of biodegradation and microbial community succession within the PET Plastisphere

Robyn J. Wright, Rafael Bosch, Morgan G.I. Langille, Matthew I. Gibson and Joseph A. Christie-Oleza

This is the Github repository for all code used to produce the analyses in A Multi-OMIC Characterisation of Biodegradation and Microbial Community Succession Within the PET Plastisphere

I have not provided comprehensive instructions on reproducing these analyses, but with some basic knowledge of Python usage, the scripts should be able to just be run (in the directories that they are in, as these already have the data needed). In some cases, larger files may need unzipping etc. Please feel free to email me with any questions.

This repository has the following sections:

1. FTIR with raw plastics

A .csv file with the FTIR data, an .Rmd document used to perform baseline correction, a .txt file with the baseline-corrected spectra, a .py script to plot these spectra and a .png file that this creates.
Note that the same script was used for normalising the FTIR spectra after incubation also, but file names and output locations were manually changed.

2. Community succession

a) DNA yields: DNA concentrations in each sample as measured by Qubit as well as bioanalyser and qPCR results of the library used for MiSeq (note that this also contained samples run for other projects in the same run) and the metadata files used for the SRA submission
b) Basic processing DADA2 in R: this has the script used for analysis and several of the plots output by this. Raw sequencing data can be downloaded from the NCBI SRA PRJNA544783
c) Stacked bar: All samples and taxonomy arranged by time point, a script for plotting the stacked bar plots and the bar plots .png with taxa below 1% abundance grouped to 'other'
d) Diversity: A script for plotting the mean diversity of treatments on different days, and outputting this information to .csv files
e) NMDS, principal response curve and heatmap: scripts for carrying out the PRC analysis in R, extracting the data in Python and plotting this alongside a regular NMDS plot and a heat map of the important ASVs
f) ANOSIM and PERMANOVA: A script for carrying out the ANOSIM and PERMANOVA tests listed in one of the supplementary tables
g) Colonisation dynamics: Two versions of the analysis that examines the day on which each ASV is highest in abundance. The first version (colonisation_plot.py producing Overall fig order abundance.png) was from the first submission and this was later edited to the current versions (colonisation_new_R1.py and New figure.png) based on reviewer suggestions. These both produce heatmaps with taxa normalised to the day on which they are most abundant
h) PICRUSt2: almost all PICRUSt2 results (some were removed due to large file sizes, but these haven't been used for plotting). This includes: - figure: final versions of the figures shown in the main and supplementary results
- older_plots: older versions of figures representing the data that weren't included in the final manuscript version
- picrust_out: the key output from running PICRUSt2 with the inclusion of additional genes related to PET degradation
- R-notebook: R notebook used to investigate the output of PICRUSt2 and the abundance of PETases within the predicted metagenome
- stratified_plots: data and scripts used to plot figures showing abundance of key genes stratified by the taxa that contributed to them
i) Community metabolomics: Data and scripts used to plot the NMDS plot showing the distance between the metabolomics results obtained from the different community incubations on day 42
j) Other figures previous: Some previous figures that didn't make it into the final manuscript version
z) Add genes to picrust: Details on adding genes of interest to the default PICRUSt2 database using Hidden Markov Models of these genes. Includes an R markdown document and .html output with step-by-step instructions for recreating this

3. Isolates

a) Genomic analysis: Genome annotations for each isolate as well as results of the HMM search. Note that details on making the HMM can be found in z) Add genes to picrust, above
b) Growth of isolates: data and scripts for plotting the growth of the two isolates on a range of common growth substrates, as measured by a plate reader across 72 hours of incubation
c) Proteomic analysis of isolates: scripts and data used for calculating fold change, making the plots and making the Thioclava gene cluster plot
d) Metabolomic analysis of isolates: scripts for calculating fold change, carrying out T-tests and plotting each metabolite. Note that the values shown here are taken directly from the appropriate raw metabolomics data, as presented in Supplementary Table 8
e) Global distribution: details and scripts used to perform the searches for the isolates in the plastisphere meta-analysis data, the TARA oceans data (mitag 16S rRNA gene and assembled metagenomes):
- figures created during analyses
- blast_database: BLAST databases created for each sample to be searched
- blast_out: output files from BLAST searches of all samples using the 16S rRNA gene sequences (both obtained from whole genome sequencing and sanger sequencing) of both isolates
- metagenome: output files from all MetaQUAST runs with the two isolates against the assembled TARA oceans metagenomes (note only summary files have been kept due to the large size of other output files)
- plastisphere_16S: data used and figures produced by searches for the 16S sequences of both isolates in the data obtained from our recent plastisphere meta-analysis
- TARA_16S: data used and figures produced by searches for the 16S sequences of both isolates in the TARA oceans mitag sequences

4. Community and isolates (5 month growth experiment)

a) Growth at three months and FTIR analysis at five months: scripts and raw FTIR data for plotting the growth of isolates and communities on different types of PET

5. All code used for this chapter of my PhD thesis

I think that all of this is repeated in the above sections, but I am leaving it in for compatibility with what I have listed in my PhD thesis (which will be linked once it is available - after the embargo period)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages