
<img src="images/Biofilm Website 2.png" width="600" height="500"></img>

# Submodule #0: Introduction - Concept Inventory and Workflow Overview

#### Microbial Community and Biofilm Analysis from Metagenomics Datasets: Docker with Google Cloud Tutorial

Biofilms are complex formations of microbial communities composed of different types of microorganisms such as bacteria, viruses and fungi. 
Here, we present a biofilm metagenomics workflow in the form of a self-paced practical learning module to aid in the understanding of the role of biofilms in human health. This will include the analysis of the biofilm community composition, diversity, and function. We will leverage quorum sensing signatures to provide insights into the microbial biofilm phenotype response markers.


**Primary Objective:** Develop a workflow characterizing the taxonomic diversity of biofilm communities and develop an educational resource to assist in the understanding of biofilm metagenomics analysis.



<img src="images/GraphicalAbstract2022IEEE.png" width="300" height="200"></img>

Figure 1

See our intro video for a review of the learning module, its core concepts, and the methods used to set up a metagenomic experiment, collect data, and analyze it for biomarker discovery.

<font color = "red"> <b>[Intro_video] </b> </font>

----------------------------------------------------------------------------------------------------------------
# Training Plan 


<font color="green"> **Submodule #0: Introduction - Concept Inventory and Workflow Overview** </font>

 
Submodule #1:  Metagenome Data Preparation and QC

 
Submodule #2: Microbiome Analysis


Submodule #3: Biomarker Discovery

 
Submodule #4: Microbiome Community Analysis


Submodule #5: Metagenomics Analysis of Microbiome Community and Biofilm Using Nextflow

# Learning Objectives:

The biofilm metagenomics workflow self-learning module can be used at the undergraduate and graduate levels, and the learning objectives vary slightly with the audience. For interested students, we also intend to offer additional technical learning objectives, such as deploying the learning module to alternate platforms and customizing the workflow.

<div class="alert alert-block alert-success">
    <i class="fa fa-hand-paper-o" aria-hidden="true"></i>
    <b>Note: </b>  This module can take up to 2 hours to complete.
</div>

----------------------------------------------------------------------------------------------------------------

# Submodule #0: Introduction - Concept Inventory and Workflow Overview

## LO1. Quick Start - Concepts Inventory
 > The learner will be introduced to fundamental concepts related to microbiome and biofilm analysis. Biofilms are important for public health because of their role in certain infectious diseases and in a variety of device-related infections.

<div class="alert alert-block alert-success">
    <i class="fa fa-hand-paper-o" aria-hidden="true"></i>
    <b>Note: </b> The code below imports the YouTubeVideo class from IPython.display, which lets the user play a YouTube video inside a Jupyter Notebook.
</div>

### What Is A Biofilm?

In [1]:
#Run the command below to watch the video
from IPython.display import YouTubeVideo

YouTubeVideo('0DSA_8t4-UA', width=800, height=400)

#### Change the kernel from Python 3 to qiime2 (top right of the workspace)

In the upper right of this workspace, where it says "Python 3", click to open a dropdown menu and select the qiime2 kernel. The kernel name carries a version suffix, such as "qiime2-2021.11" or "qiime2-2022.2", depending on the installed release. This changes the kernel that we are working in. We will use this kernel for submodule 2 and part of submodule 3.

<p style="color:red"><b>[Change_kernel.mp4]</b></p>
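
After switching kernels, it can help to confirm which interpreter the notebook is actually running. The cell below is a minimal sketch using only the Python standard library. The helper name `describe_kernel` and the assumption that the QIIME 2 environment path contains the text "qiime2" are illustrative and not part of the original module.

```python
import sys


def describe_kernel(expected_fragment="qiime2"):
    """Report which Python interpreter backs the running kernel.

    expected_fragment is a hypothetical default: it assumes the QIIME 2
    environment directory contains the text "qiime2" in its path.
    """
    executable = sys.executable or ""
    return {
        "executable": executable,
        "python_version": "{}.{}.{}".format(*sys.version_info[:3]),
        "looks_like_expected": expected_fragment.lower() in executable.lower(),
    }


info = describe_kernel()
print("Interpreter:", info["executable"])
print("Python version:", info["python_version"])
print("Path mentions 'qiime2':", info["looks_like_expected"])
```

If the last line prints `False`, the notebook may still be running on the default Python 3 kernel, or the environment may simply be installed under a different name.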

The code below uses a Jupyter quiz tool that embeds a quiz in a Jupyter Notebook. You will see a quiz at the end of each submodule. Run the command below to take the quiz.

In [2]:
from IPython.display import IFrame
IFrame("Quiz/QS11.html", width=800, height=350)

-------------------------------------------------------------------

## LO2. Dataset and Toolkits 
The learner will be able to describe and work with the datasets and toolkits relevant to a microbiome analysis project. Biofilm metagenomic analysis can be leveraged to aid in our understanding of microbial taxonomy, functions, interactions, ecology, and evolution.

## Bioinformatics Workflow Description

### Bioinformatics Workflow Overview

<img src="images/USD_workflow.png" width="450" height="300"></img>

### Workflow Analytic Toolkits
- Docker
- Jupyter Notebook
- Custom Scripts
- FastQC
- MultiQC
- Trimmomatic
- QIIME2
- PICRUSt2
- MicrobiomeAnalystR
- Google BigQuery
- BLAST+
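
Before starting the later submodules, a quick way to see which of the command-line toolkits above are reachable from the current environment is to look them up on the `PATH`. The sketch below uses only `shutil.which` from the standard library. The executable names in `TOOL_COMMANDS` are assumptions based on common installations (for example, `qiime` for QIIME2 and `blastn` for BLAST+) and may differ in your environment; the helper `find_tools` is illustrative, not part of the module.

```python
import shutil

# Assumed executable names; these can differ between installations.
TOOL_COMMANDS = {
    "Docker": "docker",
    "FastQC": "fastqc",
    "MultiQC": "multiqc",
    "Trimmomatic": "trimmomatic",
    "QIIME2": "qiime",
    "PICRUSt2": "picrust2_pipeline.py",
    "BLAST+": "blastn",
}


def find_tools(commands=TOOL_COMMANDS):
    """Map each toolkit name to its resolved executable path, or None if absent."""
    return {name: shutil.which(cmd) for name, cmd in commands.items()}


# Print one line per tool: its resolved path, or a note that it was not found.
for name, path in find_tools().items():
    print(f"{name:<12} {path if path else 'not found on PATH'}")
```

A tool reported as not found is not necessarily missing: it may live in a different conda environment or Docker image than the active kernel.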

## LO3. A Cloud-Based Workflow Implementation
A cloud-based approach provides easy access to suitable computational capabilities, but the computational services used by the analysis workflow should be allocated with cost in mind. The figure below shows the technical infrastructure diagram of the step-by-step workflow presented in this module.

<img src="images/USD_TID.png" width="700" height="700">

--------------------------------------------------------------

# Summary

In this submodule, we looked at a high-level overview of bacterial biofilms, learned about the sequencing methods used to generate metagenomic data, and explored the technology that we will use to analyze the data. In the next submodule, we will obtain a dataset and perform basic quality control checks and data preparation on it.


# Images and Illustration Credit: 

Figure 1 credit: Jessica Zylla, created with BioRender.