# Methods

Building Euplotid began with raw sequencing reads stored in several different formats, which are processed all the way to quantified values. Because the methods are broad and flexible, each one is documented in its own Jupyter notebook. Since each notebook acts as both the documentation and the pipeline itself, this format allows for seamless data integration.

* Hello world intro to programming and Jupyter's capabilities [helloWorld](helloWorld.md) O*.
* Databases and good tools to crawl the internet for interesting datasets and hypotheses [databasesTools](databasesTools.md) O*.
* Fetch any type of sequencing data from SRA [getFastqReads](getFastqReads.md) O
* QC, trim, and filter sequencing reads [fq2preppedReads](fq2preppedReads.md) O
* Call peaks from ChIP-seq and chromatin accessibility reads [fq2peaks](fq2peaks.md) O
* Call normalized interactions from ChIA-PET reads [fq2ChIAInts](fq2ChIAInts.md) O
* Call normalized interactions from Hi-C reads [fq2HiCInts](fq2HiCInts.md) O
* Call normalized interactions from HiChIP reads [fq2HiChIPInts](fq2HiChIPInts.md) O
* Call normalized interactions from DNase Hi-C reads [fq2DNAseHiCInts](fq2DNAseHiCInts.md) O
* Call normalized expression and counts from RNA-seq reads [fq2countsFPKM](fq2countsFPKM.md) O
* Call differentially expressed genes from RNA-seq counts [countsFPKM2DiffExp](countsFPKM2DiffExp.md) O
* Call normalized counts, miRNA promoters, and nascent transcripts from GRO-seq reads [fq2GroRPKM](fq2GroRPKM.md) O
* Call normalized interactions from 4C reads [fq24CInts](fq24CInts.md) O
* Build, annotate, and add INs to the global graph for a given cell state using DNA-DNA interactions, chromatin accessibility, and FPKM [addINs](addINs.md) *
* View the currently built and annotated INs for all cell types [viewINs](viewINs.md) O*.
* Search for and/or manipulate annotation and other data available to Euplotid [annotationManagement](annotationManagement.md) O*.
* Description of the default image and software packages, how to get new ones, and which ones are currently installed [packageManagement](packageManagement.md) O*.
* Find clusters of interconnected nodes (communities) using a Louvain algorithm, then visualize the results [vanillaCommunities](vanillaCommunities.md)
* Create, manipulate, and visualize cool DNA-DNA interaction files [chilledInteractions](chilledInteractions.md)
* Design Base Editor and sgRNA plasmids for a transition mutation at a chosen cis-regulatory element [CRE2plasmid](CRE2plasmid.md)

* [O] = Megatid compatible
* [*] = Euplotid compatible
* [.] = Minitid compatible

## [helloWorld](helloWorld.md)
Hello world introduction to programming, IPython, and Euplotid.

## [databasesTools](databasesTools.md)
Databases and good tools to crawl the internet for interesting datasets and hypotheses. Examples include GTEx, UniProt, SRA, and GEO; check them out!

## [getFastqReads](getFastqReads.md)
Use Tony to find local fastq.gz files, or provide an SRA accession number to pull reads from.

## [fq2preppedReads](fq2preppedReads.md)
Take fq.gz reads and QC them with FastQC, checking for over-represented sequences that may indicate adapter contamination. Then use cutadapt and sickle to remove adapters and filter reads. Trimmomatic can also be used for flexible trimming.
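
To make the idea of an over-represented sequence concrete, here is a minimal sketch in plain Python. It is not FastQC's algorithm, only a simplified illustration: it counts exact duplicate reads and reports any sequence whose share of all reads reaches a threshold. The function name `overrepresented`, the `min_fraction` parameter, and the toy reads are all assumptions made for this example.

```python
from collections import Counter

def overrepresented(reads, min_fraction=0.001):
    """Return (sequence, fraction) pairs whose share of all reads is >= min_fraction.

    Simplified illustration only; the threshold is an assumed default.
    """
    counts = Counter(reads)
    total = sum(counts.values())
    if total == 0:
        return []
    hits = [(seq, n / total) for seq, n in counts.items() if n / total >= min_fraction]
    # Most abundant sequences first.
    return sorted(hits, key=lambda pair: pair[1], reverse=True)

# Toy input: one adapter-like sequence dominates the sample (hypothetical data).
toy_reads = ["AGATCGGAAGAGC"] * 6 + [
    "ACGTACGTACGTA",
    "TTGACCATGCAAT",
    "GGCATTCAGGTCA",
    "CCATGGATTACAG",
]
flagged = overrepresented(toy_reads, min_fraction=0.5)
print(flagged)
```

A sequence flagged this way is a candidate adapter to pass to the trimming step; real data would be read from the fq.gz files rather than from a list.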

## [fq2peaks](fq2peaks.md)
Take fq.gz reads and align them to the genome using bowtie2. Then, using HOMER, pick the type of peak (histone, ChIP-seq, DNase, etc.) and chug through to get BED files of peaks. MACS2 can also be used with specific analysis parameters to deal with different types of peak-finding problems.

## [fq2ChIAInts](fq2ChIAInts.md)
Take fq.gz reads, prep them by removing bridge adapters (either bridge can be handled), align, find interactions, normalize, and write the result in cooler format for later viewing. The analysis can be performed using either Origami or ChIA-PET2.

## [fq2HiCInts](fq2HiCInts.md)
Take fq.gz reads and chug them through HiC-Pro with tuned, relevant parameters. The output is a cooler file that can be loaded for further visualization.
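
The binning step behind a contact matrix can be sketched as follows. This is a toy illustration, not the HiC-Pro or cooler implementation: it assumes all contacts fall on one chromosome, and the function name `bin_contacts`, the bin size, and the read-pair positions are hypothetical. Each pair of positions is mapped to a pair of fixed-size bins, ordered so that only the upper triangle of the matrix is stored, and counted.

```python
from collections import Counter

def bin_contacts(pairs, bin_size):
    """Count contacts per (bin1, bin2) pixel, keeping bin1 <= bin2.

    pairs: iterable of (pos1, pos2) coordinates on a single chromosome (assumption).
    """
    pixels = Counter()
    for pos1, pos2 in pairs:
        # Integer division maps a coordinate to its bin index.
        b1, b2 = sorted((pos1 // bin_size, pos2 // bin_size))
        pixels[(b1, b2)] += 1
    return dict(pixels)

# Hypothetical read-pair positions in base pairs.
toy_pairs = [(1_200, 25_400), (3_900, 27_000), (26_100, 2_000), (11_000, 12_500)]
pixels = bin_contacts(toy_pairs, bin_size=10_000)
print(pixels)
```

Storing only non-zero pixels keeps the matrix sparse, which matters because most bin pairs in a genome-wide map have no contacts.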

## [fq2HiChIPInts](fq2HiChIPInts.md)
Take fq.gz reads and chug them through a customized Origami pipeline and a customized HiC-Pro pipeline. The output is a cooler file that can be loaded for further visualization.

## [fq2DNAseHiCInts](fq2DNAseHiCInts.md)
Take fq.gz reads and chug them through the HiC-Pro pipeline. The output is a cooler file that can be loaded for further visualization.

## [fq2countsFPKM](fq2countsFPKM.md)
Take fq.gz reads and chug them through the STAR aligner and then the RSEM pipeline. The output is a matrix of counts per transcript and a normalized transcript/gene FPKM matrix.
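
For readers new to FPKM (fragments per kilobase of transcript per million mapped fragments), the normalization can be sketched in a few lines. This is a simplified illustration that uses raw feature lengths and integer counts; RSEM itself works with estimated expected counts and effective lengths, so its numbers will differ. The function name `fpkm` and the toy genes are assumptions for this example.

```python
def fpkm(counts, lengths_bp):
    """Compute FPKM per feature from raw counts and feature lengths in base pairs.

    FPKM = count * 1e9 / (length_bp * total_count)
    """
    total = sum(counts.values())
    if total == 0:
        return {gene: 0.0 for gene in counts}
    return {
        gene: counts[gene] * 1e9 / (lengths_bp[gene] * total)
        for gene in counts
    }

# Hypothetical genes: geneB has 3x the counts but is also 3x as long.
toy_counts = {"geneA": 500, "geneB": 1500}
toy_lengths = {"geneA": 1000, "geneB": 3000}
values = fpkm(toy_counts, toy_lengths)
print(values)
```

The two toy genes end up with the same FPKM, which shows the point of the length correction: a longer transcript collects more fragments at the same expression level.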

## [countsFPKM2DiffExp](countsFPKM2DiffExp.md)
Take RNA-seq count and FPKM matrices and run any one of many R packages (DESeq2, DESeq, EBSeq, edgeR, ...) to call differentially expressed genes. Plotting and interactive visualization of results are included.

## [fq2GroRPKM](fq2GroRPKM.md)
Take fq.gz reads and align them using bowtie2, then find nascent transcripts using FStitch and miRNA promoters using mirSTP.

## [fq24CInts](fq24CInts.md)
Take fq.gz reads and align them using bowtie2. Chug them through HiC-Pro and/or a custom pipeline to get a cooler file.

## [addINs](addINs.md)
Build, annotate, and add INs to the global graph for a given cell state using DNA-DNA interactions, chromatin accessibility, and FPKM.
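
As a rough picture of what combining these three data types into a graph can look like, the sketch below builds an undirected, weighted graph from interaction records and annotates each node with accessibility and expression. It is not the notebook's implementation: the function name `build_graph`, the node names, and the annotation keys are all hypothetical.

```python
from collections import defaultdict

def build_graph(interactions, accessible, fpkm_by_node):
    """Build an annotated undirected graph.

    interactions: iterable of (node_a, node_b, weight) DNA-DNA interactions
    accessible:   set of node names overlapping accessible chromatin
    fpkm_by_node: dict of node name -> FPKM (missing nodes get None)
    """
    edges = defaultdict(dict)
    for a, b, weight in interactions:
        # Store each edge in both directions so lookups are symmetric.
        edges[a][b] = weight
        edges[b][a] = weight
    nodes = {
        name: {"accessible": name in accessible, "fpkm": fpkm_by_node.get(name)}
        for name in edges
    }
    return nodes, dict(edges)

# Hypothetical toy inputs.
toy_interactions = [("enhancer1", "promoterA", 12.0), ("enhancer2", "promoterA", 4.5)]
nodes, edges = build_graph(
    toy_interactions,
    accessible={"enhancer1", "promoterA"},
    fpkm_by_node={"promoterA": 30.2},
)
print(nodes["promoterA"], edges["promoterA"])
```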

## [viewINs](viewINs.md)
View the currently built and annotated INs for all cell types.

## [annotationManagement](annotationManagement.md)
Search for and/or manipulate annotation and other data available to Euplotid.

## [packageManagement](packageManagement.md)
Description of the default image and the software packages installed on it, how to get new packages, and how to export the environment to a YAML file so others can replicate the analysis.

## [vanillaCommunities](vanillaCommunities.md) 
Find clusters of interconnected nodes (communities) using a Louvain algorithm, then visualize the results.
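
Louvain-style methods search for a partition of the graph that maximizes modularity. The sketch below does not implement the Louvain search itself; it only computes the modularity score of a given partition for an unweighted, undirected graph, which is enough to see why one grouping is preferred over another. The function name `modularity` and the toy graph are assumptions for this example.

```python
def modularity(edges, communities):
    """Modularity Q of a partition of an unweighted, undirected graph.

    Q = sum over communities c of  L_c / m - (d_c / (2 m)) ** 2
    where m is the number of edges, L_c the number of edges inside c,
    and d_c the summed degree of the nodes in c.
    """
    m = len(edges)
    if m == 0:
        return 0.0
    membership = {node: i for i, group in enumerate(communities) for node in group}
    intra = [0] * len(communities)
    degree = [0] * len(communities)
    for a, b in edges:
        degree[membership[a]] += 1
        degree[membership[b]] += 1
        if membership[a] == membership[b]:
            intra[membership[a]] += 1
    return sum(intra[i] / m - (degree[i] / (2 * m)) ** 2 for i in range(len(communities)))

# Two triangles joined by a single bridge edge (hypothetical toy graph).
toy_edges = [("a", "b"), ("b", "c"), ("a", "c"),
             ("d", "e"), ("e", "f"), ("d", "f"),
             ("c", "d")]
split = modularity(toy_edges, [{"a", "b", "c"}, {"d", "e", "f"}])
merged = modularity(toy_edges, [{"a", "b", "c", "d", "e", "f"}])
print(split, merged)
```

Splitting the toy graph into its two triangles scores higher than putting every node in one community, which is the kind of partition a modularity-maximizing method is expected to favor.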

## [chilledInteractions](chilledInteractions.md) 
Create, manipulate, and visualize cool DNA-DNA interaction files 

## [CRE2plasmid](CRE2plasmid.md) 
Design Base Editor and sgRNA plasmids for a transition mutation at a chosen cis-regulatory element.
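
A transition is a purine-to-purine (A<->G) or pyrimidine-to-pyrimidine (C<->T) substitution. Cytosine base editors (CBE) convert C to T, which appears as G to A when read on the opposite strand, and adenine base editors (ABE) convert A to G, which appears as T to C on the opposite strand. The sketch below only classifies a desired substitution by editor class; it does not design sgRNAs or plasmids, and the names `TRANSITIONS` and `editor_for` are hypothetical.

```python
# Map (reference base, desired base) to the base editor class that can in
# principle install it. Transversions are absent and return None.
TRANSITIONS = {
    ("C", "T"): "CBE",  # direct C -> T edit
    ("G", "A"): "CBE",  # C -> T on the opposite strand
    ("A", "G"): "ABE",  # direct A -> G edit
    ("T", "C"): "ABE",  # A -> G on the opposite strand
}

def editor_for(ref, alt):
    """Return 'CBE' or 'ABE' for a transition, or None for any other change."""
    return TRANSITIONS.get((ref.upper(), alt.upper()))

for ref, alt in [("C", "T"), ("T", "C"), ("A", "C")]:
    print(ref, "->", alt, ":", editor_for(ref, alt))
```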