# Pipeline Directory
## Pipelines to:
1. Fetch any type of sequencing data from SRA [getFastqReads](getFastqReads.ipynb)
2. QC, trim, and filter sequencing reads [fq2preppedReads](fq2preppedReads.ipynb)
3. Call peaks from ChIP-seq and chromatin accessibility reads [fq2peaks](fq2peaks.ipynb)
4. Call normalized interactions from ChIA-PET reads [fq2ChIAInts](fq2ChIAInts.ipynb)
5. Call normalized interactions from Hi-C reads [fq2HiCInts](fq2HiCInts.ipynb)
6. Call normalized interactions from HiChIP reads [fq2HiChIPInts](fq2HiChIPInts.ipynb)
7. Call normalized interactions from DNase Hi-C reads [fq2DNAseHiCInts](fq2DNAseHiCInts.ipynb)
8. Call normalized expression and counts from RNA-seq reads [fq2countsFPKM](fq2countsFPKM.ipynb)
9. Call differentially expressed genes from RNA-seq counts [countsFPKM2DiffExp](countsFPKM2DiffExp.ipynb)
10. Call normalized counts, miRNA promoters, and nascent transcripts from GRO-seq reads [fq2GroRPKM](fq2GroRPKM.ipynb)
11. Call normalized interactions from 4C reads [fq24CInts](fq24CInts.ipynb)

# 1. [getFastqReads](getFastqReads.ipynb)
Use Tony to find local fastq.gz files, or provide an SRA accession number to download the reads from SRA.
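
As a rough illustration of the SRA branch, the sketch below validates a run accession and builds a download command. It assumes the SRA Toolkit's `fasterq-dump` is on the `PATH` and that the accession is a run accession (`SRR`/`ERR`/`DRR`); the function name, the output directory, and the thread count are hypothetical, and the notebook itself may fetch reads differently.

```python
import re
import shlex

# Assumption: run accessions look like SRR/ERR/DRR followed by digits.
RUN_ACCESSION = re.compile(r"^[SED]RR\d+$")


def build_fetch_command(accession, outdir="fastq", threads=4):
    """Return a fasterq-dump argument list for one run accession (hypothetical helper)."""
    if not RUN_ACCESSION.match(accession):
        raise ValueError(f"not a run accession: {accession!r}")
    return [
        "fasterq-dump", accession,
        "--split-files",            # write paired mates to separate files
        "--outdir", outdir,
        "--threads", str(threads),
    ]


# Print the command instead of running it, so the sketch has no side effects.
print(shlex.join(build_fetch_command("SRR000001")))
```

`fasterq-dump` writes uncompressed FASTQ, so a separate gzip step would be needed to end up with fastq.gz files. The list can be handed to `subprocess.run(cmd, check=True)` once the command looks right.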

# 2. [fq2preppedReads](fq2preppedReads.ipynb)
Take fq.gz reads and QC them using FastQC, checking for over-represented sequences that may indicate adapter contamination. Then use cutadapt and sickle to remove adapters and filter low-quality reads. Trimmomatic can also be used for more flexible trimming.
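
To make the over-representation check concrete, here is a simplified sketch of the idea, not FastQC's actual implementation (FastQC samples and truncates reads, which this does not). It counts identical read sequences in a fastq.gz file and reports those above a fraction of the total. The function name and the 0.1% default threshold are assumptions chosen for illustration.

```python
import gzip
from collections import Counter


def overrepresented(fastq_gz, min_fraction=0.001):
    """Return (sequence, fraction) pairs for reads at or above min_fraction of the total."""
    counts = Counter()
    total = 0
    with gzip.open(fastq_gz, "rt") as handle:
        for line_no, line in enumerate(handle):
            if line_no % 4 == 1:  # second line of each 4-line FASTQ record
                counts[line.strip()] += 1
                total += 1
    # most_common() sorts by count, so the most frequent sequences come first.
    return [
        (seq, n / total)
        for seq, n in counts.most_common()
        if n / total >= min_fraction
    ]
```

A sequence that shows up here and matches a known adapter is a good candidate to pass to cutadapt for removal.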

# 3. [fq2peaks](fq2peaks.ipynb)
Take fq.gz reads and align them to the genome using bowtie2. Then, using HOMER, pick the type of peak (histone, ChIP-seq, DNase, etc.) and run through to get BED files of peaks. MACS2 can also be used, with analysis parameters specific to different types of peak-finding problems.
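
The peak-type choice can be sketched as a small lookup that maps the experiment type to a HOMER `findPeaks` style. This assumes a HOMER tag directory has already been built from the bowtie2 alignments; the dictionary keys, the function name, and the directory names are hypothetical, and the notebook may pass additional options.

```python
# Assumption: labels on the left are illustrative; values are HOMER -style names.
HOMER_STYLES = {
    "chip-seq": "factor",   # transcription factor ChIP-seq
    "histone": "histone",   # broad histone-mark regions
    "dnase": "dnase",       # DNase hypersensitivity
}


def build_findpeaks_command(tag_dir, peak_type, control_tag_dir=None):
    """Return a HOMER findPeaks argument list for the given peak type (hypothetical helper)."""
    if peak_type not in HOMER_STYLES:
        raise ValueError(f"unknown peak type: {peak_type!r}")
    cmd = ["findPeaks", tag_dir, "-style", HOMER_STYLES[peak_type], "-o", "auto"]
    if control_tag_dir is not None:
        cmd += ["-i", control_tag_dir]  # input/control tag directory
    return cmd


print(" ".join(build_findpeaks_command("sample_tags", "histone", "input_tags")))
```

Keeping the mapping in one dictionary makes it easy to see which experiment types are supported and to add another style later.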

# 4. [fq2ChIAInts](fq2ChIAInts.ipynb)
Take fq.gz reads, prep them by removing bridge adapters (either bridge can be handled), align, find interactions, normalize, and write to cooler format for later viewing. The analysis can be performed using either Origami or ChIA-PET2.

# 5. [fq2HiCInts](fq2HiCInts.ipynb)
Take fq.gz reads and run them through HiC-Pro with the relevant parameters tuned. The output is a cooler file that can be loaded for further visualization.

# 6. [fq2HiChIPInts](fq2HiChIPInts.ipynb)
Take fq.gz reads and run them through a customized Origami pipeline and a customized HiC-Pro pipeline. The output is a cooler file that can be loaded for further visualization.

# 7. [fq2DNAseHiCInts](fq2DNAseHiCInts.ipynb)
Take fq.gz reads and run them through the HiC-Pro pipeline. The output is a cooler file that can be loaded for further visualization.

# 8. [fq2countsFPKM](fq2countsFPKM.ipynb)
Take fq.gz reads and run them through the STAR aligner and then the RSEM pipeline. The output is a transcript count matrix and a normalized transcript/gene FPKM matrix.
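
For readers unfamiliar with the normalization, the textbook FPKM definition is the fragment count scaled by feature length in kilobases and by total mapped fragments in millions. The sketch below applies that definition to raw counts. It is a simplification: RSEM works from expected counts and effective lengths, and the function and gene names here are hypothetical.

```python
def fpkm(counts, lengths):
    """Compute FPKM per feature from fragment counts and feature lengths in bp.

    FPKM = count * 1e9 / (length_bp * total_fragments)
    """
    total = sum(counts.values())
    if total == 0:
        return {feature: 0.0 for feature in counts}
    return {
        feature: counts[feature] * 1e9 / (lengths[feature] * total)
        for feature in counts
    }


# Toy input: two genes with made-up counts and lengths.
example = fpkm({"geneA": 100, "geneB": 300}, {"geneA": 1000, "geneB": 2000})
print(example)
```

Because the total fragment count is in the denominator, FPKM values are comparable across features within a sample; comparing across samples usually needs further normalization.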

# 9. [countsFPKM2DiffExp](countsFPKM2DiffExp.ipynb)
Take RNA-seq count and FPKM matrices and run any one of several R packages (DESeq2, DESeq, EBSeq, edgeR, ...) to call differentially expressed genes. Plotting and interactive visualization of the results are included.

# 10. [fq2GroRPKM](fq2GroRPKM.ipynb)
Take fq.gz reads and align them using bowtie2, then find nascent transcripts using FStitch and miRNA promoters using mirSTP.

# 11. [fq24CInts](fq24CInts.ipynb)
Take fq.gz reads and align them using bowtie2. Run them through HiC-Pro and/or a custom pipeline to get a cooler file.