-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bioinformatics 3: ChiP-seq analysis #21
Comments
An enhancer is a short piece or sequence of DNA that works to enhance or speed up the rate of genetic transcription. An enhancer is also often called a cis-regulatory element and is between 20 to 400 base pairs of DNA in size. Transcription factors first bind to an enhancer. Then a DNA bending protein brings the enhancer closer to the promoter in a process known as DNA looping. Enhancers thus enhance or speed up the rate of transcription by bringing transcription actors closer to the promoter. Enhancers can also regulate more than one gene regardless of their orientation relative to the genes or genes. Enhancers also are an important genetic element in development since they can help to enhance the activation of transcription in cells. Promoters are pieces of DNA sequences that indicate where transcription of DNA by RNA polymerase starts. Promoters are involved in initiating or starting genetic transcription since they determine which DNA strand will be transcribed (i.e. which strand is the sense strand), and in which direction the transcription will occur. Promoters are usually found upstream from the start of transcription at the 5’end of where transcription starts. Promoters have to be in a 5’position near to the gene to be transcribed. The 5’end of DNA refers to the DNA strand that ends on a 5’carbon. The promoters bind to both the RNA polymerase enzyme and to transcription factors.The promoter initiates the process of transcription by interacting with RNA polymerase and transcription factors. The RNA polymerase enzyme weakly binds to a DNA sequence and moves along the strand until it encounters a promoter. At this stage, it then forms a closed promoter complex with the promoter. The RNA polymerase then proceeds to unwind the DNA at the transcription initiation or start site to form an open promoter complex. Transcription is then initiated. Summary of Enhancer Vs. Promoter
|
CAP: Chromatin Associated Proteins (CAPs) introns Mapping (alignment) irreproducible discovery rate (IDR) |
Steps in data analysis
Preprocessing:
i) Bad quality -> Tool: Use “FASTQ Quality Filter” and/or “FASTQ Quality
ii) Flagged Kmer Content: About 100% of the first six bases are the same sequence -> Tool: Use “FASTQTrimmer” Trimmer
Quality control: Run fastqc on the processed samples to see if the problem has been removed. Tool: fastqc
Library complexity: the fraction of unique fragments present in a given library. A proxy is to look at the sequence
duplication levels on the FastQC report.
Low library complexity may be an indicator that:
– A new sample and a new library should be prepared.
– We have to find a better Ab to perform the IP.
– We can not sequence the same sample anymore because we will not find new sequences.
In certain experimental settings we may expect a low library complexity. i.e. We are profiling a protein that binds to a small subset of the genome.
Mapping (alignment): Treat IP and control the same way (preprocessing and mapping). Tool: bowtie 1 or bowtie 2 (use end-to-end mode) or bwa
– map the reads and removing unmapped reads
– filter reads mapped by quality mapping score
Peak calling
i) Read extension and signal profile generation: Estimation of the fragment length using Strand cross-correlation analysis
ii) Peak assignment and evaluation
– Look for fold enrichment of the sample over input or expected background
– Estimate the significance of the fold enrichment using
iii) Look at your mapped reads and peaks in a genome browser to verify peak calling thresholds
i) Link peaks to genes: Bed tools (intersectBed, closestBed, coverageBed, slopBed)
ii) Infer possible biological consequences of the binding
AnalysisofChIP-seqData2016.pdf
The text was updated successfully, but these errors were encountered: