/
index.Rmd
336 lines (165 loc) · 9.27 KB
/
index.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
---
title: "Home"
site: workflowr::wflow_site
output:
workflowr::wflow_html:
toc: false
editor_options:
chunk_output_type: console
---
I am interesting in understanding differential APA between human and chimpanzees. I used 3' Seq to identify PAS for total and nuclear frations of LCL in both species.
##Preparation and QC:
* [Prepare annotation files for mapping ](annotationInfo.html)
* [Remove SNORNAs from the annotations](removeSNOfromAnno.html)
* [Mapping statistics and plots](mapStats.html)
* [Liftover PAS](liftoverPAS.html)
* [Annotated PAS and make APA phenotype](annotatePAS.html)
* [PCA plot to look at variation in data](NromalizePhenos.html)
* [Correlation between individuals](CorrbetweenInd.html)
* [Verify sample identity](verifyBAM.html)
* [3' Seq enrichement along gene bodies](EnrichmentEndofGenes.html)
* [Cross map and prepare Chimp 3' for Viz in IGV](CrossChimpThreePrime.html)
* [Make PAS bed for 5% peaks](fixPeak2PAS.html)
## Choosing cuttoffs for usage and expression level
* [Use merged 3' at gene level to look for a data quantity cutoff](expressioncutoff.html)
* [Subsampleing apaQTL lines (not redone)](establishCutoffs.html)
* [Apply the expression filter](applyDoubleFilt.html)
* [Compare before and after cutoff](compareFilter.html)
##Catalog PAS:
* [Number of PAS in each species double filter](PASnumberSpecies_doubleFilter.html)
* [Top 2 signal sites double filter](signalsites_doublefilter.html)
* [Dominant PAS in Nuclear Fraction double filter](dominantPAS_DF.html)
* [Intronic location for dominant PAS double filter](DominantPASintronLoc_DF.html)
* [Test for dominant PAS for total fraction](TotalDomStructure.html)
* [Compare dominance between fractions](DomTotalvNuclear.html)
* [Species Specificity in PAS double filter](speciesSpecific_DF.html)
* [PAS number distribution according to dominance](ConservationofPASNumber.html)
* [PAS number distribution according to dominance in total](ConservationofPASNumberTotal.html)
* [Correlation between fraction in each species](SpecBetweenfrac.html)
* [Look at dominance by difference between dominant PAS and next highest used](NewDomDef.html)
* [New Dominance and species specific PAS](specSpecDom.html)
##Differential Isoform Analysis:
* [Nuclear Human vs. Chimp Analysis Double filter](Nuclear_HvC_DF.html)
* [Explore nuclear differential APA PAS and genes Double filter](ExploredAPA_DF.html)
* [Enrichment for differencial APA and apaQTL double filter](dAPAandapaQTL_DF.html)
* [Visualization for all Nuclear DAPA genes double filter](visualizePASUsage_DF.html)
* [Does presence or absense of signal explain differential usage](SS_diffUsage.html)
* [Protein interactions](PTM_analysis.html)
* [Explore coding dPAS](CodingdPAS.html)
* [New Dominance method and usage or differential usage](UsageForDomPAS.html)
###Use annotations to understand mechanisms
* [Identify orthologous 3'UTR set](ortho3UTR.html)
* [Look at differentially used PAS by length of UTR](LongShortUTR_2PAS.html)
* [Look at miRNA density](miRNAanalysis.html)
##Quantify Phenotypes with RNA:
* [Differential Splicing](diffSplicing.html)
* [Differential Expression](diffExpression.html)
* [Metadata by species](MetabySpecies.html)
* [DE genes and eQTLs](DEandeQTls.html)
* [Compare DE and DS genes](DEvDS.html)
##Intersect with outside molecular phenotypes:
* [Download data from differential protein and ribosomes](dProtanddRibo.html)
* [Compare to nuclear differential APA with Double filter](DiffTransProtvsNuclearAPA_DF.html)
* [Use UpSetR to vizualize intersections Double filter](upsetter_DF.html)
* [Effect size correlations double filter](correlationPhenos_DF.html)
* [Compare species 5' splice site strength](SpliceSiteStrength.html)
* [Compare dominance structure and DE genes](DomPatternandDE.html)
* [PolII](pol2.html)
* [New Dominance method and DE](NewDomandDE.html)
##Selection exploration
* [Explore PAS conservation](dAPA_Conservation.html)
* [Use directional selection from Khan](DirSelectionKhan.html)
* [Conservation based on dominance structure](ConsandDom.html)
##Explore effect size relationships
* [Explore more about APA expression relationship](ExploreAPAexpression.html)
* [Explore more about APA expression relationship with overlaping significant genes](ExploreAPAexpressionOverlap.html)
* [Apa and expression relationship for seperated by location of PAS](DiffUsedIntronic.html)
* [Look at genes that have differences in protein levels and APA levels but not mRNA expression](dAPAdPnotE.html)
##Evaluate Reasons for inconsistant dominance structure (not rerun after annotation change 4/20)
List of everything we tried for this section:
* [List of Test dominance structure](listFixes.html)
###Number of Ns
* [Look at number of reads lost if you exclude reads with N](index.html)
* [Base composition](BaseComp.html)
###Misprimming
* [Change misprimming parameters](changeMisprimcut.html)
Liftover
* [Liftover for 4](liftMis4.html)
* [Liftover for 5](liftMis5.html)
* [Liftover for 6](liftMis6.html)
Annotate and make phenotype
* [Annotation for 4](annoMis4.html)
* [Annotation for 5](annoMis5.html)
* [Annotation for 6](annoMis6.html)
Evaluate results:
* [Dominant PAS distribution after adjusting filters](NewFilterDomPAS.html)
###Annotation differences
* [Test for annotation bias](annotationBias.html)
###Other ideas:
* [Add usage cutoffs for the dominant PAS](DomPASDiffUsed.html)
* [Overlapping PAS](OverlappingPAS.html)
* [Test quantification methods](TrynewQuantMethods.html)
* [Look at dominance sturcture without multimapping and SNOrnas](checkResnoMM.html)
* [Distribution with normalized usage](DominantPAS_usingNorm.html)
* [Filter out ties and genes with 1 PAS](DominantNoTies.html)
* [Quantile Normalzie](PPASqauntnorm.html)
###Test multimapping and FC criteria
* [Look at new parameters for multimapping](multimapParameters.html)
* [Assess which PAS are in orthoexon file](orthoexonAnno.html)
* [Experiemnt with quality and FC](MMExpreiment.html)
* [Redo the quantification with primary reads](DomwithPrimary.html)
##Visualization
* [Try to use gviz](GvizPlots.html)
##Extra QC and Analysis
* [Compare identified PAS with apaQTL PAS](comp2apaQTLPAS.html)
* [Test for overlaps within species](MergePASwithin.html)
* [Check res after removing 18499](resultswo18499.html)
### Sample switch analysis
* [Investigate bad RNA seq samples](InvestigateBadSamples.html)
* [Differential splicing without bad samples](diffSplicing_removeBad.html)
* [Differential Expression without bad samples](diffExpressionRemove2.html)
### Mapping problems with pantro5 and multimapping
* [Map RNA to hg19](MapRNAhg19.html)
* [Evaluate multimapping criteria](multiMap.html)
* [Map to opposite Genomes](OppositeMap.html)
* [Evaluate Pantro 5 problems](investigatePantro5.html)
From before reseq:
* [Total Fraction](NormalizePheno_total.html)
* [Total Fraction- Prenormalized](NonNormPheno_total.html)
### Chosing between pantro files
* [Compare pantro5 and pantro6 orthoexon](compareOrthoExon.html)
* [Differential Expression with Pantro6](diffExpressionPantro6.html)
* [Compare DE in pantro5 vs pantro6](DE_pantro5vspantro6.html)
### Extra comparisons or analysis: not redone after double filter
* [Human Total vs Nuclear Analysis](Human_TvN.html)
* [Chimp Total vs Nuclear Analysis](Chimp_TvN.html)
* [Dominant PAS in Total Fraction](dominantPAS_total.html)
* [Total Human vs. Chimp Analysis](Total_HvC.html)
* [Venn diagrams for genes in each differential APA set](CompHC_totalandnuclear.html)
From the original nuclear dominant
* [Does signal site explain trend?](ss_sameDomdiffDom.html)
* [Look at Intronic Dominant with DPAU and DE](HumanIntronicDominant.html)
* [Enrichment for differencial APA and eQTL](DapaineQTL.html)
* [Look at dAPA and genes from Dominant PAS analysis](DominatPASandDIffAPA.html)
###PreDouble filter analysis
* [Number of PAS in each species](PASnumperSpecies.html)
* [Pressence of top 2 signal sites](signalsites.html)
* [Dominant PAS in Nuclear Fraction](dominantPAS.html)
* [Intronic location for dominant PAS](DominantPASintronLoc.html)
* [Species Specificity in PAS](speciesSpecific.html)
* [Nuclear Human vs. Chimp Analysis](Nuclear_HvC.html)
* [Explore nuclear differential APA PAS and genes](Explore.html)
* [Enrichment for differencial APA and apaQTL](andapaQTL.html)
* [Visualization for all Nuclear DAPA genes](visualizePASUsage.html)
* [Compare to nuclear differential APA](DiffTransProtvsNuclearAPA.html)
* [Use UpSetR to vizualize intersections](upsetter.html)
* [Effect size correlations](correlationPhenos.html)
* [Normalize Nuclear Fraction](NormalizePheno_nuclear.html)
* [Mediation analysis](mediation.html)
###Human v Chimp fraction differences - pre- march rerun
* [Differential isoform between fractons by species](TotalVNuclearBothSpecies.html)
####Mediation tests
* [Normalize (qqnrom) Nuclear Fraction double filter](NormalizePheno_nuclear_DF.html)
* [Normalize clusters from splicing analysis](normalizeClusters.html)
* [Mediation analysis double filter](mediation_DF.html)
* [Mediation with protein](mediation_prot.html)