-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathindex.Rmd
More file actions
252 lines (180 loc) · 8.16 KB
/
index.Rmd
File metadata and controls
252 lines (180 loc) · 8.16 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
---
title: Using Bioconductor for Microarray Analysis
output:
rmdformats::readthedown:
toc_depth: 3
number_sections: true
---
Bioconductor has advanced facilities for analysis of microarray
platforms including Affymetrix, Illumina, Nimblegen, Agilent, and
other one- and two-color technologies.
Bioconductor includes extensive support for analysis of expression
arrays, and well-developed support for exon, copy number, SNP,
methylation, and other assays.
Major workflows in Bioconductor include pre-processing, quality
assessment, differential expression, clustering and classification,
gene set enrichment analysis, and genetical genomics.
Bioconductor offers extensive interfaces to community resources,
including GEO, ArrayExpress, Biomart, genome browsers, GO, KEGG, and
diverse annotation sources.
# Sample Workflow
```{r echo=FALSE}
suppressPackageStartupMessages(library(limma))
suppressPackageStartupMessages(library(affy))
```
The following code illustrates a typical R / Bioconductor
session. It uses RMA from the `affy` package to pre-process Affymetrix
arrays, and the `limma` package for assessing differential expression.
```{r}
## Load packages
library(affy) # Affymetrix pre-processing
library(limma) # two-color pre-processing; differential
# expression
## import "phenotype" data, describing the experimental design
phenoData <-
read.AnnotatedDataFrame(system.file("extdata", "pdata.txt",
package="arrays"))
## RMA normalization
celfiles <- system.file("extdata", package="arrays")
eset <- justRMA(phenoData=phenoData,
celfile.path=celfiles)
## differential expression
combn <- factor(paste(pData(phenoData)[,1],
pData(phenoData)[,2], sep = "_"))
design <- model.matrix(~combn) # describe model to be fit
fit <- lmFit(eset, design) # fit each probeset to model
efit <- eBayes(fit) # empirical Bayes adjustment
topTable(efit, coef=2) # table of differentially expressed probesets
```
A top table resulting from a more complete analysis, described in
Chapter 7 of [Bioconductor Case
Studies](/help/publications/books/bioconductor-case-studies/),
is shown below. The table enumerates Affymetrix probes, the log-fold
difference between two experimental groups, the average expression
across all samples, the t-statistic describing differential
expression, the unadjusted and adjusted (controlling for false
discovery rate, in this case) significance of the difference, and
log-odds ratio. These results can be used in further analysis and
annotation.
ID logFC AveExpr t P.Value adj.P.Val B
636_g_at 1.10 9.20 9.03 4.88e-14 1.23e-10 21.29
39730_at 1.15 9.00 8.59 3.88e-13 4.89e-10 19.34
1635_at 1.20 7.90 7.34 1.23e-10 1.03e-07 13.91
1674_at 1.43 5.00 7.05 4.55e-10 2.87e-07 12.67
40504_at 1.18 4.24 6.66 2.57e-09 1.30e-06 11.03
40202_at 1.78 8.62 6.39 8.62e-09 3.63e-06 9.89
37015_at 1.03 4.33 6.24 1.66e-08 6.00e-06 9.27
32434_at 1.68 4.47 5.97 5.38e-08 1.70e-05 8.16
37027_at 1.35 8.44 5.81 1.10e-07 3.08e-05 7.49
37403_at 1.12 5.09 5.48 4.27e-07 1.08e-04 6.21
<p class="back_to_top">[ <a href="#top">Back to top</a> ]</p>
# Installation and Use
Follow [installation instructions](/install/) to start using these
packages. You can install `affy` and `limma` as follows:
```{r eval=FALSE}
## try http:// if https:// URLs are not supported
source("https://bioconductor.org/biocLite.R")
biocLite(c("affy", "limma"))
```
To install additional packages, such as the annotations associated
with the Affymetrix Human Genome U95A 2.0, use
```{r eval=FALSE}
## try http:// if https:// URLs are not supported
source("https://bioconductor.org/biocLite.R")
biocLite("hgu95av2.db")
```
Package installation is required only once per R installation. View a
/packagesfull list of [available packages](/packages/release/bioc/).
To use the `affy` and `limma` packages, evaluate the commands
```{r eval=FALSE}
library("affy")
library("limma")
```
These commands are required once in each R session.
<p class="back_to_top">[ <a href="#top">Back to top</a> ]</p>
# Exploring Package Content
Packages have extensive help pages, and include vignettes highlighting
common use cases. The help pages and vignettes are available from
within R. After loading a package, use syntax like
help(package="limma")
?topTable
to obtain an overview of help on the `limma` package, and the
`topTable` function, and
```{r eval=FALSE}
browseVignettes(package="limma")
```
to view vignettes (providing a more comprehensive introduction to
package functionality) in the `limma` package. Use
```{r eval=FALSE}
help.start()
```
to open a web page containing comprehensive help resources.
<p class="back_to_top">[ <a href="#top">Back to top</a> ]</p>
# Pre-Processing Resources
The following provide a brief overview of packages useful for
pre-processing. More comprehensive workflows can be found in
documentation (available from
[package descriptions](/packages/release/bioc/))
and in Bioconductor [Books and monographs](/help/publications/).
## Affymetrix 3'-biased Array ##
[affy](/packages/release/bioc/html/affy.html),
[gcrma](/packages/release/bioc/html/gcrma.html),
[affyPLM](/packages/release/bioc/html/affyPLM.html)
* Require cdf package, probe package and annotation package
* All these packages are available from Bioconductor via `biocLite()`
[xps](/packages/release/bioc/html/xps.html)
* Requires installation of [ROOT](http://root.cern.ch/)
* Uses data files from Affymetrix (.CDF, .PGF, .CLF, .CSV) directly
## Affymetrix Exon ST Arrays ##
[oligo](/packages/release/bioc/html/oligo.html)
* Requires a pdInfoPackage built using
[pdInfoBuilder](/packages/release/bioc/html/pdInfoBuilder.html)
* This package collates cdf, probe, annotation data together
* These packages are available from Bioconductor via `biocLite()`
* Most cases will require a 64-bit computer running Linux and >= 8Gb RAM
[exonmap](/packages/release/bioc/html/exonmap.html)
* Requires installation of MySQL and Ensembl core database tables
* Requires specially modified
[cdf](http://xmap.picr.man.ac.uk/download/) and affy package
* Requires a 64-bit computer running Linux and >= 8 Gb RAM
[xps](/packages/release/bioc/html/xps.html)
* Requires installation of [ROOT](http://root.cern.ch/)
* Uses data files from Affymetrix (.CDF, .PGF, .CLF, .CSV) directly
* Will run on conventional desktop computers
## Affymetrix Gene ST Arrays ##
[oligo](/packages/release/bioc/html/oligo.html)
* Requires a pdInfoPackage built using
[pdInfoBuilder](/packages/release/bioc/html/pdInfoBuilder.html)
* This package collates cdf, probe, annotation data together
* These packages are available from Bioconductor via `biocLite()`
[xps](/packages/release/bioc/html/xps.html)
* Requires installation of [ROOT](http://root.cern.ch/)
* Uses data files from Affymetrix (.CDF, .PGF, .CLF, .CSV) directly
## Affymetrix SNP Arrays ##
[oligo](/packages/release/bioc/html/oligo.html)
* Requires a pdInfoPackage built using
[pdInfoBuilder](/packages/release/bioc/html/pdInfoBuilder.html)
* This package collates cdf, probe, annotation and HapMap data
* These packages are available from Bioconductor via `biocLite()`
* Not yet capable of processing CNV regions in SNP5.0 and SNP6.0 </ul>
## Affymetrix Tiling Arrays ##
[oligo](/packages/release/bioc/html/oligo.html)
* Requires a pdInfoPackage built using
[pdInfoBuilder](/packages/release/bioc/html/pdInfoBuilder.html)
* This package collates data from bpmap and cif files
## Nimblegen Arrays ##
[oligo](/packages/release/bioc/html/oligo.html)
* Requires a `pdInfoPackage` built using
[pdInfoBuilder](/packages/release/bioc/html/pdInfoBuilder.html)
## Illumina Expression Microarrays ##
[lumi](/packages/release/bioc/html/lumi.html)
* Requires lumi-specific mapping and annotation packages (e.g.,
`lumiHumanAll.db` and `lumiHumanIDMapping`)
[beadarray](/packages/release/bioc/html/beadarray.html)
* Requires beadarray-specific mapping and annotation packages (e.g.,
`illuminaHumanv1BeadID.db` and `illuminaHumanV1.db`)
<p class="back_to_top">[ <a href="#top">Back to top</a> ]</p>
```{r}
sessionInfo()
```
<p class="back_to_top">[ <a href="#top">Back to top</a> ]</p>