-
Notifications
You must be signed in to change notification settings - Fork 21
/
ExploringGraphs.Rmd
325 lines (234 loc) · 12.6 KB
/
ExploringGraphs.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
---
title: "Exploring Graphs"
author: "James Thompson"
date: 16/02/2023
bibliography: BrailleRPublications.bib
vignette: >
%\VignetteIndexEntry{Exploring Graphs}
%\VignetteEngine{knitr::rmarkdown}
output: knitr:::html_vignette
---
*Problem*
Much of statistics course work in your first year will revolve around making and interpreting graphs. Now for a sighted user R and R studio with ggplot2 etc it is quite a straight forward procedure. However without the use of sight this becomes a challenge.
*Solution*
In BrailleR there are three avenues that can be taken to fix this they are: [Describe] is for learning, [VI] is about getting quick information and [SVG Plots] are for a detailed dive into a plot. Each of these are designed to serve the user and give them potential to 'view' the information a graph has to offer.
*Note*
All of the code below is run after these 'setup' functions
```{r, setupLibrarys, message = FALSE, warnings = FALSE}
library(ggplot2)
library(BrailleR)
```
# Describe
This function will take a graph object and give you the non contextual information about the graph layers.
It is designed as a learning tool to help you understand what a particular graph is trying to show you. It also provides applicable information about how this graph type works in ggplot.
An example of this would be something like this
```{r Basic plot Describe}
basicPlot <- ggplot(iris, aes(Sepal.Length, Sepal.Width)) +
geom_point()
Describe(basicPlot)
```
As can be seen there is a title a general description and then some r hints.
This structure is the same for all Describe output expect for when using it in `MakeAccessibleSVG()`. However more can be found about that here: [How to use]
## Multiple layers
If you have multiple layers in the plot it will default to asking you what layers you want.
You can always tell it which layer to describe by giving the layer number or vector of numbers to the `whichLayer` argument. However more about this can be read in the function documentation.
# VI
This function can be thought of as a text 'Visual Inspection' of the graph.
It has enough information that you might be able to draw some conclusions however it mostly will tell you about the layout of the graph. This function can also be used to investigate non graph objects like lm and aov etc. Only the graph application will be discussed here.
## ggplot
There is full support here for many layers as well as faceted displays (multiple plots in one graph object).
For every graph there will be a few lines at the start telling you the title subtitle what the axes are and there tick marks. This is the same across all of the graph printouts. It is the individual layer sections can be quite different.
The VI function is built into the print function for ggplot objects. This means anytime you have BrailleR loaded and try to display a ggplot it will give you the text output.
This effect can be seen below
```{r VI implicit call}
basicPlot <- ggplot(iris, aes(Sepal.Length, Sepal.Width)) +
geom_point()
basicPlot
```
However you can also easily call it explicitly without having to print the graph.
```{r VI explicit call}
VI(basicPlot)
```
Below is not only a list of the supported Geom but also a little explanation about there printouts.
### GeomHLine
```{r VI geomHLine, fig.show='hide'}
hline <- ggplot(mtcars, aes(mpg, cyl)) +
geom_hline(yintercept = 5)
hline
```
As this is quite a simple geom it is quite a simple printout.
It will tell you how many lines and at what height they are at.
### GeomPoint
```{r VI geomPoint, fig.show='hide'}
point <- ggplot(mtcars, aes(mpg, cyl)) +
geom_point()
point
```
Information will include here the number of points and the shape.
There is also a percentage of approximately how many points can be seen. This is used to help you understand the over plotting. This is only effective for when the points sizes are not too big (size < 18). Don't worry it will only show you the percentage if it is accurate.
### GeomBar / GeomCol
```{r VI geomBar, fig.show='hide'}
bar <- ggplot(mtcars, aes(mpg)) +
geom_histogram()
bar
```
Even though there is a geom_histogram, geom_bar and geom_col in the backend of ggplot it is the same. So for the VI they are treated all pretty much the same.
There will be information just on the number of bars displayed.
### GeomLine
```{r VI geomLine, fig.show='hide'}
line <- ggplot(mtcars, aes(mpg, wt)) +
geom_line()
line
```
Very similar to the hline it will tell you about how many lines there are.
Then for each line it will tell you the number of points that make up the line.
### GeomBoxplot
```{r VI geomBoxplot, fig.show='hide'}
boxplot <- ggplot(mtcars, aes(mpg, as.factor(cyl))) +
geom_boxplot()
boxplot
```
As a boxplot is effectively a 5 number summary with outliers this printout will include all of that information.
It shall include this summary for each boxplot in the layer.
### GeomSmooth
```{r VI geomSmooth, fig.show='hide'}
smooth <- ggplot(mtcars, aes(mpg, wt)) +
geom_smooth()
smooth
```
This output tell you the method used to get the smoothed curve and the confidence interval level which is set.
It also tells you what percentage of the graph is covered by the CI. This information can be used to quickly gauge how confident the graph is
### GeomRibbon / GeomArea
```{r VI geomRibbon, fig.show='hide'}
ribbon <- ggplot(diamonds, aes(x = carat, y = price)) +
geom_area(aes(y = price))
ribbon
```
Once again this layer gives you some information about the layers data.
It will tell you whether it is bound on the y or x axis and then if it is constant or non constant width.
For it to be bound on the y axis means that it covers the whole range of x and vice versa for y.
It includes some widths and centers throughout 5 points in the layer. The width will either be bottom to top for y bound or left to right for x bound.
It also includes the area covered statistic found in the smooth layer.
### GeomBlank
```{r VI geomBlank, fig.show='hide'}
blank <- ggplot(BOD, aes(x = demand, y = Time)) +
geom_line() +
expand_limits(x = c(15, 23, 6), y = c(30))
blank
```
GeomBlank is made by the expand_limits function. It is a normal layer like all the others however it just doesn't have any points to be displayed.
So what this VI does is explain the effect that this layer had on the limits. It shall tell you how much larger the x and y axis are.
## Base
Base R graphics support is kept in here and does work to some extant. However it should be considered deprecated and replaced by the ggplot graphics.
There is support for the base plots:
- Boxplot
- Dotplot
- Histogram
# SVG Plots
SVG plots in BrailleR are used create webpages that a user can explore by using arrow keys and some basic navigation buttons.
The `MakeAccessibleSVG()` function is the easiest way to do this.
You simply pass it a graph object and it will create the webpage and load it for you in your default browser.
The functions that `MakeAccessibleSVG()` uses to create SVGs are `SVGThis()`, `AddXML()` and `BrowseSVG()`. These are available to use but really shouldn't be needed.
There is one last function `ViewSVG()`. This will bring up a webpage that has a list and links of all of the svg webpages available in the current working directory.
Below is the code used to make the example
```r
plot.example = ggplot(mtcars, aes(wt, mpg)) +
geom_point() +
geom_smooth()
MakeAccessibleSVG(plot.example)
```
[example SVG webpage](../rawHTML/plot.example-SVG.html)
## How to use
Using the function is as simple as passing a graph object to `MakeAccessibleSVG()` like this:
```r
simplePlot = ggplot(mtcars, aes(mpg, wt)) +
geom_point()
MakeAccessibleSVG(simplePlot)
```
On completion of the function it will open the webpage in your default web browser.
On the webpage there are 4 sections.
1. The graph
2. VI output
3. List of keys to help explore graph
4. Describe output.
The VI and Describe output is simply there as convenience.
The VI will look exactly the same as the output to console.
However the describe will look slightly different with that first title line being changed to just the geom type header. This should help with it being navigable on the webpage.
### Graph structure
I will explain a little bit about the structure of the graph and how you can explore it.
The explanations will speak of trees, sub trees, parents, children and nodes. Hopefully you are familiar with what these words mean in this context.
A quick summary is to think of a family tree. The root is Adam and Eve. Each of there descendant have there subtrees. Your parents are the final subtree for your branch and you and your siblings are all children nodes (assuming you don't have kids yet). Every person is a node.
For these accessible svgs the graph is the root. Each section has its own sub tree.
The order of subtrees goes:
1. title (if present)
2. x axis
3. y axis
4. first ggplot layer
5. second ggplot layer
6. etc...
There can be an arbitrary number of ggplot layers. These are just added in line
### How to navigate
Navigation around the graph should be somewhat intuitive.
Up arrow to go to you parent.
Down arrow to go to first child.
Left and right to go to your siblings.
Other useful keys are:
**X** which will enable the descriptive mode. This simply means you can get more / different information at any particular node you are at. Note that not all nodes will have this extra information so don't be worry if it doesn't do anything.
**A** This activate keyboard exploration so once you have focuses on the SVG you press A and then are good to with exploring using the arrow keys.
## ggplot
All of the readouts of plots numbers will be formatted and rounded to help make it easier to read.
Less geoms are supported as SVGs than are supported by VI. This is mostly because they have not yet been developed.
In general there will be no more than 5 children nodes to the geom tree.
You might be able to go into each child node of the geom to see more information or there could be a summary of the section at each child node.
Below is geom specific information
### Geom_line
```{r svg geom line, eval = FALSE, echo=FALSE}
line <- ggplot(mtcars, aes(mpg, wt)) +
geom_line()
MakeAccessibleSVG(line)
```
For this layer there we be sub trees for each line.
Within each individual line subtree if the line is disjoint there will be more subtrees for each continuous section of the line.
Once you are looking at either the continuous section or the whole section you can click through to actually look at the line.
If there are more than 5 lines then it will summarizes them. If there are 5 or less then you can press through and individually see the line start and finish locations.
### Geom_Point
```{r svg geom point, eval = FALSE, echo=FALSE}
point <- ggplot(iris, aes(Sepal.Length, Sepal.Width)) +
geom_point()
MakeAccessibleSVG(point)
```
Geom points is a lot simpler.
Depending on the number of points the information the children have will be different.
If there are more than 5 points it will display a summary of the points.
If there are 5 or less points you will get a summary of each individual point.
It is worth noting that due to current technical difficulties the points will not be highlighted at all. More information can be found at [Github issue](https://github.com/ajrgodfrey/BrailleR/issues/90)
### Geom_Bar
```{r svg geom bar, eval = FALSE, echo=FALSE}
bar <- ggplot(iris, aes(Sepal.Length)) +
geom_bar()
MakeAccessibleSVG(bar)
```
This geom is also quite simple. There will be a children node for each of the bars in the plot.
There is some slight differences between the histograms and the bar charts.
For continuous x axis graphs you get information about the width of the bar its height and the density.
For categorical you will simple get the the location on x axis and the value on the y axis.
There is no summary as of this moment regardless of how many bars are in the plot.
### Geom_Smooth
```{r, svg geom smooth, eval = FALSE, echo=FALSE}
smooth <- ggplot(mtcars, aes(mpg, wt)) +
geom_smooth()
MakeAccessibleSVG(smooth)
```
For the smoother the graph will be split into 5 sections no matter how few underlying data points there are.
This means there are 5 children to the geoms node.
Each child will either have one or two children.
The first child will be the line and the second child would be the confidence interval if it is present in the graph (This is determined by the `se` argument in `geom_smooth()`)
You can however see simple information about the line and confidence interval from the section nodes description.
## Base R
Like with VI there is some support for base r graphics however all of this support could be considered deprecated.
These are:
- Dotplot
- Boxplot
- Scatterplot
- Histogram
- Tsplot