# Data visualisation

Visualising your data should always be the first step of your data analysis. Never start with the statistical tests. Even before you make graphs from your data, draw on a piece of paper what you expect to see. Because you have planned your experiment, set your hypotheses, and collected your data, you already have a pretty good idea of what to expect.

In this `Notebook`, we are going to learn the basics of making nice figures with the package `ggplot2`.

So, if you are using RStudio, start a new script.

In [None]:
# Plotting with ggplot2
# data
# Make as many annotations as necessary

# clear the workspace
rm(list = ls())

# load the libraries

library(ggplot2)
library(dplyr)

In [None]:
data <- read.csv("datasets-master/compensation.csv")

glimpse(data)


## Scatterplots

In [None]:
# ggplot2 grammar: data, aesthetic mappings (aes), and a geometry layer (geom)
ggplot(data, aes(y=Fruit, x=Root))+
geom_point()

In [None]:
# Change the size of the points
ggplot(data, aes(y=Fruit, x=Root))+
geom_point(size= 3)

In [None]:
# Change the size of the points
# color the points by groups, e.g. Grazing
ggplot(data, aes(y=Fruit, x=Root, colour= Grazing))+
geom_point(size = 3)

### Interpret the plot first, then customize it

In [None]:
# Change the size of the points
# color the points by groups, e.g. Grazing
# remove the gray background
ggplot(data, aes(y=Fruit, x=Root, colour= Grazing))+
geom_point(size = 3) + 
theme_bw()

In [None]:
# Change the size of the points
# color the points by groups, e.g. Grazing
# remove the gray background
# change axis names
ggplot(data, aes(y=Fruit, x=Root, colour= Grazing))+
geom_point(size = 3) + 
theme_bw() + 
xlab("Root Biomass") +
ylab("Fruit Production")
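The cells above retype the whole plot each time a layer is added. As a minimal sketch, the same build-up can be done by storing the plot in an object and adding to it. The data frame `toy` and the object names `p` and `p_final` are made up for illustration; `toy` only mimics the column names of the compensation data.

```r
library(ggplot2)

# Made-up values for illustration only; this is not the compensation data
toy <- data.frame(
  Root    = c(5.2, 6.1, 7.3, 8.0, 6.5, 7.8, 9.1, 9.9),
  Fruit   = c(20, 31, 45, 60, 40, 58, 80, 95),
  Grazing = rep(c("Grazed", "Ungrazed"), each = 4)
)

# Build the base plot once and keep it in an object
p <- ggplot(toy, aes(x = Root, y = Fruit, colour = Grazing)) +
  geom_point(size = 3)

# Add the theme and the axis labels to the stored plot
p_final <- p +
  theme_bw() +
  xlab("Root Biomass") +
  ylab("Fruit Production")

# Draw the plot (typing the object name alone also works in a notebook)
print(p_final)
```

Storing the plot keeps each customisation step short, and the same base plot can be reused with different themes or labels.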

### When and why to use scatterplots?

## Box-whisker plots

In [None]:
# compare the fruit production between grazed and ungrazed plants
ggplot(data, aes(y=Fruit, x=Grazing))+
geom_boxplot() + 
theme_bw() + 
ylab("Fruit Production")

In [None]:
# compare the fruit production between grazed and ungrazed plants
# overlay the raw data points on the box plot
ggplot(data, aes(y=Fruit, x=Grazing))+
geom_boxplot() + 
geom_point(size = 5, colour= "gray", alpha =0.7)+
theme_bw() + 
ylab("Fruit Production")

In [None]:
# compare the fruit production between grazed and ungrazed plants
# jitter the raw points horizontally so they do not overlap
ggplot(data, aes(y=Fruit, x=Grazing))+
geom_boxplot() + 
geom_jitter(size = 5, colour= "gray", alpha =0.8) +
theme_bw() + 
ylab("Fruit Production")

In [None]:
# compare the fruit production between grazed and ungrazed plants
# narrow the jitter (width) so the points stay close to their group
ggplot(data, aes(y=Fruit, x=Grazing))+
geom_boxplot() + 
geom_jitter(size = 5, colour= "gray", alpha =0.8, width = 0.25) +
theme_bw() + 
ylab("Fruit Production")
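To read the box plots above, it helps to know what the box and whiskers are computed from. The sketch below reproduces those numbers with base R for one group, assuming the default `geom_boxplot()` settings (box from the first to the third quartile, whiskers reaching the most extreme points within 1.5 times the interquartile range of the box). The vector `fruit` holds made-up values, not the compensation data.

```r
# Made-up fruit production values for one group; illustration only
fruit <- c(20, 31, 45, 60, 40, 58, 80, 95)

# Quartiles: the box spans Q1 to Q3 and the line inside it is the median
q <- quantile(fruit, probs = c(0.25, 0.5, 0.75))

# Interquartile range: the height of the box
iqr <- IQR(fruit)

# Limits for the whiskers; points beyond them are drawn as outliers
lower_limit <- q[["25%"]] - 1.5 * iqr
upper_limit <- q[["75%"]] + 1.5 * iqr

q
c(lower = lower_limit, upper = upper_limit)
```

Here every value falls inside the limits, so a box plot of `fruit` would show no separate outlier points.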


### When and why to use boxplots?

## Distributions: histograms
Histograms give clues about the shape of the distribution, the central tendency, and the spread of the data.
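The central tendency and spread that a histogram hints at can also be computed directly, which is a useful check on what you read from the figure. This is a minimal sketch with `dplyr`; the data frame `toy` and the column names in `toy_summary` are made up for illustration.

```r
library(dplyr)

# Made-up values for illustration only; this is not the compensation data
toy <- data.frame(
  Fruit   = c(20, 31, 45, 60, 40, 58, 80, 95),
  Grazing = rep(c("Grazed", "Ungrazed"), each = 4)
)

# Mean and median (central tendency) and standard deviation (spread) per group
toy_summary <- toy %>%
  group_by(Grazing) %>%
  summarise(
    mean_fruit   = mean(Fruit),
    median_fruit = median(Fruit),
    sd_fruit     = sd(Fruit)
  )

toy_summary
```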

In [None]:
# Make a histogram (by default ggplot2 uses 30 bins and suggests picking a binwidth)

ggplot(data, aes(x=Fruit)) +
geom_histogram()

In [None]:
# Set the bin width, in the units of Fruit
ggplot(data, aes(x=Fruit)) +
geom_histogram(binwidth= 13)
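The `binwidth` is given in the units of the plotted variable, so a sensible value depends on the range of the data. One possible way to choose it, sketched here, is to divide the range by the number of bins you would like. The vector of values and the choice of `n_bins` are made up for illustration, and the number of bars drawn may differ slightly from `n_bins` because of how the bins are aligned.

```r
library(ggplot2)

# Made-up values for illustration only; this is not the compensation data
toy <- data.frame(Fruit = c(20, 31, 45, 60, 40, 58, 80, 95))

# Arbitrary target number of bins for this sketch
n_bins <- 5

# Bin width = range of the data divided by the target number of bins
bw <- diff(range(toy$Fruit)) / n_bins

p_hist <- ggplot(toy, aes(x = Fruit)) +
  geom_histogram(binwidth = bw)

print(p_hist)
```

Trying a few values of `n_bins` shows how much the apparent shape of the distribution depends on the bin width.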

In [None]:
# Facets: one panel per level of Grazing

ggplot(data, aes(x=Fruit)) +
geom_histogram(binwidth = 13)+
facet_wrap(~ Grazing)
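When the aim is to compare distributions between groups, stacking the panels in a single column can help, because the panels then share the same horizontal axis. A minimal sketch using the `ncol` argument of `facet_wrap()` follows; the data frame `toy` and the bin width are made up for illustration.

```r
library(ggplot2)

# Made-up values for illustration only; this is not the compensation data
toy <- data.frame(
  Fruit   = c(20, 31, 45, 60, 40, 58, 80, 95),
  Grazing = rep(c("Grazed", "Ungrazed"), each = 4)
)

# ncol = 1 puts the panels on top of each other instead of side by side
p_facet <- ggplot(toy, aes(x = Fruit)) +
  geom_histogram(binwidth = 15) +
  facet_wrap(~ Grazing, ncol = 1)

print(p_facet)
```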

## Saving figures

In [None]:
# ggsave() saves the last plot displayed; the file extension sets the format
ggsave("MyHistogram.png", dpi = 800)
ggsave("MyHistogram.pdf")
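Because `ggsave()` saves the last plot displayed unless told otherwise, it can be safer to pass the plot explicitly and to fix the figure size. The sketch below does this with the `plot`, `width`, `height`, and `units` arguments. The data frame `toy`, the figure size, and the file name are made up for illustration, and the file is written to a temporary folder so that nothing in your project is overwritten.

```r
library(ggplot2)

# Made-up values for illustration only; this is not the compensation data
toy <- data.frame(Fruit = c(20, 31, 45, 60, 40, 58, 80, 95))

# Store the plot so it can be passed to ggsave() by name
p_hist <- ggplot(toy, aes(x = Fruit)) +
  geom_histogram(binwidth = 15)

# Hypothetical output location: a file in the session's temporary folder
out_file <- file.path(tempdir(), "MyHistogram_example.pdf")

# Save this specific plot at a fixed size (12 cm wide, 8 cm high)
ggsave(out_file, plot = p_hist, width = 12, height = 8, units = "cm")
```

Fixing the size in the call means the saved figure does not depend on the size of the plotting window at the time of saving.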