MIC training: Modern data analysis in R/RStudio
Presentation, the code, and exercises to accompany the MIC training at UniBe, November 3, 2020.
File description
presentation-main.Rpres
andhtml
- the main presentation with workshop lecture and exercises,examples
- code snippets and examples for the presentation,practicals
- data, presentation, and code for the practical session.
Required software
Please download the latest versions of the following:
Required R packages
The workflow is written in R and takes advantage of RStudio notebooks. The analysis uses the following packages that need to be present in your RStudio installation:
R.utils
for directly reading comressed filesmagrittr
for pipes (typically loaded by other packages)data.table
for fast processing of large datasetsreadxl
for reading Excel filesggplot2
for plottingimputeTS
for data imputation such as interpolation of NA'sggthemes
for additional color schemes in ggplotplotly
for interactive plotsRColorBrewer
for extended colour palettespheatmap
for heatmapheatmaply
for interactive heatmapsscales
for percentages on y-axis in ggplotsfactoextra
for extracting and visualisation of the results of multivariate data analysesNbClust
for determining the best number of clusterstestthat
for unit testingprofvis
for profilingforeach
with extension of thefor
loopdoParallel
for parallel computationsoptparse
for command-line parameters
Install these packages by typing the following line in the R command-line interface:
install.packages(c(
"data.table", "tidyverse",
"readxl", "R.utils",
"ggplot2", "plotly",
"pheatmap", "heatmaply",
"dendextend", "RColorBrewer", "scales",
"imputeTS",
"factoextra", "NbClust",
"testthat", "profvis",
"foreach", "doParallel",
"optparse"))
Alternatively, you can install packages using RStudio GUI by going to Tools > Install packages...