Presentation, the code, and exercises to accompany the QuantBio workshop in Fiesch, 15-20 September, 2019.
presentation-intro.pdfintroductory presentation with background information.presentation-main.RpresandhtmlR presentation with workshop lecture and exercises.analysis-student.RmdR notebook with the analysis for exercises. Contains the code with placeholders to fill in during the workshop. Can be compiled (in Rstudio) to a finalhtmldocument.analysis-main.Rmdandhtml. R notebook with full analysis from raw data to plots. The associatedhtmlfile contains a final document with code and results.datafolder with raw in intermediate data files.
The workflow is written in R and takes advantage of RStudio notebooks. Please download the latest RStudio from here.
The analysis uses the following packages that need to be present in your RStudio installation:
data.tablefor fast processing of large datasetsreadxlfor reading Excel filesR.utilsfor directly reading comressed filesimputeTSfor data imputation such as interpolation of NA'sggplot2for plottingplotlyfor interactive plotsgplotsfor plotting heatmapsd3heatmapfor plotting interactive heatmapsdendextendfor modifying dendrogramsRColorBrewerfor extended colour palettesscalesfor percentages on y-axis in ggplotsdtwclustfor dynamic time warping distance measuremagrittrfor pipes (typically loaded by other packages)factoextrafor extracting and visualisation of the results of multivariate data analysesclusterfor extended cluster analysisNbClustfor determining the best number of clusters
Install these packages by typing:
install.packages(c(
"data.table", "tidyverse",
"readxl", "R.utils",
"ggplot2", "gplots", "plotly", "d3heatmap",
"dendextend", "RColorBrewer", "scales",
"imputeTS",
"dtwclust", "factoextra", "cluster", "NbClust"))