Skip to content

christakakis/r_data_analysis

Repository files navigation

R Data Analysis Repository

Analysis and Visualization with R for various datasets.

The main goal of the repo is to understand in depth the most famous libraries in R language and visualize various datasets. The codes will be presented in format of R Markdown and R Shiny Dashboard.

Briefly, the repo contains:

(1) Simple R functions - Dataset tidyr::who. Using the simplest R functions to transform the dataset into tidy format.

(2) Simple R plots with ggplot2 - Datasets queen & mcdonalds. Plotting the simplest possible R plots. Dataset queen.csv contains characteristics for each song of the Queen albums as derived from Spotify. Mcdonalds.csv contains the price of Big-Mac in local currency for various countries and years. In general, this R Markdown contains Faceted ScatterPlots, BoxPlots, Histograms and BarPlots.

(3) Rules, Correcting, Imputing with R - Dataset dirty_iris. With the use of deducorrect, editrules and VIM the dataset is transofrmed into tidy. In general, this R Markdown contains numerical and caterogical rules, violations, hotdeck imputation and lastly some plots.

(4) Visualization of the Olympic Games with R - Dataset results. Dataset results.csv contains results of the track and field events of all the Olympic Games events until 2016. In general, this R Markdown contains the preprocessing steps as well as univariate and multivariate analysis, time series analysis techniques and interesting graphs for this dataset.

(5) Interactive Maps with use of cshapes, leaflet and tmap (Dashboard Included) - Dataset world from cshapes. Manage and visualize geographic data with world datset from cshapes. This folder contains both R Markdown and Shiny Dashboard. Distance thresholds, buffer from capitals, distance from country's centroid are included.

(6) Graph plots and shortest paths (Dashboard Included) - Dataset world from cshapes. Manage and visualize network data again with world datset from cshapes. This folder contains also both R Markdown and Shiny Dashboard. Directed graph of capitals and their distances, shortest path between capitals considering weight distance or total number of nodes are included.

(7) Interesting interactive visualizations (Dashboard Included) - Dataset album. Managing data about music albums, their genres, titles, year and artists. The Dashboard is available for preview on my account on ShinyApps.

(8) Visualizations of NYPD shooting incidents (Dashboard Included) - Various Datasets. Managing data about NYC shootings and trying to plot something interesting.

This repository was initially created to store my personal R Markdowns but also be available to others trying to build or understand something similar. The R codes contained in this repo are made specifically for an EDAV course of my MSc program.

Releases

No releases published

Packages

No packages published

Languages