Automated data exploration in R
Using this tool is as simple as sourcing the dataExplore.R script and passing a data frame of interest to dataExplore().
data <- read.csv("your_data_file.csv")
source("dataExplore.R")
dataExplore(data)
Note that the scripts and RMD files need to be in the working directory for this to work.
Required: R and RStudio. The dataExplore.R, univarient.R, bivarient.R and main.RMD files must all be in the working directory.
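A quick pre-flight check can confirm the working directory is set up before sourcing the script. This is a minimal sketch in base R and is not part of the tool: the helper name `check_required_files` is hypothetical, and the main script name is assumed to match the `source()` call shown earlier.

```r
# Hypothetical helper (not part of the tool): report which of the given
# files are missing from a directory.
check_required_files <- function(files, dir = getwd()) {
  missing <- files[!file.exists(file.path(dir, files))]
  if (length(missing) > 0) {
    warning("Missing from ", dir, ": ", paste(missing, collapse = ", "))
  }
  invisible(missing)
}

# File names assumed from this README; adjust if your copies are named differently.
required <- c("dataExplore.R", "univarient.R", "bivarient.R", "main.RMD")
missing_files <- check_required_files(required)
if (length(missing_files) == 0) message("All required files found.")
```

If anything is reported missing, either copy the files into the working directory or change it with `setwd()` before calling `source()`.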
The tool creates a data summary, a univariate (distribution) analysis and a bivariate (correlation and regression) analysis. Full example outputs are available for the Titanic dataset and macroeconomic data.
Data summary: high-level information about the dataset.
Univariate analysis: a detailed look at each variable and its distribution.
Bivariate analysis: the relationships between variables, examined using correlation, regression and scatter plots.
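To give a sense of what these three stages involve, the sketch below runs rough base R equivalents on the built-in `mtcars` dataset. It is an illustration under assumptions, not the tool's actual implementation; the dataset and the choice of columns (`mpg`, `wt`) are arbitrary.

```r
dat <- mtcars  # built-in example dataset, used here only for illustration

# Data summary: dimensions, column types and missing-value counts
dim(dat)
sapply(dat, class)
colSums(is.na(dat))

# Univariate: numeric summary and histogram of a single variable
summary(dat$mpg)
hist(dat$mpg, main = "Distribution of mpg", xlab = "mpg")

# Bivariate: correlation, simple linear regression and a scatter plot
cor(dat$wt, dat$mpg)
fit <- lm(mpg ~ wt, data = dat)
summary(fit)
plot(dat$wt, dat$mpg, xlab = "wt", ylab = "mpg")
abline(fit)  # overlay the fitted regression line
```

The tool itself produces its report from main.RMD, so its output will be more complete than these individual calls.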
This project is licensed under the MIT License. See the LICENSE.md file for details.