Skip to content

Gholtes/dataExploration

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Exploration

Automated data exploration in R

Getting Started

Using this tool is a simple as importing the dataExporation.R functions and passing in a DataFrame of interest.

data <- read.csv("your_data_file.csv")

source("dataExplore.R")
dataExplore(data)

Note that the scripts and RMD files need to be in the working directory for this to work.

Prerequisites

Required: R, RStudio are needed, and the dataExporation.R, univarient.R, bivarient.R, main.RMD files should be in the working directory.

Analysis

The tool creates a data summary, a univarient (distribution) analysis and a bivarient (correlation and regression) analysis. Full example outputs are available for the Titanic dataset and macroeconomic data

Data summary

High level information about the dataset.

UniVar Summary

Univarient analysis

Detailed analysis of each variable and their distributions.

UniVar Summary

Bivarient analysis

Analysis of the relationships between variables using correlation, regression and scatter plots.

UniVar Summary

UniVar Summary

License

This project is licensed under the MIT License - see the LICENSE.md file for details

About

automated data exploration with R

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages