This repository contains my exploratory data analysis projects using R and several other packages included in Tidyverse. All source code for these projects can be found down below.
- Analyzing Forest Fires Data - This project performs exploratory data analysis to show which months and days of the week had greater number of forest fires.
- Association Rules - In this project, I have used the titanic dataset to perform association rule mining to determine which class of passengers survived the tragedy using the
arules
andarulesViz
packages. - Basic COVID-19 Analysis - This project performs a simple analysis using
Hmisc
package to determine which age group and gender has higher chances of death from the virus. - EDA of Diamonds Dataset - This project uses the
diamonds
dataset ofggplot2
package to perform exploratory data analysis using few statistical methods such as summary statistics, hypothesis tests and statistical distributions - Investigating COVID Virus Trends - This project performs kaggle data analysis to understand which country has the highest number of cases against total number of testing performed using
tidyverse
package. - NYC School Perceptions - Responses to surveys designed to gauge parent, student, and teacher perceptions of the quality of New York City schools. In this project, a few questions with respect to the same are answered.
- Text Mining Emily Bronte Novels - This Project involved Text Mining and Sentiment Analysis to predict the tone of the book using
gutenbergr
package. - Titanic Dataset Analysis - This project contains Exploratory Data Analysis results of the titanic dataset that is publically available on Kaggle. I have used the following packages
readr
,dplyr
andggplot2
of tidyverse
This repository is licensed under MIT License.