This repository showcases SQL Data Cleaning and Exploratory Data Analysis (EDA) projects using MySQL.
The first project focuses on cleaning a dataset containing layoff data from around the world. It involves several steps to ensure data integrity and consistency.
- Removing Duplicates: Identifying and removing duplicate rows from the dataset.
- Standardizing the Data: Identifying and fixing data issues like standardizing company names, industry names, and country names.
- Handling Null or Blank Values: Populate missing values and handle null or blank entries in the dataset.
- Removing Irrelevant Columns: Safeguarding the data by creating a new table mirroring the original one and deleting unnecessary rows.
The second project involves conducting exploratory data analysis (EDA) on the cleaned dataset to gain insights and extract useful information.
- Analyze layoffs by various factors such as industry, country, location, and stage.
- Calculate total funds raised by each company per year.
- Determine the date range of the dataset and analyze layoffs by date, year, and month.
- Calculate rolling total of layoffs by month to identify trends over time.
- Analyze layoffs by company and year, displaying the top 5 companies with the highest total layoffs each year.
Both projects are implemented using SQL queries executed using MySQL.