Data Cleaning in SQL

Welcome to the Data Cleaning in SQL repository! This project demonstrates the application of SQL techniques to clean and prepare raw datasets for analysis. It serves as a practical example of how to transform messy data into structured, reliable information using SQL.

🧹 Project Overview

In this project, I focused on cleaning a raw dataset by addressing common data quality issues such as:

Removing duplicates
Handling missing or NULL values
Standardizing data formats
Correcting data inconsistencies

The goal was to prepare the dataset for further analysis, ensuring its integrity and reliability.

🛠️ Tools & Technologies

SQL: Utilized SQL queries for data manipulation and cleaning.
MySQL Workbench: Executed SQL scripts and managed the database.
CSV Files: Worked with CSV files for importing and exporting data.

📁 Repository Structure

Dataset/: Contains the raw and cleaned datasets in CSV format.
Data Cleaning SQL Project Queries.sql: SQL script file with all the queries used for data cleaning tasks.

🔍 Key SQL Techniques Used

SELECT DISTINCT – To identify and remove duplicate records
IS NULL / IS NOT NULL – For detecting and handling missing values
UPDATE – To correct data inconsistencies
ALTER TABLE – For modifying table structures when necessary

🚀 Getting Started

To replicate this project:

Clone the repository:

git clone https://github.com/Shivangkus/Data-Cleaning-in-SQL.git

Open the Data Cleaning SQL Project Queries.sql file in MySQL Workbench.

Execute the SQL queries step by step to clean the dataset.

Import the cleaned dataset into your preferred analysis tool.

📈 Next Steps After cleaning the data, you can proceed with:

Exploratory Data Analysis (EDA) – To uncover patterns and insights

Data Visualization – Using tools like Tableau or Power BI

Statistical Analysis – For deeper understanding and modeling

📄 License This project is licensed under the MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Dataset		Dataset
Data Cleaning SQL Project Queries.sql		Data Cleaning SQL Project Queries.sql
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

Data Cleaning in SQL

🧹 Project Overview

🛠️ Tools & Technologies

📁 Repository Structure

🔍 Key SQL Techniques Used

🚀 Getting Started

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Shivangkus/Data-Cleaning-in-SQL

Folders and files

Latest commit

History

Repository files navigation

Data Cleaning in SQL

🧹 Project Overview

🛠️ Tools & Technologies

📁 Repository Structure

🔍 Key SQL Techniques Used

🚀 Getting Started

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages