Skip to content

Cleaning sales data in Excel and using SQL to query the data for analysis.

Notifications You must be signed in to change notification settings

stevemorrisbi/DataCleaningExcelandSQL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 

Repository files navigation

Excel and SQL - Cleaning and Analysis Project

Overview: Working in Excel to clean sales data and using SQL to derive insights.

Note - All data in this public respository has been redacted to ensure confidentiality.

Steps taken to clean the data in Excel:

  • The CSV contained NULLS for non distinct entries aside from 'order number' and 'user i.d'. NULL data needed to be replaced.
  • Removal of unnessary columns before importing into SQL.
  • Use of the Proper function to ensure unifrom casing for all columns.
  • Replacement of diacritics to standard English characters.
  • Removal of £ symbol in currency columns - changing the format to general.

Excel data cleaning blur


Steps taken to derive insights in SQL:

A commented SQL script can be found in this repository.

It displays the uses cases for -

SELECT, WHERE, Aliases, SUM, MIN, MAX, AVG, HAVING, COUNT, GROUP BY, ORDER BY, TempTables, DISTINCT, NOT NULL, LIKE and Wildcards.

About

Cleaning sales data in Excel and using SQL to query the data for analysis.

Resources

Stars

Watchers

Forks

Languages