Skip to content

AdventureWorksDW project combines Python & SQL for EDA on sales data. Clean, visualize & analyze sales trends, customer behavior, product performance, geography & correlations. Insights for data-driven decisions & business optimization.

Notifications You must be signed in to change notification settings

dekoma4u/Python_with_SQL_for_AdventureWorksDW_Database_Data_Analysis

Repository files navigation

AdventureWorksDW Database Data Analysis with Python and SQL

Introduction

In this project, I connected to the AdventureWorksDW database using Python and performed exploratory data analysis (EDA) by querying the database with SQL. The AdventureWorksDW database is a sample database from Microsoft that simulates a fictional company's sales data.

Expertise

As a data analyst, I possess expertise in Python, SQL, and data analysis. I leveraged Python's pandas library to manipulate and analyze the data, and used pyodbc to connect to the AdventureWorksDW database. My skills in both Python and SQL allowed me to perform in-depth analysis of the data and gain valuable insights.

Repository Overview

In my GitHub repository Python_with_SQL_for_AdventureWorksDW_Database_Data_Analysis, you will find the following:

  1. README.md: Provides an overview of the project, its objectives, and instructions on how to run the code.

  2. analysis.ipynb: A Jupyter Notebook containing the Python code for connecting to the database, executing SQL queries, and performing exploratory data analysis.

  3. data/: Contains the necessary .env file to securely store the database credentials.

10 Business Questions Explored

Throughout the analysis, I answered the following 10 business questions:

  1. Find the top 5 customers with the highest total sales amount.
  2. Calculate the total revenue generated by each product category.
  3. Identify the top 3 salespeople based on the number of orders they have processed.
  4. Calculate the total profit for each subcategory of products.
  5. Determine the average order processing time for each month in a given year.
  6. Find the top 10 most popular products based on the number of units sold.
  7. Calculate the average discount percentage offered for each product category.
  8. Identify the customers who have made purchases in all the available sales territories.
  9. Find the top 5 products that have experienced the highest growth in sales revenue over the last quarter.
  10. Calculate the total revenue generated by each customer for each quarter of the year.

Explore the Code

Feel free to explore the code and analysis in the Jupyter Notebook provided in the repository. If you have any questions or suggestions, please don't hesitate to reach out. Happy exploring!

Conclusion

This project showcases my ability to leverage Python and SQL for data analysis, providing valuable insights into the AdventureWorksDW database. I hope you find the analysis informative and insightful. Enjoy!

About

AdventureWorksDW project combines Python & SQL for EDA on sales data. Clean, visualize & analyze sales trends, customer behavior, product performance, geography & correlations. Insights for data-driven decisions & business optimization.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published