The goal of this project is to analyze online retail sales data to gain insights into customer behavior, product performance, and sales trends.
The dataset for this project is generated using the Python Faker library. It includes the following tables:
- Customers: Contains information about the customers, such as customer ID, name, gender, age, country, and region.
- Products: Contains information about the products, including product ID, name, price, and category.
- Orders: Contains information about the orders, such as order ID, customer ID, order date, and delivery date.
- OrderItems: Contains information about the items in each order, including order ID, product ID, and quantity ordered.
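To make the relationships between the four tables concrete, here is a minimal sketch of a matching schema, created in an in-memory SQLite database. The column names follow the descriptions above; the snake_case spelling, the column types, and the key constraints are assumptions for illustration and may differ from what the project actually uses.

```python
import sqlite3

# Assumed schema: column names follow the table descriptions above;
# types and key constraints are illustrative guesses, not the project's own DDL.
SCHEMA = """
CREATE TABLE customers (
    customer_id INTEGER PRIMARY KEY,
    name        TEXT,
    gender      TEXT,
    age         INTEGER,
    country     TEXT,
    region      TEXT
);
CREATE TABLE products (
    product_id INTEGER PRIMARY KEY,
    name       TEXT,
    price      REAL,
    category   TEXT
);
CREATE TABLE orders (
    order_id      INTEGER PRIMARY KEY,
    customer_id   INTEGER REFERENCES customers(customer_id),
    order_date    TEXT,
    delivery_date TEXT
);
CREATE TABLE order_items (
    order_id   INTEGER REFERENCES orders(order_id),
    product_id INTEGER REFERENCES products(product_id),
    quantity   INTEGER
);
"""


def create_schema(conn: sqlite3.Connection) -> None:
    """Create the four tables on the given connection."""
    conn.executescript(SCHEMA)


conn = sqlite3.connect(":memory:")
create_schema(conn)

# List the tables that now exist, in alphabetical order.
tables = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]
print(tables)
```

Orders link to Customers through the customer ID, and OrderItems link each order to its products, so most analyses will join along those two paths.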
The project is structured as follows:
- data_generation.py: Python script to generate the synthetic dataset using the Python Faker library.
- Sales Analysis.sql: SQL queries to perform various analyses on the generated dataset.
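As a rough idea of what generating one of these tables with Faker can look like, the sketch below builds a few Customers rows. It is not the contents of data_generation.py: the function name `make_customers`, the region list, the age range, and the gender codes are all hypothetical. The import is guarded so the sketch still runs, with placeholder names, when Faker is not installed.

```python
import random

try:
    from faker import Faker  # third-party package: pip install Faker
except ImportError:
    Faker = None  # fall back to placeholder values so the sketch still runs

# Hypothetical region labels, used only for illustration.
REGIONS = ["North", "South", "East", "West"]


def make_customers(n: int, seed: int = 0) -> list[dict]:
    """Return n synthetic customer rows (a sketch, not the project's script)."""
    rng = random.Random(seed)
    fake = None
    if Faker is not None:
        Faker.seed(seed)  # make Faker's output reproducible
        fake = Faker()

    rows = []
    for customer_id in range(1, n + 1):
        rows.append(
            {
                "customer_id": customer_id,
                "name": fake.name() if fake else f"Customer {customer_id}",
                "gender": rng.choice(["F", "M"]),
                "age": rng.randint(18, 80),  # assumed age range
                "country": fake.country() if fake else "Unknown",
                "region": rng.choice(REGIONS),
            }
        )
    return rows


customers = make_customers(5)
for row in customers:
    print(row)
```

The resulting list of dictionaries can be passed to `pandas.DataFrame(customers)` and written out with `to_csv` if a CSV file is the desired export format.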
To use this project, follow the steps below:
- Clone the project repository: https://github.com/devGeepee/SQL-Sales-Analysis
- Install the required dependencies (Faker and Pandas).
- Run the data_generation.py script to generate the dataset.
- Import the generated dataset into your preferred database management system.
- Execute the SQL queries in Sales Analysis.sql to perform the desired analyses.
- Review the results and extract insights from the analysis.
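The import and query steps can be tried end to end with SQLite from Python, as in the sketch below. The table layouts, the hand-written sample rows, and the revenue-by-category query are all assumptions made up for illustration; the query is not taken from Sales Analysis.sql, and the rows stand in for the generated dataset.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Assumed table layouts, reduced to the two tables this query needs.
conn.executescript(
    """
    CREATE TABLE products (
        product_id INTEGER PRIMARY KEY, name TEXT, price REAL, category TEXT
    );
    CREATE TABLE order_items (
        order_id INTEGER, product_id INTEGER, quantity INTEGER
    );
    """
)

# Made-up sample rows standing in for the imported dataset.
conn.executemany(
    "INSERT INTO products VALUES (?, ?, ?, ?)",
    [
        (1, "Widget", 10.0, "Tools"),
        (2, "Gadget", 25.0, "Electronics"),
    ],
)
conn.executemany(
    "INSERT INTO order_items VALUES (?, ?, ?)",
    [
        (100, 1, 3),
        (100, 2, 1),
        (101, 2, 2),
    ],
)

# Illustrative analysis: total revenue per product category, highest first.
REVENUE_BY_CATEGORY = """
SELECT p.category, SUM(p.price * oi.quantity) AS revenue
FROM order_items AS oi
JOIN products AS p ON p.product_id = oi.product_id
GROUP BY p.category
ORDER BY revenue DESC
"""

results = conn.execute(REVENUE_BY_CATEGORY).fetchall()
for category, revenue in results:
    print(f"{category}: {revenue:.2f}")
```

With a real import, the `executemany` calls would be replaced by loading the generated files into the database of your choice; the query itself would run unchanged as long as the table and column names match.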
For any questions or inquiries, please contact godspowerobielum@gmail.com.