Hi I 'm Franck. I have a background in Business Admnistration, but I switched to Information technology after falling in love with the power of analytics during my time as an real estate admnistrative assistant. This is when I was introduced to the world of databases and the power SQL.
Since then, I decided to immerse myself into tech and to become a data analyst. I got one step closer to my goal by obtaining a master's degree in Management Information Systems with a concentration in project management.
Now, I have a created this repository to showcase skills, share projects and track my progress in Data analytics/Data engineering related topics.
-
Python Projects
-
SQL Projects
Goal: Creating a data pipeline for the analysis of Walmart's supply and demand around the holidays.
Description: The project focused on building a data pipeline using custom functions to extract, transform, aggregates, and loads e-commerce data. I used the data from the grocery_sales table in PostgreSQL and from the extra_data parquet file.
Code: Building a retail data pipeline
Skills: Data cleaning, data analysis, data engineering
Technology: Python, SQL
Goal: Cleaning a bank marketing dataset
Description: During this project, I used many data cleaning techniques to perform operations on the bank marketing dataset for processing among which:
- Data types conversion
- Missing values replacement
- Split of the dataset into three subsets:
Code: Cleaning Bank Marketing Data
Skills: Data cleaning
Technology: Python
Description: The product_emissions table contains product carbon footprints (PCFs) for various companies. The records include the greenhouse gas emissions attributable to given product measured in CO2.
Code: Analyzing Industry Carbon Emissions
Analysis: Based on the query's result, the worst offenders, as far as global emissions, are:
- The materials industry with a product carbon footprint of 107129
- In second position, we have the capital goods industry with a PCF of 94942.7, which is three times more than the next industry on the list (Technology Hardware & Equipment).
Skills: Data Manipulation
Technology: SQL
Goal: The project consists in finding out if international students have a higher risk of mental health difficulties than the general population.
Description: A Japanese international university surveyed its students in 2018 and published a study the following year. The study found that international students have a higher risk of mental health than the general population. I will explore the Students data data using PostgreSQL to find out if I would come to a similar conclusion for international students.
Code: Analyzing Students Mental Health
Analysis: The queries return the average depression, social connectedness, and accultural stress for both domestic and international students. A side-by-side comparison indicates the following:
- On average, domestic students scored higher on depression, social connectedness.
- The international students averaged more on accultural stress, as expected.
Based on these results, We cannot support the thesis claiming that the international students have higher risk of mental health difficulties.
Skills: Data Manipulation, Data analysis
Technology: SQL