Skip to content

Implemented an Analytical Data Architecture (ADA) to create a single source of truth (SSOT) from multiple data sources such as CSV files and SQL databases for IMDb and the Numbers datasets

Notifications You must be signed in to change notification settings

mirajkarani/IMDb-Data-Warehousing-BI

Repository files navigation

IMDb Data Warehousing & Business Intelligence

• Designed and implemented an Analytical Data Architecture (ADA) for data profiling, cleansing and ETL operations from multiple data sources such as CSV files and SQL databases
• Implemented data profiling and staging in Alteryx Designer and integrated data into DI schema (dimensional data model) using Talend Open Studio to create a single source of truth
• Generated insights using data visualization tools for total revenue generated and top-grossing movies of all time

Architecture diagram

alt text

Entity Relationship Diagrams

Data Staging

alt text

Data Integration

alt text

Technologies Used

• Talend Open Studio
• Alteryx Designer
• SQL (MySQL, SQL Server, PostgreSQL)
• BI Tools (Tableau, Power BI)

BI Dashboard (Tableau)

alt text

About

Implemented an Analytical Data Architecture (ADA) to create a single source of truth (SSOT) from multiple data sources such as CSV files and SQL databases for IMDb and the Numbers datasets

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published