Skip to content

Creating Staging data from csv and tsv files and loading to make a Data Warehouse using Talend jobs and using the Data Warehouse for Business Insights using PowerBI and Tableau

Notifications You must be signed in to change notification settings

jayshilj/ETL-Pipelines-and-Business-Intelligence-on-IMDB-dataset

Repository files navigation

ETL-Pipelines-and-Business-Intelligence-on-IMDB-dataset

Medium Article

https://medium.com/@jayshil97/understanding-data-pipeline-integration-and-business-intelligence-efdd0016ebe4

To setup this project install the following softwares

Talend Real-Time Data Platform 7.1 SQL Server Developer Edition Microsoft SQL server Management Studio Tableau Microsoft PowerBI

  1. Run the following scripts in SSMS to setup the staging database

The Number - stage tables.sql stg imdb tables - core tables.sql stg imdb tables expanded part 2.sql stg_ml_tables.sql

  1. Open Talend and setup your database connections and input file connections

  2. When the connections are successfull run the final job (Runtime 75 mins)

Tableau Dashboards:- https://public.tableau.com/profile/jayshil.jain#!/vizhome/imdb_proj/Dashboard1

Powerbi Report:- https://drive.google.com/drive/u/3/folders/1H17XUFnp5ZuDNIgjNpxftKLsljYpRR5V

Data Warehouse using ER Studio

Talend Jobs Sample

PowerBI Jobs Sample

Tableau Jobs Sample

About

Creating Staging data from csv and tsv files and loading to make a Data Warehouse using Talend jobs and using the Data Warehouse for Business Insights using PowerBI and Tableau

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages