Analyzed and categorized an IMDB data set through Kaggle to gain insights about entertainment content for users.
• Employed Python Pandas to cleanse and optimize the dataset, which involved eliminating extraneous spaces, converting arrays to text and converting text to integers before transferring it to a PostgreSQL #database table.
• Utilized SQL to clean data through anomaly detection, applied content filters, executed case statements, and joins for data enrichment, employed aggregate functions and subqueries for in-depth analysis, and documented all queries in a constants file.
• Connected Postgres to Power BI and made visualizations (tree maps, bar chart, line chart, pie chart) of insights
Image of Power BI Dashboard at power_bi_dashboard.png (https://github.com/vanxLord90/movie_insights/blob/main/power_bi_dashboard.png)