YordiPolanco/README.md

Yordi Polanco - Data Analyst Portfolio

About

Hi, I'm Yordi! I'm a Computer Technology Engineering student with a strong passion for data and a growing portfolio of self-driven projects across industries like education and sales. Although my major isn't directly focused on data analysis, I've built a solid foundation in Python and statistics, and I've complemented that with hands-on experience using Excel, SQL, and Tableau.

What sets me apart is my ability to go beyond the numbers—transforming complex datasets into clear, compelling insights that stakeholders can act on. I thrive at the intersection of analysis and creativity, combining technical skills with thoughtful storytelling to ensure every insight has impact.

In a world increasingly shaped by data, I'm excited to help organizations make smarter decisions by uncovering the patterns that matter most. This portfolio showcases projects where I’ve cleaned, analyzed, and visualized data to solve real-world problems, always with the end user in mind.

This is a repository to showcase skills, share projects and track my progress in Data Analytics.

My CV in pdf.

Table of Contents

Portfolio Projects

This section lists my data analytics projects, with a brief description of the goal, approach, and technology stack for each.

Analyzing the Factors Contributing to the Success of a Movie

Code: Analyzing the Factors Contributing to the Success of a Movie.ipynb

Goal: To determine what factors contribute the most to a movie's success.

Description: The project analyzes a dataset of movies released between 1980 and 2022. The dataset includes movie titles, ratings, genres, release dates, budgets, gross earnings, and other relevant information. The project involved loading the data, cleaning and preprocessing it, performing exploratory data analysis (EDA), analyzing the correlation between budget and gross earnings, and applying Pearson's correlation as the statistical test.

Skills: data cleaning, data analysis, correlation matrices, hypothesis testing, data visualization.

Technology: Python, Pandas, Numpy, Seaborn, Matplotlib, SciPy.

Results: Using Python functions, the analysis showed that votes and budget have the highest correlation with gross earnings, while the company shows no significant correlation.
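For readers unfamiliar with Pearson's correlation, here is a minimal sketch of the calculation behind the budget vs. gross comparison. It is not the notebook's code: the project lists SciPy, while this version uses only the standard library so the formula stays visible. The function name `pearson_r` and the sample figures are assumptions made up for illustration.

```python
import math


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Deviations of each value from its mean.
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    # Sum of cross-products and sums of squares (unnormalized covariance/variances).
    cov = sum(a * b for a, b in zip(dx, dy))
    var_x = sum(a * a for a in dx)
    var_y = sum(b * b for b in dy)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Invented figures (millions of USD), NOT taken from the project's dataset.
budget = [10, 25, 40, 60, 100]
gross = [30, 50, 120, 150, 310]

r = pearson_r(budget, gross)
print(f"Pearson r between budget and gross: {r:.3f}")
```

A value near 1 indicates a strong positive linear relationship, a value near -1 a strong negative one, and a value near 0 little linear relationship. On a real dataset, a library routine that also reports a p-value would likely be the more practical choice.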

Tech Store Sales Analysis

Goal: To examine the sales history of the store and extract insights on its performance, as well as to identify potential improvements that can be implemented.

Code: Tech Store Sales Analysis.ipynb

Description: The dataset contains a list of sales records. Each record includes the product sold and order information (order ID, order date, price, quantity ordered, and purchase address). The project includes the following steps: data loading, data cleaning and preprocessing, exploratory data analysis (EDA), sales analysis, and hypothesis testing.

Skills: data cleaning, data analysis, hypothesis testing, data visualization.

Technology: Python, Pandas, Matplotlib.

Results: I wrote Python functions that calculate and plot sales by month, by city, and by most commonly sold items. I also analyzed why these items sell so frequently and presented the reasons as insights.
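The aggregation described above can be sketched roughly as follows. This is an illustration, not the notebook's code: the notebook uses Pandas, while this version uses only the standard library. The field names, the `street, city, state zip` address format, the helper names `city_of` and `summarize`, and all sample values are assumptions.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical records; values are invented for illustration only.
orders = [
    {"order_date": "2019-01-15", "product": "USB-C Cable", "price": 11.95,
     "quantity": 2, "address": "12 Main St, Boston, MA 02215"},
    {"order_date": "2019-01-20", "product": "Headphones", "price": 99.99,
     "quantity": 1, "address": "7 Lake Ave, Austin, TX 73301"},
    {"order_date": "2019-12-03", "product": "USB-C Cable", "price": 11.95,
     "quantity": 3, "address": "98 Elm St, Boston, MA 02215"},
    {"order_date": "2019-12-24", "product": "Monitor", "price": 149.99,
     "quantity": 1, "address": "5 Pine Rd, Seattle, WA 98101"},
]


def city_of(address):
    """Extract the city from an address assumed to look like 'street, city, state zip'."""
    return address.split(",")[1].strip()


def summarize(records):
    """Return (revenue by month, revenue by city, units sold by product)."""
    by_month = defaultdict(float)
    by_city = defaultdict(float)
    units = defaultdict(int)
    for rec in records:
        revenue = rec["price"] * rec["quantity"]
        month = datetime.strptime(rec["order_date"], "%Y-%m-%d").month
        by_month[month] += revenue
        by_city[city_of(rec["address"])] += revenue
        units[rec["product"]] += rec["quantity"]
    return dict(by_month), dict(by_city), dict(units)


by_month, by_city, units = summarize(orders)
top_product = max(units, key=units.get)  # product with the most units sold
print(by_month, by_city, top_product)
```

With Pandas, the same grouping would typically be a `groupby` on the month or city column followed by `sum()`, and the resulting series can be plotted directly with Matplotlib.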

Nashville Housing Data Cleaning

Code: Data Cleaning Project Queries: Nashville Housing.sql

Description: The dataset contains a list of houses sold in Nashville between 2013 and 2019. This project includes the following steps: data loading, data cleaning, and preprocessing.

Skills: DML (Data Manipulation Language), DQL (Data Query Language), DDL (Data Definition Language).

Technology: SQL Server

Covid 19 Data Exploration

Code: COVID Portfolio Project.sql

Description: The dataset contains records of Covid-19 cases, deaths, and vaccinations by country in 2020-2021. This project includes the following steps: data loading, data cleaning and preprocessing, and exploratory data analysis (EDA).

Skills: Joins, CTEs, Temp Tables, Window Functions, Aggregate Functions, Creating Views, Converting Data Types.

Technology: SQL Server

Legendary Pokémon Analysis

Goal: To predict whether a Pokémon is legendary from its characteristics, and to rank those characteristics by how much they contribute to that classification.

Code: Legendary Pokémon Analysis (Study Project)

Description: The dataset contains a list of Pokémon. The records include characteristics such as attack, defense, type, and size. The project includes the following steps: data loading, data cleaning, exploratory data analysis (EDA), and analysis of the characteristics of different Pokémon.

Skills: data cleaning, data analysis, data visualization.

Technology: R, Tidyverse.

Education

University of Colorado, Boulder: Master of Science - MS, Data Science, Dec 2022 - Dec 2024

The University of the West Indies, Mona: Bachelor's degree, Biochemistry and Molecular Biology, 2019 - 2022

Pre University School: Associate's degree, Science and Mathematics, 2017 - 2019

Certificates

The best way to showcase skills is to do the work and share it, but certificates sometimes come as a by-product. Here is a list of the ones I have (in reverse chronological order, with the completion date in brackets):

Contacts
