Hi, I'm Yordi! I'm a Computer Technology Engineering student with a strong passion for data and a growing portfolio of self-driven projects across industries like education and sales. Although my major isn't directly focused on data analysis, I've built a solid foundation in Python and statistics, and I've complemented that with hands-on experience using Excel, SQL, and Tableau.
What sets me apart is my ability to go beyond the numbers—transforming complex datasets into clear, compelling insights that stakeholders can act on. I thrive at the intersection of analysis and creativity, combining technical skills with thoughtful storytelling to ensure every insight has impact.
In a world increasingly shaped by data, I'm excited to help organizations make smarter decisions by uncovering the patterns that matter most. This portfolio showcases projects where I’ve cleaned, analyzed, and visualized data to solve real-world problems, always with the end user in mind.
This is a repository to showcase skills, share projects and track my progress in Data Analytics.
My CV in pdf.
-
- Excel / Google Sheets
- Python
- SQL
- R
- Tableau---> go to Tableau..
- Power BI
In this section I will list data analytics projects briefly describing the technology stack used to solve cases.
Code: Analyzing the Factors Contributing to the Success of a Movie.ipynb
Goal: To determine what factors contribute the most to a movie's success.
Description: The project focused on analyzing a dataset of movies released between 1980 and 2022. The dataset included movie titles, ratings, genres, release dates, budgets, gross earnings, and other relevant information. The project involved loading the data, cleaning and preprocessing it, performing exploratory data analysis (EDA), analyzing the correlation between budget and gross earnings, and implemented Pearson’s correlation statistical analysis.
Skills: data cleaning, data analysis, correlation matrices, hypothesis testing, data visualization.
Technology: Python, Pandas, Numpy, Seaborn, Matplotlib, SciPy.
Results: Using Python functions the analysis revealed that votes and budget have the highest correlation with gross earnings, while the company has no significant correlation.
Goal: To examine the sales history of the store and extract insights on its performance, as well as to identify potential improvements that can be implemented.
Code: Tech Store Sales Analysis.ipynb
Description: The dataset contains a list of sales records. The records include the products for sale and order information(order id, order date, price, quantity ordered and purchase address). The project includes the following steps: data loading, data cleaning and preprocessing, EDA (exploratory data analysis), analyzing sales data and hypothesis testing.
Skills: data cleaning, data analysis, hypothesis testing, data visualization.
Technology: Python, Pandas, Matplotlib.
Results: Python functions that calculated and visually presented the sales data by month, city, and the most commonly sold items. Additionally, the reasons for the high frequency of these items being sold were analyzed and provided as insights.
Code: Data Cleaning Project Queries: Nashville Housing.sql
Description: The dataset contains a list of houses that have been sold in Nashville between 2013 and 2019. This project includes the following steps: data loading, data cleaning and preprocessing.
Skills: DML(Data Manipulation Language), DQL (Data Query Language), DDL (Data Definition Language).
Technology: SQL Server
Code: COVID Portfolio Project.sql
Description: The dataset contains records of Covid-19 cases, deaths and vaccine records by country in 2020-2021. This project includes the following steps: data loading, data cleaning and preprocessing and EDA (exploratory data analysis).
Skills: Joins, CTE's, Temp Tables, Windows Functions, Aggregate Functions, Creating Views, Converting Data Types
Technology: SQL Server
Goal: To predict Pokémon status based on their characteristics and rank their importance in determining whether a Pokémon is classified as legendary.
Code: Legendary Pokémon Analysis (Study Project)
Description: The dataset contains a list of Pokémon. The records include their characteristics such as attack, defense, type and size. The project includes the following steps: data loading, data cleaning EDA (exploratory data analysis), analyzing characteristics of different Pokémon.
Skills: data cleaning, data analysis, data visualization.
Technology: Tidyverse
University of Colorado, Boulder: Master of Science - MS, Data Science, Dec 2022 - Dec 2024
The University of the West Indies, Mona: Bachelor's degree, Biochemistry and Molecular Biology, 2019 - 2022
Pre University School: Associate's degree, Science and Mathematics, 2017 - 2019
The best way to showcase skills is by doing and sharing your job done but sometimes certificates appear to be as an indirect result. Here's a list of the ones I have (in reverse-chronological order, with the date of completion in brackets):
- Google Data Analytics Professional Certificate (Sep 2022) (Coursera - Google)
- Tableau (Oct 2022) (Coursera - University of California, Davis)
- Python for Data Science (Feb 2023) (Coursera - University of Colorado Boulder)
- LinkedIn: @yordipolancopujols
- Email: y.polanco314@gmail.com
- Phone: +1 (809) 319-7839