Skip to content
View caiocasagrande's full-sized avatar
Block or Report

Block or report caiocasagrande

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
caiocasagrande/README.md

Caio Casagrande 👋🏼

LinkedIn GMail Data Portfolio Data Portfolio

About Me

I am currently a Data Analyst and Researcher on a CNPq project, studying the expansion of dengue fever in the Brazilian Legal Amazon to identify spatiotemporal patterns. Previously, I analyzed the dynamics of the ethanol market for Ipiranga's intelligence sector. With a strong academic background in mathematics, statistics, and econometrics, alongside a passion for data science, I'm looking for opportunities to contribute to companies' business decision-making processes in the data area.

Academic Background

  • Master's degree in Applied Economics with a focus on time series and spatial econometrics. The dissertation analyzes how Brazilian soybean exports relate to macroeconomic variables, providing insights into the economic impacts of global changes.

  • Bachelor's degree in Agricultural Engineering with a focus on agribusiness. For my final project, I used R to create price forecasting models for key agricultural commodities using the ARIMA methodology, published in the Journal of Economics and Agribusiness.

Professional Experiences

During my time at Ipiranga, I was responsible for analyzing and communicating market information, particularly focusing on ethanol market dynamics for the Supply & Trading team. In addition to my data expertise, I'm proficient in English and experienced in using the Microsoft Office suite. I'm a proactive individual that is willing to learn new skills and work in a fast-paced and collaborative environment.

Current Focus

Presently, I'm involved in a project aimed at understanding the recent expansion of dengue fever in the Brazilian Legal Amazon. This involves exploring how various social determinants of health—economic, social, environmental, geographic, and cultural factors—have contributed to the spread of the disease. My role includes identifying space-time trends and analyzing data to uncover insights crucial for addressing this public health challenge.

I am currently seeking opportunities to transition into a professional role in the data field, where I can contribute to companies' business decision-making processes.

My Work

Here I highlight some of the projects I have worked on throughout my professional career.

Econometrics using Python

In addition to my data science work, I also have a strong interest in econometrics. Here are some of my econometric projects:

These projects showcase my expertise in applying econometric techniques to real-world problems, particularly in forecasting and demand analysis.

Data Science Portfolio

I'm passionate about leveraging data to solve business problems. Here are some of my recent projects:

Classification Projects

  • InStyle - Customer Classification: Used different Machine Learning algorithms to train and predict customer satisfaction. The project provided insights and recommendations to classify customers and predict dissatisfaction, enabling InStyle to address customer satisfaction issues and improve overall experience.

  • Cardiovascular Disease Detector: Created a model to enhance the accuracy of cardiovascular disease diagnosis, improving from 65% to 75% accuracy, resulting in a significant revenue increase for Cardio Catch Disease.

Regression Projects

  • Sales Forecasting: Employed various Machine Learning models and chose the best performing model to forecast sales for Rossmann stores. The project provided a comprehensive step-by-step Data Science project to optimize the resources allocation to renovate Rossmann stores.

  • Taxi Destination Prediction: Developed a predictive model to infer the final destination of taxi rides, reducing empty mileage and optimizing operational planning.

Clustering Projects

  • Customer Clustering for Marketing Campaign: The goal is to enhance business and marketing strategies by clustering customers using the K-Means algorithm. This helps understand customer behaviors and preferences, enabling personalized strategies for each group. Benefits include more effective marketing, better customer experience, and optimized resource allocation for maximum business impact.

Recommendation System

  • Beyond "You May Also Like": Understanding Consumer Choices: Conducted an Exploratory Data Analysis to understand purchases, products, consumer temporal behavior, and the price-quantity relationship. Used Market Basket Analysis to identify association patterns among products, classifying those frequently bought together, and Recommendation System to generate product suggestions for customers based on their similarity to others. This project equips Rex London with information to enhance customer experience through personalized recommendations, potentially increasing sales.

Finance Projects

  • Credit Score: Built an internal risk model to optimize profitability and business security while balancing market expansion, providing key insights on loan approval strategies through credit score decile analysis. Streamlit.

  • Fraud Detection: Developed a fraud detection model using Exploratory Data Analysis, data preparation techniques, and various Machine Learning models, achieving high accuracy and recall to confidently identify fraud.

  • Credit Score Analysis: Conducted a comprehensive credit score analysis to increase bank profitability by implementing and comparing Machine Learning models, resulting in a detailed client decile classification for optimal profit and risk management.

Data Analysis Projects

  • Exploratory Data Analysis for Olist: Performed a comprehensive exploratory data analysis for Olist, uncovering key business insights related to consumer behavior, customer satisfaction, sales patterns, and regional differences across states. The analysis provided quality and actionable informations to support strategic decision-making.

All my projects are structured around solving business problems, employing regression, clustering, and classification algorithms to deliver optimal solutions. You can read more about them on my Portfolio Page and find the codes on their repositories.

Technical Skills


  • Programming Languages: Python, R, SQL
  • Feature Engineering: Encoding, Scaling, Imputers, Resampling
  • Machine Learning: Regression, Classification, Clustering, Cross-Validation, Hyperparameter Tuning, Random Search
  • Metrics: Confusion Matrix, Accuracy, Precision, Recall, F1 Score, MAE, MAPE, RMSE, R²
  • Visualization: Power BI, Streamlit Dashboards
  • Cloud: AWS (Glue, S3, Athena, SageMaker)
  • Professional Skills: Advanced English, Microsoft Office

Pinned Loading

  1. arima_revenue arima_revenue Public

    Implementation and comparison of ARIMA models to forecast Olist revenue for the next 14 days.

    Jupyter Notebook 1

  2. customer_behavior customer_behavior Public

    Beyond "You May Also Like": Deconstructing Consumer Choices with Data Science

    Jupyter Notebook

  3. mkt_campaign_clustering mkt_campaign_clustering Public

    Projeto de clusterização de clientes com base em campanhas de marketing.

    Jupyter Notebook

  4. instyle instyle Public

    A loja de moda InStyle é uma grande loja de roupas, mas enfrenta desafios significativos em relação à experiência do cliente. Em vista disso, a InStyle montou uma equipe com a tarefa de treinar um …

    Jupyter Notebook 1