I am currently a Data Analyst and Researcher on a CNPq project, studying the expansion of dengue fever in the Brazilian Legal Amazon to identify spatiotemporal patterns. Previously, I analyzed the dynamics of the ethanol market for Ipiranga's intelligence sector. With a strong academic background in mathematics, statistics, and econometrics, alongside a passion for data science, I'm looking for opportunities to contribute to companies' business decision-making processes in the data area.
-
Master's degree in Applied Economics with a focus on time series and spatial econometrics. The dissertation analyzes how Brazilian soybean exports relate to macroeconomic variables, providing insights into the economic impacts of global changes.
-
Bachelor's degree in Agricultural Engineering with a focus on agribusiness. For my final project, I used R to create price forecasting models for key agricultural commodities using the ARIMA methodology, published in the Journal of Economics and Agribusiness.
During my time at Ipiranga, I was responsible for analyzing and communicating market information, particularly focusing on ethanol market dynamics for the Supply & Trading team. In addition to my data expertise, I'm proficient in English and experienced in using the Microsoft Office suite. I'm a proactive individual that is willing to learn new skills and work in a fast-paced and collaborative environment.
Presently, I'm involved in a project aimed at understanding the recent expansion of dengue fever in the Brazilian Legal Amazon. This involves exploring how various social determinants of health—economic, social, environmental, geographic, and cultural factors—have contributed to the spread of the disease. My role includes identifying space-time trends and analyzing data to uncover insights crucial for addressing this public health challenge.
I am currently seeking opportunities to transition into a professional role in the data field, where I can contribute to companies' business decision-making processes.
Here I highlight some of the projects I have worked on throughout my professional career.
In addition to my data science work, I also have a strong interest in econometrics. Here are some of my econometric projects:
-
Using ARIMA to Forecast Olist Revenue: Applied the Autoregressive Integrated Moving Average model to predict Olist's revenue for the next 14 days, providing the company with robust forecasting models to improve financial planning and resource optimization. Read more on Medium.
-
Price Elasticity of Demand Analysis: Analyzed price-demand elasticity to understand how price variations impact consumer demand for elastic products. This project provides a brief overview of how this economic theory can be applied to real-world scenarios and be a valuable resource for businesses. Explore the insights with StreamlitApp.
These projects showcase my expertise in applying econometric techniques to real-world problems, particularly in forecasting and demand analysis.
I'm passionate about leveraging data to solve business problems. Here are some of my recent projects:
-
InStyle - Customer Classification: Used different Machine Learning algorithms to train and predict customer satisfaction. The project provided insights and recommendations to classify customers and predict dissatisfaction, enabling InStyle to address customer satisfaction issues and improve overall experience.
-
Cardiovascular Disease Detector: Created a model to enhance the accuracy of cardiovascular disease diagnosis, improving from 65% to 75% accuracy, resulting in a significant revenue increase for Cardio Catch Disease.
-
Sales Forecasting: Employed various Machine Learning models and chose the best performing model to forecast sales for Rossmann stores. The project provided a comprehensive step-by-step Data Science project to optimize the resources allocation to renovate Rossmann stores.
-
Taxi Destination Prediction: Developed a predictive model to infer the final destination of taxi rides, reducing empty mileage and optimizing operational planning.
- Customer Clustering for Marketing Campaign: The goal is to enhance business and marketing strategies by clustering customers using the K-Means algorithm. This helps understand customer behaviors and preferences, enabling personalized strategies for each group. Benefits include more effective marketing, better customer experience, and optimized resource allocation for maximum business impact.
- Beyond "You May Also Like": Understanding Consumer Choices: Conducted an Exploratory Data Analysis to understand purchases, products, consumer temporal behavior, and the price-quantity relationship. Used Market Basket Analysis to identify association patterns among products, classifying those frequently bought together, and Recommendation System to generate product suggestions for customers based on their similarity to others. This project equips Rex London with information to enhance customer experience through personalized recommendations, potentially increasing sales.
-
Credit Score: Built an internal risk model to optimize profitability and business security while balancing market expansion, providing key insights on loan approval strategies through credit score decile analysis. Streamlit.
-
Fraud Detection: Developed a fraud detection model using Exploratory Data Analysis, data preparation techniques, and various Machine Learning models, achieving high accuracy and recall to confidently identify fraud.
-
Credit Score Analysis: Conducted a comprehensive credit score analysis to increase bank profitability by implementing and comparing Machine Learning models, resulting in a detailed client decile classification for optimal profit and risk management.
- Exploratory Data Analysis for Olist: Performed a comprehensive exploratory data analysis for Olist, uncovering key business insights related to consumer behavior, customer satisfaction, sales patterns, and regional differences across states. The analysis provided quality and actionable informations to support strategic decision-making.
All my projects are structured around solving business problems, employing regression, clustering, and classification algorithms to deliver optimal solutions. You can read more about them on my Portfolio Page and find the codes on their repositories.
- Programming Languages: Python, R, SQL
- Feature Engineering: Encoding, Scaling, Imputers, Resampling
- Machine Learning: Regression, Classification, Clustering, Cross-Validation, Hyperparameter Tuning, Random Search
- Metrics: Confusion Matrix, Accuracy, Precision, Recall, F1 Score, MAE, MAPE, RMSE, R²
- Visualization: Power BI, Streamlit Dashboards
- Cloud: AWS (Glue, S3, Athena, SageMaker)
- Professional Skills: Advanced English, Microsoft Office