Data scientist in training with hands-on experience at a high-growth startup, working end-to-end across predictive modeling, business intelligence, and LLM-based automation.
Currently a Data Science Intern @ InstaCarro | B.Sc. Statistics @ UFSCar
Languages
Python · SQL · R
Libraries & Tools
pandas · NumPy · scikit-learn · Matplotlib · Seaborn · statsmodels · Metabase · Git
Competencies
Machine Learning · Predictive Modeling · Clustering · Factor Analysis
Product Analytics · CRM Analytics · A/B Testing · ETL · LLMs · Dashboards
Socioeconomic Determinants of Voting: 2000 Brazilian Elections
Factor analysis on public electoral data · R · Validated correlation between socioeconomic development and voting patterns
Multidimensional Analysis of Sanitation Quality in São Paulo (2013–2022)
Clustering analysis across all São Paulo municipalities · Python · K-means, DBSCAN, Hierarchical · 5 interpretable clusters identified