Python - Libraries: NumPy, Pandas, Scikit-Learn; Supervised & Unsupervised Machine Learning Models: Regression, Classification, Clustering; Deep Learning: Keras, TensorFlow
Statistical Analysis - Hypothesis testing, p-value, Confidence level
Data Visualization - Matplotlib, Seaborn, Tableau, Looker
SQL - SQL, SQLite, SQLAlchemy
AWS - Cloud Practitioner Certified, SageMaker
Project 3 (in progress) link
Evaluating the factors that drive a movie's revenue. The dataset covers movies released between 2012 and 2021, with features such as budget, revenue, ratings, and certification. Data was collected through API calls. The predictive model is still in progress.
Cervical Cancer Prediction link
This project was designed to predict cervical cancer from a patient's behavior, age, and medical history. The model explained 96% of the variation in the predictions, with a 21% error rate for false negatives.
Data was prepared and cleaned, and exploratory data analysis (EDA) revealed interesting trends, all confirmed by studies published on cancer.gov. Modeling: three models were applied (Random Forest, KNN, and LightGBM). LightGBM had the best results and was chosen as the final model.
Sales Prediction Model link
This project was created to evaluate whether the collected data would be useful for predicting prices. Unfortunately, the data had very low correlation with, and little relevance to, the prediction.
Data was prepared and exploratory data analysis (EDA) was done. Modeling: three models were applied (Linear Regression, Decision Tree, and Random Forest). Random Forest had the best results, but they were still not meaningful for the prediction.
Data Science Certificate, Remote — Coding Dojo - Certificate June 2022 - October 2022
This data science bootcamp is a deep dive into the fundamentals of data science and machine learning with Python. Throughout the course, students gain a comprehensive understanding of the entire data science process end to end, including data prep, data analysis and visualization, as well as how to properly apply machine learning algorithms to various situations or tasks. They’ll also walk away with a portfolio of projects showcasing their data science certification.
AWS Cloud Practitioner, Remote — AWS Skill Builder - Badge October 2022
Earners of this certification have a fundamental understanding of IT services and their uses in the AWS Cloud. They demonstrated cloud fluency and foundational AWS knowledge. Badge owners are able to identify essential AWS services necessary to set up AWS-focused projects.
Tableau, Remote — Udemy Certificate September 2022 - October 2022
Tableau 2022 A-Z: Hands-On Tableau Training for Data Science - my Tableau Page
Faculdade Oswaldo Cruz, São Paulo, Brazil - Postgraduate Course in Industrial Processes June 2012 - July 2014
Universidade Estadual de Maringá, Paraná, Brazil - B.S. Chemistry February 2004 - December 2008
Hi-Tec Enterprises, Oxnard, CA — General Management Feb 2016 - PRESENT
- Accounting
- Logistics
- Customer Service
I transitioned the company’s records from an outdated platform to QuickBooks, making all information more accessible and organized. The transition saved time in everyone's daily routine and gave the company a more streamlined setup.
Qualitec Inc., São Paulo, Brazil - Industrial Chemist 2008 - 2014
- Industrial Processes Supervisor
- Customer Service Advisor
- Supply Chain Compliance Management
- ISO 9001 Maintenance
Managed production, quality control, suppliers, and customers. I increased the company's profit by reevaluating process costs and dropping customers who requested prices below the established margin. That also reduced production hours, lowering machinery use and overtime payments.
Fluent - English and Portuguese (BR)
https://www.linkedin.com/in/paulapipkin/
Ventura, CA 93003
(805) 212 2893