Experienced Data Scientist with 14 years of expertise in data science, data visualization, product management, and data management. Proven ability to develop and implement strategies for data-centric product development and data-driven decision-making. Applying for the position of Senior Data Scientist to leverage expertise refocus on emerging ai, advanced analytics, and model driven products and solutions.
Date | Type | Repo | Description | Status |
---|---|---|---|---|
2023 Sept | Python Unsupervised Learning | news-articles-categorization | A report focused on modeling news artical categorization for BBC News focused on the application of natural language processing, unsupervised learning with matrix factorization, and a comparison to supervised learning. | In Progress |
2023 Sept | Supervised Deep Learning | marketing_text_classification | A report focused on modeling news article categorization for marketing analytics. This notebook focuses on the application of natural language processing, supervised learning with k-train (a wrapper for Tensorflow, Keras, and Huggingface Transformers), and a evaluation of performance. | Completed |
2023 Aug | Python Supervised Learning | customer-churn-prediction | In this report, I will play the role of data scientist. Stepping out of my business-facing role and working with a similar model that was created to predict customer attrition. While I cannot use proprietary business data for this analysis, I will find and use a publicly available customer churn dataset to emulate a similar customer context. I will also use the Random Forest classifier package, similar to the model that was implemented at the company. | Completed |
2023 April | Python Data Vizualization | consumer-price-index | While the CPI news release and charts are thorough, they focus on the top-level CPI number. This is effective at describing the why behind inflation, but given the amount of aggregation, this package falls short of successfully communicating the βso whatβ of changes in CPI and inflation to the average person. | Completed |
2023 April | R Analysis | nypd-shooting | Analysis of NYPD Shooting Incident Data to identify factors and trends contributiong to shootings in New York City. | Completed |
2023 April | R Analysis | covid-19 | Analysis of COVID-19 data for global and US datassets. Exploratory data anlaysis in R to identify how the data interacts and connects. | Completed |
Date | Repo | Description | Status |
---|---|---|---|
2023 Aug | The 4 Cs of Data Governance Measurement | A framework introducing Capability, Capacity, Competency, and Compliance to guide data strategy and highlight improvement areas within enterprises. | Completed |