Skip to content
View KattsonBastos's full-sized avatar
:octocat:
:octocat:

Organizations

@builtcode-git @owshq-plumbers @koguitec
Block or Report

Block or report KattsonBastos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
KattsonBastos/README.md

author GPLv3 license contributions welcome Image License

Kattson Bastos

This repo presents some skills and practical projects I've been working on since I started my journey in the IT and data fields.

A Data Lover in the realm of ML and Data Engineering, DevOps and IT in general.

🔗 Links

  • Linkedin Badge
  • Gmail Badge

Data Engineering:

1. Streaming Pipeline with Apache Beam and Dataflow
drawing

Building a streaming data pipeline to load real-time users data into Big Query so the marketing and analytics teams can work on products offering and customer segmentation for faster business value generation.

Skills: GCP, Dataflow, Apache Beam, Pub/Sub, Terraform, Big Query.


2. Daily Covid 19 ELT With Modern Data Stack
drawing

Our team was asked to implement a simple ELT pipeline in order to provide daily Covid 19 data to the BI team so they workk on their analytics and take decisions.

Skills: Airflow, Dbt, Snowflake, Airbyte, Docker


Data Science:

1. Store Sales Prediction
drawing

Building an end-to-end solution for a six weeks sales forecast of a pharmacy chain using Machine Learning. The predictions can be accessed by a bot on Telegram.

Skills: Machine Learning, Time Series, Heroku, API, Bot


2. Prioritizing Customers for Insurance Cross-Sell (Ongoing)
drawing

Predicting whether or not the customer would be interested in auto insurance so the sales can be optimized.

Skills: Machine Learning, Heroku, API, Streamlit


3. Cardiovascular Disease Detection (Ongoing)
drawing

Building a Machine Learning Model to detect cardiovascular disease in early stages leverage the diagnostic precision made by health professionals.

Skills: Machine Learning, Heroku, API, Streamlit


Data Anaysis / Insights:

1. Education Dataset Analysis (Brazilian ENADE)
drawing

Analyzing data from a brazilian performance's valuation of students. The analysis is focused on the state of Bahia.

Skills: Data Visualization, Data Processing


2. Analysis of Election Data

Analyzing data from Vitoria da Conquista - Ba's 2020 elections. The focus is on the economic and the social profile of candidates.

Skills: Data Visualization, Data Processing



Pinned

  1. rossmann_sales_prediction rossmann_sales_prediction Public

    Jupyter Notebook 3

  2. cardio_disease_detection cardio_disease_detection Public

    Jupyter Notebook

  3. health_insurance_cross_sell health_insurance_cross_sell Public

    Jupyter Notebook

  4. bahia_enade18_analysis bahia_enade18_analysis Public

    Jupyter Notebook 2