jitsejan/README.md

👋 Hello, I'm Jitse-Jan!

🚀 Lead Data Engineer | Cloud & ETL Specialist | FinTech & Startups
I lead data teams in FinTech, focusing on modern data infrastructure, scalable ETL pipelines, and cloud-native solutions. Over the past decade, I've built several data platforms and the teams behind them, delivering end-to-end solutions together with engineers, data scientists, analysts, and ML engineers.


🔍 What I Do

  • Building scalable ETL pipelines using Dagster, dbt, and Kubernetes.
  • Designing cloud-native data solutions with Azure & AWS.
  • Developing API connectors with dlt, DuckDB, and Polars.
  • Modernizing legacy Java data pipelines to Python-based architectures.

🛠 Tech Toolbox

Programming & Query Languages:
Python, SQL

Cloud Platforms:
AWS, Azure

Data & Transformation Tools:
dbt, DuckDB, Polars, Dagster

Development & Infrastructure:
Kubernetes, Terraform, Docker


🚀 Currently Working On

Building a new architecture that migrates from a multi-cloud to a single-cloud setup, with a focus on:

  • Modular data architecture with clear separation of bronze/silver/gold/semantic layers
  • LLM-ready data platforms optimized for AI/ML workloads
  • Cloud-native ETL using modern data stack principles
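
As a rough illustration of the bronze/silver/gold separation listed above, here is a minimal sketch in plain Python. The record fields, the sample data, and the function names `to_silver` and `to_gold` are hypothetical and chosen only for this example; a real platform would likely implement these layers with tools such as dbt and DuckDB rather than in-memory lists.

```python
from collections import defaultdict

# Bronze: hypothetical raw records, kept exactly as they arrived (all strings).
BRONZE = [
    {"id": "1", "customer": " Alice ", "amount": "10.50"},
    {"id": "2", "customer": "bob", "amount": "not-a-number"},
    {"id": "3", "customer": "Alice", "amount": "4.50"},
]


def to_silver(rows):
    """Silver: clean and type the bronze rows, skipping rows that fail validation."""
    silver = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            # Assumption: invalid rows are skipped here; a real pipeline
            # would more likely quarantine them for inspection.
            continue
        silver.append(
            {
                "id": int(row["id"]),
                "customer": row["customer"].strip().title(),
                "amount": amount,
            }
        )
    return silver


def to_gold(rows):
    """Gold: aggregate the silver rows into a reporting-ready total per customer."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["customer"]] += row["amount"]
    return dict(totals)


silver = to_silver(BRONZE)
gold = to_gold(silver)
print(gold)
```

Each layer reads only from the layer below it, so a cleaning rule can change in `to_silver` without touching the raw data or the aggregation logic.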

πŸ† Recent Achievement

Built a scalable Kubernetes architecture leveraging dlt, dbt, and DuckDB to efficiently load data from Azure Blob Storage to Azure SQL Server, creating a robust and maintainable data pipeline.


📌 Featured Projects

  • agile-ai ⭐ – AI-powered agile project management tools
  • Modular API Ingestion – Built an extendable API ingestion framework using dlt, DuckDB, and Python, improving data integration efficiency
  • Automated Data Processing Pipelines – Created dbt-driven transformation workflows, enabling structured reporting and analytics
  • Dagster & dbt on Kubernetes – Designed a fully automated cloud-native data pipeline, reducing manual intervention and improving observability

📫 Let's Connect!


Pinned

  1. python-flask-with-javascript (Public)

    This repository contains an example app to communicate between JavaScript and Python.

    HTML 67 25

  2. notebooks (Public)

    This repo contains my public notebooks together with the Docker files to get the Anaconda environment up and running.

    Jupyter Notebook 4 4

  3. vps-provision (Public)

    Another experiment to provision my VPS

    Shell 1 2

  4. mario-api-python-eve (Public)

    Python

  5. pyspark-101 (Public)

    A PySpark course to get started with the basics for a Data Engineer

    Jupyter Notebook 9 9