I work as a Tech Lead in ING Advanced Analytics, where I am part of a team contributing to the Data Analytics Platform.
My current focus is data cataloging, discoverability and lineage.
In the past I was heavily involved in:
- Writing and extending frameworks for data ingestion
- Writing and extending frameworks for data quality / profiling
I have strong experience in distributed systems, leveraging modern technologies such as:
- Kubernetes
- Apache Airflow
- Apache Spark
- Apache Superset
- Confluent Kafka
- Elastic Stack (formerly ELK)
I hold the following certificates:
- Google Cloud Professional Cloud Architect (PCA)
- Google Cloud Associate Cloud Engineer (ACE)
- Certified Kubernetes Application Developer (CKAD)
I am also a trainer for Sages, a Polish training company, where I conduct Elastic Stack and Spark trainings. So far I've conducted 30+ trainings for over 250 people.
My experience revolves mostly around Open Source technologies, of which I am very fond. I am proud to be a contributor/maintainer of:
- Amundsen (LF AI) /maintainer/ - a data discovery and metadata engine for improving the productivity of data analysts, data scientists and engineers when interacting with data
- OpenMetadata /contributor/ - an all-in-one platform for data discovery, data lineage, data quality, observability, governance, and team collaboration
- Apache Airflow /contributor/ - a platform to programmatically author, schedule and monitor workflows
- Apache Atlas /contributor/ - a metadata governance framework
Although rather seldom, I sometimes write Medium stories:
- Smart Indoor Trainer - an IoT project leveraging AI and ANT+ to make indoor cycling training sessions bearable
- Smart Garbage App - an AI-powered waste sorting trash bin
- Snooker Data Client - a simple Python client for retrieving snooker related data from api.snooker.org
Outside of work I enjoy:
- Coffee
- Cycling
- Snooker