sebastiandaberdaku/README.md

Hi there 👋

With over 5 years of experience as a data engineer and a PhD in Information Engineering, I am passionate about designing and implementing efficient, scalable, and fault-tolerant data pipelines.

My expertise lies in building and managing data ingestion pipelines with technologies such as Spark, Airflow, Python, Databricks, and TrinoDB, and in setting up the related cloud architecture (mainly AWS) with CloudFormation and Terraform. I have worked with diverse datasets in different domains, including bioinformatics (proteomics), clinical/health informatics, and finance. I have a strong background in research and data science, which enables me to apply a rigorous and analytical approach to solving complex problems and delivering innovative solutions. I also have experience in backend development with Python and Java (Spring Boot), and I apply the SOLID design principles in my code.

Throughout my career, I have led and contributed to various projects, both in industry and academia. These experiences have honed my skills in project management, stakeholder engagement, and collaboration with cross-functional teams. I am always seeking new opportunities to learn and grow, and I am committed to staying current with emerging technologies and best practices in the data engineering field. I am motivated by the challenge of transforming data into meaningful insights that can drive business value and social impact.

Check out my professional resume here.

Pinned

  1. apache-airflow-providers-pysparkonk8s (Public)

    Python package for the pysparkonk8s Apache Airflow provider.

    Python

  2. spark-glue-python (Public)

    Docker image with Apache Spark, AWS Glue metastore support, and Python

    Dockerfile 3

  3. spark-with-glue-builder (Public)

    Docker image that builds a patched Apache Spark with support for AWS Glue as the metastore

    Dockerfile 6 1

  4. base-aws-tf-infrastructure (Public)

    Base Terraform project used to bootstrap a remote Terraform state on AWS

    HCL 1

  5. charts (Public)

    My custom Helm Chart repository

    Smarty 9 1

  6. terraform-modules (Public)

    Custom Terraform modules repository

    HCL 1