Skip to content
View juan-esteban-berger's full-sized avatar

Block or report juan-esteban-berger

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Juan Esteban Berger

LinkedIn | GitHub | Email

πŸ‘‹ About Me

I am a Data Engineer who is passionate about machine learning, mathematics, and data science. With experience in cloud technologies, data visualization, and automation, I'm always excited to tackle challenging projects that create real impact.

πŸ† Certifications

πŸŽ“ Education

  • M.S. in Applied and Computational Mathematics and Statistics
    University of Notre Dame, IN
    GPA: 3.54/4.00
    Graduated: May 2023

  • B.S. in Applied and Computational Mathematics and Statistics
    University of Notre Dame, IN
    Minor: Actuarial Science
    Graduated: May 2022

πŸ’Ό Work Experience

Data Engineer, Grupo Disresa, Guatemala City, Guatemala

Aug 2023 - Present

  • Built and maintained ETL data pipelines for the distribution of brands across Central America and the Caribbean, managing over 300,000 SKUs.
  • Developed data visualization dashboards with Power BI and Streamlit, deployed using MicroK8s Kubernetes.
  • Designed and implemented a PostgreSQL data warehouse integrating data from multiple sources (SAP, BigQuery, MongoDB).
  • Created a metadata dashboard to improve data quality, reducing missing values from 35% to under 5%.

Graduate Teaching Assistant, University of Notre Dame, IN

Aug 2022 - Dec 2022

  • Assisted in teaching Statistical Learning for Data Science (graduate-level) and Introduction to Probability (undergraduate-level).
  • Graded assignments and provided assistance to students regarding their questions.

πŸ“š Projects

MicroKeras: Minimal Deep Learning Library

A minimal implementation of the Keras Sequential model and Dense Layer classes, built from scratch using Python and NumPy.

S&P 500 Data Engineering Project

Built data pipelines using Apache Airflow and Docker to store financial data in PostgreSQL. Developed three data visualization dashboards, one using Streamlit, one using Tableau and and one using React and Django.

Options Pricing Master's Research Project

Trained models using Google Cloud AutoML, TensorFlow, and XGBoost to price European options and compared performance against the Black-Scholes model.

CUDA Neural Networks

Deep neural network implementation from scratch in C++ with CUDA for GPU acceleration. This project focuses on creating a neural network to classify digits using the MNIST dataset.

πŸ› οΈ Technical Skills

  • Programming: Python, SQL, C++
  • Machine Learning: TensorFlow, PyTorch, SciKit-Learn, XGBoost
  • ETL Pipelines: Pandas, PySpark, Polars, Dask
  • Databases: PostgreSQL, SQL Server, MongoDB, BigQuery, Hive
  • DevOps: Docker, Kubernetes, Apache Airflow
  • Data Visualization: Power BI, Streamlit, Tableau
  • Web Development: React, Tailwind CSS, Django, FastAPI
  • Editor & OS: Neovim, Linux (Arch, Debian, Ubuntu)

Pinned Loading

  1. microkeras microkeras Public

    MicroKeras is a minimal implementation of the Sequential Class from the Keras deep-learning library, built from scratch using Python and NumPy.

    HTML 1

  2. spx_dashboard spx_dashboard Public

    S&P 500 Data Engineering Project

    Python

  3. Options_Pricing_AutoML_TensorFlow_XGBoost Options_Pricing_AutoML_TensorFlow_XGBoost Public

    Pricing European Options with Google AutoML, TensorFlow, and XGBoost

    Jupyter Notebook 3 4

  4. cuda_networks cuda_networks Public

    Deep neural network from scratch in C++ with CUDA for GPU acceleration.

    Cuda

  5. termpandas termpandas Public

    Scrollable Pandas DataFrames in the Terminal

    Python 3