Skip to content
View viplazylmht's full-sized avatar
🏠
Working from home
🏠
Working from home
Block or Report

Block or report viplazylmht

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
viplazylmht/README.md

Hi there 👋

Here's some information that can help you to know about me, let's go!

TLDR; Check out this pdf or image versions of my CV.

Hits

Experience

  • 01/2022 -> present: Data Engineer at MoMo (M_service). From MoMo Talents Program.

Education


Skills

  • Agile / Scrum concept
  • Programming Languages (C/C++, Java, Kotlin, Python, SQL,...)
  • MS SQL Server / Oracle OCI / Bigquery / Vertica / Trino
  • Open Table Format (Delta Lake / Apache Iceberg)
  • Command Line (with or without Linux/Unix system)
  • Git and Version Control
  • CI / CD
  • Shell / Linux
  • Docker
  • Kubernetes
  • ETL / ELT
  • Spark Application
  • Data modeling
  • Data Observability / Data Quality / Data Catalog / Data Security
  • Data Governance
  • Google Cloud Platform (Bigquery / PubSub / Dataproc / GKE / GCS / Cloud Functions / Resource monitoring / Looker / GCP gRPC API)
  • Oracle APEX
  • Scikit-learn
  • Machine Learning Algorithms
  • Generative AI
  • MS Office
  • Kubectl / Helm / Skaffold

Tools

Contributions


Project

Company Projects

  • Golden Record - Process to achieve high-value Data Mart at MoMo
    Build tools and services on top of open-source projects to control the data model's quality, freshness, and extensionality. Golden Record currently serves many dataflows such as events and transactions of the MoMo Super App.
    Used: dbt, Great Expectations, Airflow, Gitlab, Kubernetes, Oracle OCI, and Oracle APEX.

  • Cost Optimization - Reduce cost on GCP
    Support other teams to optimize queries: move services, ETL, and ELT to on-premise Kubernetes. Try to shift from Bigquery to Vertica. Manage GCP resources for each team in MoMo by the divide-and-conquer principle.
    Conclusion: 40% cost saved without any stuck workload.
    Fluent in: Bigquery, Vertica, Kubernetes, Oracle APEX, GCP gRPC API.

  • Data Observability - Data Governance
    Just a project which helps end-user monitor five pillars of data: Freshness, Volume, Quality, Schema, and Lineage. This project aims to reduce the workload of the data-platform team in responsiveness to data for both info and incident.
    Fluent in: Datahub, dbt, Great Expectations, Airflow.

  • Data Lakehouse
    Collaborate with the team to build a lakehouse solution to reduce the cost of all workloads at Momo. Trino/Spark run on GKE as a query engine to process large batch data stored in GCS. Reduce up to 70% cost per workload thanks to Spot instance without any data SLA.
    Fluent in: Trino, Spark, GKE, GCS, Bigquery Storage, dbt, Airflow, Apache Ranger, Delta Lake, Apache Iceberg

University projects


Badges

There are a lot of badges (with AI, Machine Learning, Deep Learning, and Data Scientist) I have reached from that base on Google Cloud Platform.

Let's check out my Qwiklabs Public Profile.

Programming Languages

Top Langs

Duy's GitHub stats


Contact

Website

Github Page: viplazylmht.github.io

Pinned

  1. Predict_Covid19 Predict_Covid19 Public

    Forked from caotatcuong/Predict_Covid19

    Jupyter Notebook

  2. Esmart Esmart Public

    A Education Technology project to support people practice English skill

    Java

  3. PublicIDConverter PublicIDConverter Public

    Public ID Converter is a tool for Android that can convert ids when porting/modifing apk.

    Java 1 1

  4. B3T4shark/B3T4shark B3T4shark/B3T4shark Public

    JavaScript

  5. cttn18ctt3/cttn18ctt3.github.io cttn18ctt3/cttn18ctt3.github.io Public

    Portal that help students contributing to class easily

    HTML

  6. hcmus-n930215/PythonGame-G15 hcmus-n930215/PythonGame-G15 Public

    This project is game assigned in university

    Python 1