Hi, I am a data engineer who build data infrastructure on cloud.
Currently focusing on Apache Spark and Distributed Systems.
- Bangkok, Thailand
Pinned Loading
-
PySpark-ETL-with-AWS-DMS-and-Databricks
PySpark-ETL-with-AWS-DMS-and-Databricks PublicAn example of processing change data capture with AWS DMS and store in delta table in Databricks.
Jupyter Notebook
-
PySpark-ETL-with-MySQL-in-Databricks
PySpark-ETL-with-MySQL-in-Databricks PublicAn example of reading MySQL data incrementally in Databricks with PySpark, process and store it in Delta table.
Jupyter Notebook
-
Clustering-with-KModes-in-Python
Clustering-with-KModes-in-Python PublicUsing KModes in Python for categorical clustering on adult dataset.
Jupyter Notebook 1
-
Spark-Structured-Streaming-in-Databricks
Spark-Structured-Streaming-in-Databricks PublicStreaming data processing with Spark's Structured Streaming examples.
Scala
-
just-me-learning-scala
just-me-learning-scala Publicjust to keep track of my learning progress.
Scala
-
mini-data-catalog-with-flask
mini-data-catalog-with-flask PublicA simple mini data catalog with Flask and Docker.
Python
If the problem persists, check the GitHub status page or contact support.