Skip to content
View Turnipdo's full-sized avatar
  • New Taipei City, Taiwan
Block or Report

Block or report Turnipdo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Turnipdo/README.md

Hey :shipit:

I'm currently on a path to learning data engineering tools to expand my knowledge and share it with the world.
I hope it helps in some way!

Currently Learning 📑

AWS Cloud Services Trino Apache Spark Apache Kafka

Contact 📨

For a quick response, You can connect with me on LinkedIn.

Pinned Loading

  1. PyODBC-Data-Import-for-SSMS-AlexTheAnalyst-Ref- PyODBC-Data-Import-for-SSMS-AlexTheAnalyst-Ref- Public

    Using Python's pyodbc module to connect to Microsoft SQL Server and import data into SSMS.

    Jupyter Notebook

  2. SSMS-SQL-PowerQueryM-Functions SSMS-SQL-PowerQueryM-Functions Public

    Using Power Query M to extract values from Excel workbooks for dynamic insertion into SQL code.

  3. Real-Time-BTC-USD-Airflow-DAG-Extract-In-Excel Real-Time-BTC-USD-Airflow-DAG-Extract-In-Excel Public

    Using yfinance, we grab minute-by-minute BTC-USD data, dump it into PostgreSQL, and link Excel via ODBC for quick analysis!

    Python

  4. Spark-Standalone-Cluster-Setup Spark-Standalone-Cluster-Setup Public

    To facilitate the initial setup of Apache Spark, this repository provides a beginner-friendly, step-by-step guide on setting up a master node and two worker nodes.

    Python 2

  5. Docker-Spark-Setup Docker-Spark-Setup Public

    Setting up a Spark cluster in a Docker environment for improved repeatability and reliability. This project includes a simple transformation on a dataset containing approximately 31 million rows.

    Python 2