Skip to content
View hrishikesh-2000's full-sized avatar

Block or report hrishikesh-2000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
hrishikesh-2000/README.md

Hi Folks πŸ‘‹, I'm Hrishikesh Sutar

Hey there! I'm a data engineering enthusiast from India, diving deep into the world of PySpark and large-scale data processing. If not plumbing, you will quite often find me strumming my guitar 😜! πŸš€



azure git hive linux mssql mysql pandas postgresql python Databricks Devops Datafactory Apache Spark YAML

πŸš€ About Me

  • Passionate about designing, building, and scaling data processing systems.

  • I am much more into understanding fundamentals and trying to tackle complex challenges and finding if something valuable I can turn into from raw data

  • πŸ“« How to reach me? sutarhrishikesh00@gmail.com

πŸ“š I'm Currently Certified In:

DP203 DP900 Databricks

πŸ”§ Technical Skills

  • Languages: Python, PySpark, SQL
  • Big Data Tools: Hadoop, Spark, HDFS, Hive, HBase
  • ETL Tools: Informatica PowerCenter, Azure Data Factory, SSIS
  • Cloud: Azure (Data Factory, Databricks, Synapse, DevOps)
  • Data Engineering: Data Warehousing, Data Modeling, CI/CD, API Integration

πŸ’Ό Building Stuff and Hoping It Works

  • Currently working on project end-to-end ETL pipeline for the insurance domain as of (25th Jan, 2025), including schema modeling.
  • Building and optimizing data pipelines with Azure Data Factory and Databricks for large-scale data ingestion.
  • Implementing Medallion Architecture and creating curated business views in SQL for data modeling.

πŸ“˜ On a Quest to Figure This Stuff Out

  • Deepening my knowledge of PySpark for handling large-scale data applications.
  • Expanding CI/CD and DevOps skills for seamless data pipeline deployments.
  • Learning best practices in cloud data engineering focusing on Azure services.
  • 🌱 I’m currently learning and expanding my knowledge in Docker, PySpark, Python DSA, and Kafka
  • 🎸 Aspiring Rockstar in the Making! Currently learning guitar, so if you hear any out-of-tune noises, it's all part of the "creative process"! 🎢

πŸ“ˆ That's what I wanna achieve

  1. Master complex data engineering concepts and tools.
  2. Specialize in designing scalable data pipelines with PySpark.
  3. Contribute to impactful data projects and evolve as a well-rounded data engineer.

🀝 Let's Connect!

Let's collaborate and grow in the world of data engineering!

Pinned Loading

  1. ETL-Pipeline-For-Stock-Market Public

    ETL pipeline for stock market

    Python

  2. Python-Web-Scraping Public

    Web scrapper using python to scrap Flipkart

    Jupyter Notebook

  3. IPL-Exploratory-Data-Analysis-Season-2008-19 Public

    Jupyter Notebook