priyam-choksi/README.md


LinkedIn | Personal Website | Email | GitHub | Kaggle

Hi there! 👋 I'm Priyam (pronounced "PREE-yum"), a Data Engineer and ML Engineer with hands-on experience in developing data pipelines, optimizing machine learning models, building dashboards and data visualizations, and integrating business intelligence solutions. I’m all about making data work for us in practical, exciting ways.

Currently, I’m learning about Generative AI and Prompt Engineering, exploring how to build advanced AI systems that generate content and interact with users in creative ways.

When I'm not exploring data projects, I enjoy cloud watching, hiking, and sleeping.


🛠️ Skills

🧑‍💻 Languages: Python, SQL, R, Java, C++, JavaScript

🗄️ Databases: MongoDB, PostgreSQL, MySQL, Snowflake

☁️ Big Data & Cloud: AWS (S3, EC2, Lambda), Azure, Spark, Kafka, Athena, Databricks, Docker, Kubernetes

🤖 Machine Learning & AI: TensorFlow, Keras, PyTorch, OpenCV, Neural Networks, NLP, BERT, LLM, GPT

📊 BI/ETL: Power BI, Tableau, Excel, Looker, Talend, Jupyter, Colab, Google Analytics, Qlik Sense

Feel free to reach out if you have any questions or just want to connect!




Pinned

  1. Scalable-Data-Engineering-Pipeline-using-Apache-Kafka-Apache-Spark-and-Cassandra Public

    An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All comp…

    Python

  2. Real-Time-Stock-Market-Data-Processing Public

    This project focuses on constructing a real-time data engineering pipeline for stock market data using Apache Kafka, Python, and various AWS services. The goal is to demonstrate an end-to-end imple…

    Jupyter Notebook

  3. Data-Integration-and-Business-Intelligence Public

    This project involves creating a robust data warehouse to support the sales and purchasing operations of the AdventureWorks company. Utilizing multiple data sources from different database systems,…

  4. Uber-ETL-Data-Engineering-Project Public

    The goal of this project is to perform data analytics on Uber data using various tools and technologies, including GCP Storage, Python, Compute Instance, Mage Data Pipeline Tool, BigQuery, and Look…

    Jupyter Notebook

  5. Diabetes-Streamlit-App Public

    This Diabetes Prediction App aims to assess the likelihood of diabetes based on various health metrics provided by the user. The application leverages a Logistic Regression model, well-suited for b…

    Python

  6. DS-ML-Notebooks Public

    This repository is a showcase of my data science and machine learning projects. Each notebook is an independent project where I explore different datasets, apply various data processing techniques,…

    Jupyter Notebook