Welcome to my GitHub profile! I specialize in data engineering and love turning data into insights and action. Below is a snapshot of my skills and some projects I'm proud of.
- SQL & Databases: Advanced querying, performance tuning, and schema design.
- Python: Data analysis, automation scripts, and back-end development.
- PySpark: Large-scale data processing in distributed environments.
- Shell Scripting: Automating tasks and managing systems.
- AWS: Leveraging cloud resources for scalable data solutions.
Here are a few highlights of my projects. Check out the repositories for more details and demonstrations!
- Data Warehouse Optimization: Optimized SQL queries and designed an efficient schema that improved data retrieval times by 50%.
- Analytics Dashboard: Developed a Python-based analytics tool to process and visualize data, enhancing decision-making processes.
- Automated Data Pipeline: Implemented a data pipeline using PySpark and AWS to streamline data ingestion and processing.
π Formula1-Data-Pipeline
π Technologies Used: Python, PySpark, Databricks, Delta Lake
π Description: A scalable data pipeline to analyze Formula 1 standings using Medallion Architecture.
π Check it out: Formula 1 Data Pipeline Repository
- Email: itakvishal@gmail.com
- LinkedIn: LinkedIn Profile
I love exploring the latest technology trends and am always on the lookout for new challenges in the data sphere!