Welcome to my GitHub! I'm an Azure Data Engineer with 2.7 years of experience currently working at Infosys. My expertise lies in crafting robust data solutions, optimizing data pipelines, and empowering businesses with actionable insights through the power of cloud computing and big data technologies.
- Cloud Platforms: Azure (Data Factory, Synapse Analytics, Databricks, Azure SQL, Blob Storage)
- Big Data Tools: Apache Spark (PySpark), Hadoop
- Programming Languages: Python, SQL, Scala
- Data Integration & ETL: Expertise in building efficient and scalable ETL pipelines
- Database Management: SQL Server, PostgreSQL, Cosmos DB
- Workflow Orchestration: Azure Data Factory, Apache Airflow
- Version Control: Git, GitHub
- Enhancing my knowledge of distributed computing and data architecture.
- Diving deeper into Data Lakehouse architectures using Delta Lake.
- Building scalable and resilient data pipelines for real-world projects.
- Built a scalable pipeline on Azure Synapse and Databricks to ingest and analyze customer data.
- Reduced data processing time by 30% using PySpark optimizations.
- Automated complex ETL workflows using Azure Data Factory and Python.
- Increased data reliability and reduced manual intervention by 50%.
- Designed dashboards using Power BI integrated with Azure Data sources.
- Provided actionable insights leading to a 15% increase in sales efficiency.
- LinkedIn: Sivaprasad V
- GitHub: Sivaprasad V
I’m passionate about solving real-world data problems and enjoy learning the latest advancements in cloud computing and big data. When I’m not coding, you can find me exploring new places or trying out new tech gadgets.
Feel free to explore my repositories, and don’t hesitate to reach out for collaboration or discussions!