π Data Engineer | Cloud Enthusiast | Python Developer
I specialize in designing and building efficient, scalable, and reliable data systems to power data-driven decision-making. With a strong foundation in cloud technologies, big data, and automation, I am passionate about transforming raw data into actionable insights.
- Languages: Python, SQL, Java (beginner to intermediate), Scala,
- Automation: Bash, Python scripts
- ETL/ELT Tools: PySpark, Kafka, AWS Glue
- Data Warehousing: BigQuery, AWS Redshift,Snowflake ,Azure Synapse Analytics,Hive
- Databases: MySQL, PostgreSQL, MSSQL server
- Data Storage: AWS S3,Azure Data Lake ,Google Cloud Storage
- AWS: S3, Redshift, EC2, Lambda, Glue, ECR
- GCP: BigQuery, Cloud Functions, Looker,
- Azure: Azure DataBricks,Azure Data Factory, Azure Function, Azure Fabric
- Tools: Docker, Kubernetes, Terraform ,AWS cloudFormation, Azure ARM Templates
- CI/CD: Jenkins, GitHub Actions, GitLab CI/CD
- Prometheus, Grafana, ELK Stack
- TensorFlow, PyTorch, Langchain, llamaIndex ,OpenAi, Gemini,Llama, Mistral
- Automated ETL workflows with Python scripts and Cloud Functions.
- Leveraged BigQuery for large-scale data processing and analytics.
- Designed Looker dashboards, enhancing business insights delivery by 30%.
- Developed a deep learning-based object detection system using TensorFlow and PyTorch.
- Deployed models on AWS infrastructure using Docker and FastAPI.
- Designed a real-time data pipeline using Kafka, MySQL, PySpark, and AWS services.
- Enabled seamless data flow into AWS Redshift for analytics.
- Advanced concepts in Heap and Priority Queues for coding interviews.
β Fun Fact: I enjoy blending data engineering with DevOps principles to build robust and automated workflows!