deepapanicker/README.md

Hi there 👋

Senior Data Engineer | Cloud Data Pipelines | Scalable AWS & ETL Automations

LinkedIn Email Portfolio


👨‍💻 About Me

Senior Data Engineer with 8+ years of experience designing and delivering scalable cloud data platforms and ETL pipelines. Based in Toronto, ON, I specialize in building robust data infrastructure that enables data-driven decision making.

What I do:

  • 🏗️ Design and build scalable cloud data platforms (AWS, GCP, Azure)
  • 🔄 Develop production-ready ETL/ELT pipelines with Airflow, dbt, and Matillion
  • 📊 Model data warehouses using dimensional modeling (Star/Snowflake schemas)
  • 🤖 Explore GenAI tools, LLMs, and AI technologies for intelligent data processing
  • 🛠️ Automate infrastructure provisioning with Terraform and CI/CD pipelines
  • 📈 Enable A/B testing, user engagement analysis, and real-time analytics

Tech Arsenal 🛠️:

AWS, GCP, Azure, Airflow, dbt, Spark, Databricks, Python, PostgreSQL, MySQL, Oracle, Redshift, SQL Server, Terraform, Jenkins, Git, Docker, GitHub, Grafana, CloudWatch, Bash, OpenAI, Anthropic, Looker, Kafka, NGINX, Apache


🛠️ Tech Stack

Cloud Platforms

AWS GCP Azure

Services: S3, Lambda, Redshift, Glue, RDS, DynamoDB, ECS Fargate, BigQuery, Cloud Storage, Cloud Dataflow

Data Engineering & ETL

Airflow dbt Matillion

Tools: Informatica PowerCenter, SSIS, Autosys, Databricks, PySpark, Apache Spark

Programming & Scripting

Python SQL Shell

Libraries: pandas, numpy, requests | Languages: Python, Advanced SQL, PL/SQL, Shell Scripting

Data Warehousing

Databases: Redshift, SQL Server, Oracle, Teradata, DB2 | Modeling: Dimensional Modeling (Star/Snowflake)

Infrastructure & DevOps

Terraform Jenkins Git

Tools: Terraform, Jenkins, Git, CI/CD Pipelines, Containerized Services

AI & Machine Learning

OpenAI Anthropic

Technologies: LLMs (OpenAI, Anthropic), RAG, Vector Databases, Model Context Protocol (MCP)

Monitoring & Governance

Tools: CloudWatch, Grafana, Data Quality Frameworks, Metadata Management

Business Intelligence

Tools: Looker (LookML), SSRS, Report Builder


GitHub Stats

Stats cards (images not reproduced here): GitHub Stats, Top Languages, GitHub Streak


📫 How to Reach Me


⚑ Fun Facts

  • 🎯 Passionate about building scalable data solutions that enable data-driven decision making
  • πŸš€ Always exploring new technologies to enhance data processing capabilities
  • πŸ“š Love sharing knowledge and mentoring others in data engineering
  • β˜• Coffee enthusiast and problem solver

Thanks for visiting my profile! Feel free to connect or reach out for collaboration opportunities. πŸš€

Popular repositories

  1. study-python Public

  2. deepapanicker Public

  3. deepapanicker.github.io Public

  4. my-portfolio Public

  5. etl-pipeline-airflow Public

    Production-ready ETL pipeline framework with Apache Airflow, Python, and PostgreSQL

    Python

  6. data-quality-framework Public

    Comprehensive data quality validation and monitoring framework with Great Expectations

    Python