Skip to content
View bikash-deb-007's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report bikash-deb-007

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bikash-deb-007/README.md

Hi, I'm Bikash Deb

Data Engineer | Building scalable data pipelines with Apache Spark


About Me

I build production data pipelines that process millions of records. Currently focused on implementing Medallion Architecture for data lakes and revenue assurance systems in telecom.

What I work with:

  • Apache Spark & PySpark for distributed processing
  • Python for ETL development
  • Parquet & Delta Lake for storage
  • Data quality frameworks and validation

Technical Skills

Data Engineering:

  • Apache Spark (PySpark) | Databricks | Delta Lake
  • ETL/ELT Pipeline Development
  • Medallion Architecture (Bronze/Silver/Gold)
  • Data Quality & Validation

Programming & Tools:

  • Python | SQL | Git
  • Parquet | Delta | CSV
  • Data Modeling | Schema Design
  • Performance Optimization

Cloud & Infrastructure:

  • Distributed Computing
  • Data Warehousing
  • Version Control (Git/GitHub)

Featured Projects

TelcoStream Analytics Engine

Production-grade data engineering pipeline for telecom revenue assurance using PySpark and Medallion Architecture.

Tech Stack: PySpark, Parquet, Python, Medallion Architecture
Highlights:

  • Processes 5,000+ CDR records with schema-on-read validation
  • Identifies 33.6% of customers at bill shock risk
  • Implements 4-tier risk classification system
  • 60-70% reduction in disputed charges

View Project →


GitHub Stats

Bikash's GitHub stats

Top Languages


Connect With Me

GitHub


"Data scientists get the glory, but data engineers build the foundation."

Pinned Loading

  1. telcostream-analytics-engine telcostream-analytics-engine Public

    Production-grade data engineering pipeline for telecom revenue assurance using PySpark and Medallion Architecture

    Python 1