Skip to content
View ibrahimakerkouch's full-sized avatar
  • Washington, D.C.
  • 07:59 (UTC -12:00)

Block or report ibrahimakerkouch

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ibrahimakerkouch/README.md

πŸ‘‹ Hi, I'm brahim

Welcome to my GitHub! I’m a data professional specializing in data engineering, ETL pipelines, and data quality. I enjoy solving complex data problems and building reliable, scalable data solutions. My work focuses on performing data validation, anomaly detection, root-cause analysis, and dashboard reporting to ensure accurate and trustworthy data across systems.

Skills & Competencies

πŸ“₯ Data Collection & Data Ingestion
πŸ—οΈ Data Engineering & ETL Pipeline Development
βœ… Data Quality Management & Validation
πŸ” Anomaly Detection & Root-Cause Analysis
πŸ”„ Data Integration & Migration
πŸ§ͺ Data Cleaning, Transformation & Enrichment
πŸ“ˆ Dashboarding, Reporting & Data Visualization

What You’ll Find Here

βš™οΈ End-to-end ETL and data integration projects
πŸ” Data quality, duplicate-detection, and issue detection workflows
🧬 Real-world datasets and pipeline testing setups
πŸ“ˆ Dashboards and visual analytics for KPIs, patterns, and trends
🧱 Documentation and best practices for scalable data solutions

Pinned Loading

  1. Patient-Registry-ETL Patient-Registry-ETL Public

    Automated ETL pipeline for patient registry data using PySpark and Airflow: extracts, transforms, and loads health condition, treatment, and patient records into Delta tables on S3 for analytics an…

    Python

  2. OpenTriviaDB-Analytics OpenTriviaDB-Analytics Public

    Automated ETL pipeline that ingests trivia data from the Open Trivia Database API into MySQL and delivers interactive analytics through a Power BI dashboard.

    Python

  3. Donor-Mailing-Data-PreProcessing Donor-Mailing-Data-PreProcessing Public

    Python ETL pipeline for cleaning, standardizing, and preparing donor data for mailing campaigns and BCC Software upload.

    Python

  4. Excel-Interactive-Dashboard Excel-Interactive-Dashboard Public

    A dynamic Excel dashboard featuring PivotTables, charts, and slicers to analyze sales performance by country, product, and month. Includes a β€œNew” sheet to demonstrate refresh functionality.

  5. TheMovieDB-Integration TheMovieDB-Integration Public

    Automated Python workflow for collecting and organizing movie data from TheMovieDB into MongoDB.

    Python

  6. ClinicalTrials.gov-Data-Pipeline ClinicalTrials.gov-Data-Pipeline Public

    End-to-end ETL pipeline for ClinicalTrials.gov cancer trials data with PostgreSQL storage and Power BI dashboard.

    Python