This repository contains a collection of hands-on Data Engineering projects and learning materials focused on building practical data pipelines and processing workflows. It is intended to demonstrate real-world usage of data engineering concepts using commonly adopted tools and best practices.
The repository can be used both as a learning reference and as a portfolio to showcase applied data engineering skills.
The projects in this repository cover the following areas:
- ETL pipeline development
- Data extraction from files, APIs, and databases
- Data transformation and cleansing using Python
- Batch data processing workflows
- Basic automation and scripting
- Data storage using SQL and NoSQL concepts
- Practice material related to certifications and learning exercises
- Python
- SQL
- NoSQL databases (where applicable)
- File formats such as CSV and JSON
- Basic automation and workflow logic
Specific tools and dependencies may vary by project and are documented within individual folders.