This capstone project highlights the hands-on application of core relational database administration skills by simulating a real-world enterprise environment.
I played the role of a Database Administrator (DBA) responsible for designing, managing, and securing both OLTP and OLAP systems using MySQL and PostgreSQL, with automation and performance tuning included.
As a final project in the IBM Relational Database Administrator Professional Certificate, this capstone involved:
- Designing transactional and analytical databases
- Implementing ETL workflows using Apache Airflow
- Performing backups and automating restore procedures
- Securing data with encryption and granular access control
- Optimizing SQL queries and database memory usage
β
OLTP and Data Warehouse schema design and deployment
β
Data encryption, column-level access control, and secure user management
β
Backup, restore, and cron job automation
β
Performance tuning with indexing and query optimization
β
End-to-end ETL pipeline development using Apache Airflow
- π¬ MySQL & π PostgreSQL
- βοΈ Apache Airflow
- π SQL Grants & Data Encryption
- πΎ Backup & Restore Scripting (cron,
.sh
) - π OLTP & OLAP Systems
- π§± Schema Design & Indexing
- π§ Linux Terminal, Shell Scripting
Module | Description |
---|---|
π§ 1. OLTP Database & Platform Architecture | Created MySQL-based OLTP schema with indexes and constraints |
ποΈ 2. Data Warehouse Design | Built dimensional models using PostgreSQL with star schema |
π 3. ETL & Pipelines | Developed Apache Airflow DAGs to automate data ingestion and transformation |
πΎ 4. Backup & Restore Automation | Scripted full backup and recovery pipelines using cron and shell |
π 5. Security & Optimization | Encrypted sensitive data, configured user access roles, and optimized queries |
π€ 6. Final Submission | Submitted full database system with automation and documentation |
π OLTP Schema & Sales Database (MySQL)
π PostgreSQL Data Warehouse
π Apache Airflow DAGs & ETL Scripts
π Backup/Restore Scripts (cron, bash)
π Security Scripts (GRANT, ENCRYPTION)
π Optimization SQL (Indexes, EXPLAIN)
- Defined roles:
db_admin
,db_analyst
,db_reporter
,db_external
- Encrypted sales amount fields using
VARBINARY
and passphrases - Column- and table-level GRANT statements
- Created cron jobs to back up the sales database every 3 minutes
- Wrote scripts to truncate tables, zip backups, and restore via terminal
- Maintained logs and
.tar.gz
backup archives
- Parsed web logs and loaded structured data to databases
- Implemented DAGs for extract β transform β load
- Used custom Python scripts for ingestion and transformation
- Created dimensional tables:
DimDate
,DimCategory
,DimCountry
,FactSales
- Performed OLAP queries using
ROLLUP
,CUBE
, andGROUPING SETS
- Generated Materialized Query Tables (MQTs)
- Created indexes and tested pre/post-query performance
- Analyzed memory usage using MySQL EXPLAIN
- Improved query execution time by optimizing data types and schema
This project brought together all major elements of relational database administration, allowing me to:
- Simulate enterprise-grade database systems
- Automate routine tasks using Linux tools
- Secure and manage data access at scale
- Apply performance tuning methods in real-world contexts
β
Confident in managing RDBMS systems (MySQL, PostgreSQL)
β
Experienced with automation, security, and optimization workflows
β
Ready for DBA or backend-focused roles involving data infrastructure
- Recruiters seeking skilled DBAs or backend engineers
- Employers looking for automated, secure data systems
- Projects requiring optimized RDBMS solutions and ETL integration
- ETL Automation using Apache Airflow
- Data Warehouse with PostgreSQL
- SQL Query Optimization & Indexing
Explore more of my work and collaborations:
π My Portfolio | πΌ LinkedIn | π GitHub Projects