This project demonstrates end-to-end Data Analysis using SQL on an Oracle Database 21c environment. It includes DDL, DML, and DQL operations on a simulated Employee-Department dataset. The analysis focuses on extracting actionable insights from data through query-based manipulation and exploration techniques. This project can serve as a great reference for beginners and intermediates working on database management, data retrieval, and data pipeline development using SQL.
- Project Description
- Installation & Setup
- Solution Methodology
- Key Features
- SQL Commands Covered
- Project Folder Structure
- Author
The Data Analysis Project focuses on understanding datasets through SQL queries by performing various CRUD (Create, Read, Update, Delete) operations. We utilize Oracle SQL Developer and Oracle Database 21c for executing all the queries.
The agenda includes:
- Creating databases and tables.
- Populating datasets using DML operations.
- Retrieving data based on specific conditions.
- Handling NULLs, sorting, and filtering using complex nested queries.
- Backing up data and implementing transaction control using COMMIT and ROLLBACK.
- Conducting exploratory data analysis (EDA) via SQL.
- Download and install Oracle Database 21c from the official Oracle website.
- Launch the installer and follow these steps:
- Accept terms and conditions.
- Provide a destination folder.
- Set a password for the
SYSTEMuser. - Complete the installation.
- Download and install SQL Developer.
- Connect to the Oracle Database via SQL Developer:
- Enter connection details (hostname, port, SID/Service, username, password).
- Establish a successful connection to the
SYSTEMuser.
For MAC users: Oracle Database isn't supported. Use the provided labs or a Windows virtual machine.
The methodology applied in this project follows the Data Pipeline approach:
- Data Extraction: SQL queries are used to extract employee and department data.
- Data Transformation: Apply DML operations to manipulate data (INSERT, UPDATE, DELETE).
- Data Loading: Perform data loading into Oracle Database tables.
- Data Analysis:
- Display records based on conditions and sort them accordingly.
- Handle NULL values efficiently.
- Implement pattern searching using
LIKE, wildcards, and logical operators. - Backup tables and use transaction control commands.
- Execute complex nested queries for deeper insights.
✔️ Creation of database schemas using DDL
✔️ Data manipulation and retrieval using DML and DQL
✔️ Handling complex queries with nested conditions
✔️ Transaction control using COMMIT and ROLLBACK
✔️ Backup management and recovery operations
✔️ Extensive use of SQL Developer shortcuts for efficiency
✔️ Best practices in SQL query optimization
- DDL (Data Definition Language):
CREATE,ALTER,DROP
- DML (Data Manipulation Language):
INSERT,UPDATE,DELETE
- DQL (Data Query Language):
SELECT,WHERE,ORDER BY,GROUP BY,HAVING
- Transaction Control:
COMMIT,ROLLBACK
- Advanced Queries:
- Complex nested queries
- Handling
NULLvalues - Pattern matching with
LIKEand wildcards - Joins and subqueries
- Backup and data restoration operations
📂 SQL-Data-Analysis-Project
├── 📁 Installation & Execution
├── 📁 Codes
│ ├── Data_Analysis_Part_1.sql
│ ├── Data_Analysis_Part_2.sql
│ └── DDL.sql
├── 📄 Notes.pdf
├── 📄 Oracle_Database_Setup_v1.docx
├── 📄 Solution Methodology - 1.pdf
└── 📄 README.md
👨💻 Lakshmipathiraju Pericharla
📅 Date: March 11, 2025