# **Lesson 1 - Database Concepts**

---

## DATA VS. INFORMATION

**Data** – raw or unprocessed facts.  
**Information** – data that has been processed, organized, interpreted, and structured so that it becomes meaningful.
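
The distinction can be sketched in a few lines of Python. This is only an illustration: the sales records and the `summarize` helper are hypothetical, made up to show raw facts being processed into something meaningful.

```python
from collections import defaultdict

# Raw data: individual, unprocessed facts (hypothetical sales records).
raw_sales = [
    ("2024-01-05", "pens", 3),
    ("2024-01-05", "paper", 10),
    ("2024-01-06", "pens", 7),
]


def summarize(records):
    """Process raw records into information: total units sold per product."""
    totals = defaultdict(int)
    for _date, product, quantity in records:
        totals[product] += quantity
    return dict(totals)


information = summarize(raw_sales)
print(information)  # {'pens': 10, 'paper': 10}
```

The tuples in `raw_sales` are data; the per-product totals are information because they have been organized to answer a question.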

---

## COMPONENTS OF DATABASE ENVIRONMENTS

- **Hardware** – refers to the physical, electronic devices such as computers and hard disks that offer the interface between computers and real-world systems.  
- **Software** – the programs that control and support the database, including the DBMS software, the operating system, network tools, and applications for data access.  
- **Data** – facts and information that need to be organized and processed to make them more meaningful.  
- **Procedures** – the instructions and rules for using a database management system, covering installation and setup, login and logout, daily operations management, data backups, and report generation.  
- **Database Access Language** – language used to write commands to access, update, and delete data stored in a database.
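
SQL is the most widely used database access language. The sketch below uses Python's built-in `sqlite3` module to run SQL commands that insert, update, delete, and read data. The `student` table and its rows are hypothetical and exist only for this example.

```python
import sqlite3

# An in-memory SQLite database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (id INTEGER PRIMARY KEY, name TEXT, course TEXT)")

# Insert two rows.
conn.execute("INSERT INTO student (id, name, course) VALUES (1, 'Ana', 'BSIT')")
conn.execute("INSERT INTO student (id, name, course) VALUES (2, 'Ben', 'BSCS')")

# Update one row, delete the other.
conn.execute("UPDATE student SET course = 'BSIT' WHERE id = 2")
conn.execute("DELETE FROM student WHERE id = 1")

# Access (read) what is left.
rows = conn.execute("SELECT id, name, course FROM student").fetchall()
print(rows)  # [(2, 'Ben', 'BSIT')]
conn.close()
```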

### ADDITIONAL COMPONENTS:

- **CASE Tools** – computer-aided software engineering tools, used to automate parts of database and application design.  
- **Repository** – a centralized storehouse of metadata.  
- **Database Management System (DBMS)** – software for managing the database.  
- **Database** – the storehouse of the data.  
- **Application Programs** – software that uses the data.  
- **User Interface** – the text and graphical displays presented to users.  
- **Data Administrators** – personnel responsible for maintaining the database.  
- **System Developers** – personnel responsible for designing databases and software.  
- **End Users** – people who use the applications and databases.

---

## METADATA

Information that describes the structure, content, and context of a database. It's also known as "data about data".

**Characteristics of Metadata**  
Metadata contains information about a data asset such as:  
- Properties  
- Origin  
- History  
- Location  
- Creation date  
- Ownership  
- Versions  
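
Most DBMSs store this kind of metadata in a catalog that can itself be queried. As a minimal sketch, SQLite exposes its catalog through the `sqlite_master` table and the `PRAGMA table_info` command. The `employee` table here is a hypothetical example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee (id INTEGER PRIMARY KEY, name TEXT NOT NULL, hired_on TEXT)"
)

# sqlite_master is SQLite's catalog: one row per table, index, view, or trigger.
tables = [
    row[0]
    for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
]

# PRAGMA table_info returns one row per column:
# (cid, name, type, notnull, dflt_value, pk)
columns = [
    (name, col_type, bool(notnull))
    for _cid, name, col_type, notnull, _default, _pk in conn.execute(
        "PRAGMA table_info(employee)"
    )
]

print(tables)   # ['employee']
print(columns)  # [('id', 'INTEGER', False), ('name', 'TEXT', True), ('hired_on', 'TEXT', False)]
conn.close()
```

No employee rows were inserted, yet the database can still describe the table's structure. That description is the metadata.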

---

## TRADITIONAL FILE PROCESSING

Traditional file processing is a system where data is stored in separate files for each application within an organization. In this approach:  
- Each department typically has its own set of files, which leads to data redundancy and data isolation.  
- There is no overall plan or model to guide application growth.  

**Some disadvantages of traditional file processing are:**  
- **Duplication of Data** – This leads to inefficient use of storage space and potential compromises to data integrity. Databases centralize storage for consistency.  
- **Interdependence** – Data and programs are tightly linked, unlike databases that separate them.  
- **Limited Data Sharing** – File systems struggle with data sharing and report generation, while databases streamline both.  
- **Security & Integrity** – Databases enforce better security and data consistency constraints.  
- **Efficiency & Maintenance** – File systems are harder to maintain, while databases optimize storage and updates.  
- **Concurrency & Atomicity** – Databases handle simultaneous access and transactions better.  
- **Failure & Cost** – This is a trade-off rather than a pure disadvantage: a file system failure is isolated to one application, while a database failure can affect every application that uses it. File systems also tend to be cheaper in hardware costs.
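
The duplication problem can be illustrated with a small sketch. Two departments each keep their own copy of the same customer record, modeled here as plain Python dictionaries. The customer data and the `find_conflicts` helper are hypothetical.

```python
# Each department keeps its own "file" with a copy of the same customer.
sales_file = {"C001": {"name": "Ana Cruz", "address": "12 Mabini St"}}
billing_file = {"C001": {"name": "Ana Cruz", "address": "12 Mabini St"}}

# The customer moves, but only the sales department records the change.
sales_file["C001"]["address"] = "88 Rizal Ave"


def find_conflicts(file_a, file_b):
    """Return the keys whose records differ between the two files."""
    return [key for key in file_a if key in file_b and file_a[key] != file_b[key]]


conflicts = find_conflicts(sales_file, billing_file)
print(conflicts)  # ['C001']
```

With a single shared customer table there would be only one address to update, so the two departments could not disagree.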

---

## THE DATABASE APPROACH

An approach where data are logically stored in databases, managed by a DBMS.  
A database is designed using data models which define the nature and relationships among data.  
The effectiveness and efficiency of a database are directly associated with the structure of the database.

**Advantages of the Database Approach**  
- Planned data redundancy  
- Minimal data duplication  
- Improved data consistency  
- Program-data independence, which allows data to evolve without changing the application programs  
- Reduced program maintenance  
- Improved data sharing  
- Increased productivity of application development  
- Enforcement of standards  
- Improved data quality  
- Improved data accessibility and responsiveness  
- Improved decision support  
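
Program-data independence can be sketched with `sqlite3`. In this hypothetical example, the application function `list_prices` keeps working unchanged after a new column is added to the `product` table for some other application.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product (id INTEGER PRIMARY KEY, name TEXT, price REAL)")
conn.execute("INSERT INTO product (name, price) VALUES ('pen', 12.5)")


def list_prices(connection):
    """Application code: it names only the columns it needs."""
    return connection.execute(
        "SELECT name, price FROM product ORDER BY id"
    ).fetchall()


before = list_prices(conn)

# The data evolves: a new column is added for another application.
conn.execute("ALTER TABLE product ADD COLUMN supplier TEXT")
conn.execute("INSERT INTO product (name, price, supplier) VALUES ('ink', 30.0, 'Acme')")

# The same application code still runs without modification.
after = list_prices(conn)
print(before)  # [('pen', 12.5)]
print(after)   # [('pen', 12.5), ('ink', 30.0)]
conn.close()
```

In a file-based system where each program parses a fixed record layout, adding a field would typically require changing every program that reads the file.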

---

## RANGE OF DATABASE APPLICATIONS

- **Personal Databases** – supports one user  
- **Workgroup Databases (less than 25 users)** – supported by two-tier client/server databases  
- **Department/Division Databases (between 25 and 100 users)** – supported by multitier client/server databases  
- **Enterprise Databases** – scope is an entire organization

---

## EVOLUTION OF DATABASE SYSTEMS

- **1960s** – Experimental proof of concept.  
- **1970 - 1990** – Hierarchical, Network  
- **1980 - Present** – Relational  
- **1990 - Present** – Object-Oriented  
- **1990 - Present** – Object-Relational  
- **2000 - Present** – Not only SQL (NoSQL)

### Evolution of Database Systems:

- **1970 - 1990: Hierarchical** – a data model in which data is organized into a tree-like structure; limited to one-to-many relationships.  
- **1970 - 1990: Network** – much like the hierarchical model, except that it permits many-to-many relationships.  
- **1980 - present: Relational (RDB)** – establishes relationships between entities through common fields; each file (table) is called a relation.  
- **1990 - present: Object-oriented (OODB)** – represents information as objects that encapsulate both data and behavior.  
- **1990 - present: Object-relational (ORDB)** – provides a middle ground between relational databases and object-oriented databases.  
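
The difference between the hierarchical and relational models can be sketched as follows. This is an illustration only: the hierarchy is modeled as a nested Python dictionary rather than a real hierarchical DBMS, and the department and student names are hypothetical.

```python
import sqlite3

# Hierarchical: a tree in which each child has exactly one parent (one-to-many).
hierarchy = {
    "College": {
        "IT Department": ["Ana", "Ben"],
        "Math Department": ["Cara"],
    }
}


def members_of(tree, department):
    """Walk the tree from the root to find a department's students."""
    for departments in tree.values():
        if department in departments:
            return departments[department]
    return []


# Relational: separate relations (tables) linked by a common field, dept_id.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE department (dept_id INTEGER PRIMARY KEY, dept_name TEXT);
    CREATE TABLE student (
        student_id INTEGER PRIMARY KEY,
        name TEXT,
        dept_id INTEGER REFERENCES department(dept_id)
    );
    INSERT INTO department VALUES (1, 'IT Department'), (2, 'Math Department');
    INSERT INTO student VALUES (1, 'Ana', 1), (2, 'Ben', 1), (3, 'Cara', 2);
    """
)
joined = conn.execute(
    """
    SELECT s.name
    FROM student AS s
    JOIN department AS d ON s.dept_id = d.dept_id
    WHERE d.dept_name = 'IT Department'
    ORDER BY s.student_id
    """
).fetchall()

print(members_of(hierarchy, "IT Department"))  # ['Ana', 'Ben']
print(joined)                                  # [('Ana',), ('Ben',)]
conn.close()
```

In the tree, the relationship is fixed by where a record sits. In the relational version, the relationship exists only because both tables share the `dept_id` field, so it can be queried in either direction.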

---

## OTHER CONCEPTS

### Data Warehouse

**Data Warehouse** – a subject-oriented, integrated, time-variant, non-volatile collection of data used in support of management decision making and business intelligence.

### Voluminous Time-Variant Data

- A **voluminous database of time-variant data** contains large amounts of historical data.
- This data is often **non-volatile** (i.e., it is not deleted).
- It is **useful for time-series analysis**, allowing you to observe how things have changed over time.
- Common use cases:
  - Forecasting and prediction
  - Business Intelligence (BI)
  - Decision-making support
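
A minimal sketch of time-series analysis over historical data, using `sqlite3`. The `sales_history` table and its figures are hypothetical. Rows are only ever added, never deleted, which is what makes the month-to-month comparison possible.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales_history (sale_date TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales_history VALUES (?, ?)",
    [
        ("2024-01-10", 100.0),
        ("2024-01-25", 150.0),
        ("2024-02-03", 300.0),
        ("2024-03-14", 200.0),
    ],
)

# Total sales per month: the first 7 characters of the date are 'YYYY-MM'.
monthly = conn.execute(
    """
    SELECT substr(sale_date, 1, 7) AS month, SUM(amount)
    FROM sales_history
    GROUP BY month
    ORDER BY month
    """
).fetchall()

# Change from each month to the next.
changes = [
    (later[0], later[1] - earlier[1])
    for earlier, later in zip(monthly, monthly[1:])
]

print(monthly)  # [('2024-01', 250.0), ('2024-02', 300.0), ('2024-03', 200.0)]
print(changes)  # [('2024-02', 50.0), ('2024-03', -100.0)]
conn.close()
```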

### Data Mart

- A **Data Mart** is a **smaller, more focused version of a Data Warehouse**.
- It is typically organized by **specific subjects or business areas** to make data management and access easier.
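
One simplified way to picture a data mart is as a subject-specific slice of the warehouse. The sketch below models that slice as an SQL view in `sqlite3`. This is an assumption made for illustration: real data marts are often separate databases, and the `warehouse_facts` table and `sales_mart` view are hypothetical names.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# A tiny stand-in for a warehouse that covers several subjects.
conn.execute("CREATE TABLE warehouse_facts (subject TEXT, period TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO warehouse_facts VALUES (?, ?, ?)",
    [
        ("sales", "2024-01", 250.0),
        ("payroll", "2024-01", 90.0),
        ("sales", "2024-02", 300.0),
    ],
)

# The "data mart": only the rows for one business area.
conn.execute(
    """
    CREATE VIEW sales_mart AS
    SELECT period, amount FROM warehouse_facts WHERE subject = 'sales'
    """
)

mart_rows = conn.execute(
    "SELECT period, amount FROM sales_mart ORDER BY period"
).fetchall()
print(mart_rows)  # [('2024-01', 250.0), ('2024-02', 300.0)]
conn.close()
```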

### Web-Enabled Database

- A **web-enabled database** refers to a database with a **web-based interface**.
- It offers **standard database functionalities**, but is accessed **remotely through a browser** or web application.
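
As a rough sketch of the idea, the example below puts a small web interface (a WSGI application from Python's standard library) in front of an SQLite database. The `book` table and the `book_app` function are hypothetical, and the request is simulated in-process so that no network port is opened.

```python
import sqlite3
from wsgiref.util import setup_testing_defaults

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE book (id INTEGER PRIMARY KEY, title TEXT)")
conn.executemany("INSERT INTO book (title) VALUES (?)", [("Dune",), ("Emma",)])


def book_app(environ, start_response):
    """WSGI application: answer every request with the list of book titles."""
    rows = conn.execute("SELECT title FROM book ORDER BY id").fetchall()
    body = "\n".join(title for (title,) in rows).encode("utf-8")
    start_response("200 OK", [("Content-Type", "text/plain; charset=utf-8")])
    return [body]


# Simulate a single browser request without starting a server.
environ = {}
setup_testing_defaults(environ)
statuses = []
response = book_app(environ, lambda status, headers: statuses.append(status))

print(statuses[0], response[0].decode("utf-8").splitlines())
# 200 OK ['Dune', 'Emma']
```

To try it from an actual browser, the same function could be served with `wsgiref.simple_server.make_server("localhost", 8000, book_app).serve_forever()`.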
