Here’s a full summary note covering **Arity**, **Internal Nodes**, **Leaf Nodes**, and related tree properties in DBMS and Data Structures:

---

### 🌳 **Tree Terminologies Summary (DBMS + DSA)**

#### 1. **Arity**

* **Definition**: The maximum number of children a node can have.
* **Arity of a Tree**: The maximum number of children any node in the tree has.

  * For example, in a **binary tree**, the arity is **2**.
  * If a tree has nodes with up to 4 children, its arity is **4**.
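As a minimal sketch, the arity of a tree can be computed by visiting every node and keeping the largest child count. The `Node` class and the sample tree below are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical tree node: a label plus an ordered list of children."""
    label: str
    children: list = field(default_factory=list)


def arity(root: Node) -> int:
    """Return the largest number of children found at any node in the tree."""
    best = len(root.children)
    for child in root.children:
        best = max(best, arity(child))
    return best


# The root has 3 children, and one of those children has 4.
tree = Node("A", [
    Node("B"),
    Node("C", [Node("E"), Node("F"), Node("G"), Node("H")]),
    Node("D"),
])
print(arity(tree))  # 4
```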

---

#### 2. **Internal Node**

* **Definition**: A node that has **at least one child**.
* **Property**:

  * A node is either internal or a leaf, never both.
  * Root can be an internal node if it has children.

---

#### 3. **Leaf Node**

* **Definition**: A node with **no children**.
* **Property**:

  * Can appear at any level; only in a perfect tree do all leaves sit at the bottom-most level.
  * All children of internal nodes are either internal or leaf.

---

#### 4. **Height of Tree**

* **Definition**: Number of edges on the longest path from **root** to a **leaf**.
* **Alternate Definition**: In terms of levels, height = (number of levels) - 1 when counting edges. Some texts count nodes on the path instead, in which case height = number of levels.
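The edge-counting definition can be sketched recursively: a leaf has height 0, and any other node is one edge taller than its tallest child. The `Node` class and sample tree are assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical tree node used only for this illustration."""
    label: str
    children: list = field(default_factory=list)


def height(node: Node) -> int:
    """Number of edges on the longest downward path from node to a leaf."""
    if not node.children:
        return 0
    return 1 + max(height(child) for child in node.children)


# Longest path is A -> B -> D -> E, which has 3 edges and spans 4 levels.
tree = Node("A", [Node("B", [Node("D", [Node("E")])]), Node("C")])
edges = height(tree)
levels = edges + 1
print(edges, levels)  # 3 4
```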

---

#### 5. **Total Number of Nodes (n)**

* Includes root + internal + leaf nodes.
* **Formula** (for full m-ary tree):
  If tree has `i` internal nodes and arity `m`, then:

  $$
  n = mi + 1
  $$
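
The formula can be justified by counting parent-child links. A short derivation, using only the symbols `n`, `m`, and `i` defined above:

$$
n - 1 = m \cdot i \quad\Longrightarrow\quad n = mi + 1
$$

Every node except the root is the child of exactly one internal node (left-hand side), and each of the `i` internal nodes has exactly `m` children (right-hand side). For instance, with `m = 3` and `i = 4`, the tree has `n = 3 · 4 + 1 = 13` nodes.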

---

#### 6. **Number of Leaf Nodes (l)**

* In a full m-ary tree:

  $$
  l = (m - 1)i + 1
  $$
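
The node and leaf counts of a full m-ary tree can be checked empirically. The sketch below builds such a tree by repeatedly giving a leaf exactly `m` children; the function name `build_full_tree` and the integer node numbering are assumptions made for this example.

```python
from collections import deque


def build_full_tree(m: int, internal: int) -> dict:
    """Build a full m-ary tree with `internal` internal nodes.

    Nodes are numbered from 0 (the root). Leaves are expanded in
    breadth-first order, and each expansion adds exactly m children.
    Returns a mapping: node id -> list of child ids.
    """
    children = {0: []}
    frontier = deque([0])
    for _step in range(internal):
        node = frontier.popleft()
        for _child in range(m):
            new_id = len(children)
            children[new_id] = []
            children[node].append(new_id)
            frontier.append(new_id)
    return children


m, i = 3, 4
tree = build_full_tree(m, i)
n = len(tree)
leaves = sum(1 for kids in tree.values() if not kids)
print(n, leaves)  # 13 9
```

With `m = 3` and `i = 4`, the counts agree with `n = mi + 1 = 13` and `l = (m - 1)i + 1 = 9`.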

---

#### 7. **Full Tree vs Complete Tree**

* **Full Tree**: Every node has 0 or m children.
* **Complete Tree**: All levels except the last are full; the last level is filled left to right.

---

#### 8. **Perfect Tree**

* All internal nodes have exactly m children and all leaves are at the same level.
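
The difference between a full and a perfect tree can be sketched as two small checks. The `Node` class and the helper names (`is_full`, `leaf_depths`, `is_perfect`) are hypothetical and chosen for this example only.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical tree node: only the child list matters here."""
    children: list = field(default_factory=list)


def is_full(node: Node, m: int) -> bool:
    """True if every node has either 0 or exactly m children."""
    if len(node.children) not in (0, m):
        return False
    return all(is_full(child, m) for child in node.children)


def leaf_depths(node: Node, depth: int = 0) -> set:
    """Set of depths (in edges from the root) at which leaves occur."""
    if not node.children:
        return {depth}
    return set().union(*(leaf_depths(c, depth + 1) for c in node.children))


def is_perfect(node: Node, m: int) -> bool:
    """True if the tree is full and all leaves are at the same level."""
    return is_full(node, m) and len(leaf_depths(node)) == 1


perfect = Node([Node(), Node()])
full_only = Node([Node([Node(), Node()]), Node()])
print(is_full(perfect, 2), is_perfect(perfect, 2))      # True True
print(is_full(full_only, 2), is_perfect(full_only, 2))  # True False
```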

---

#### 📌 Example Breakdown from Given Image:

* **Arity** = 4
  ⇒ Max 4 children per node
* **Height** = 3
  ⇒ Longest path from root to deepest leaf = 3 edges
* **Leaf Nodes** = All nodes at the bottom with no children
* **Internal Nodes** = All non-leaf nodes with ≥1 child

---

# DBMS: Storage and File Structure

## Module 39: Physical Storage

### Learning Objectives

* Understand various **physical storage media** for high-volume, fast, reliable, and cost-effective database storage.
* Learn about **tertiary storage** options for large-volume, low-cost backup.

---

## 1. Introduction to Physical Storage in DBMS

In-memory data structures are important for query and update operations. However, persistent data storage is essential for:

* Large volumes of data
* Archival and future access

### Key Considerations for Physical Storage:

1. **Speed** – Fast access is essential for performance.
2. **Cost per Unit** – Should be low due to data volume.
3. **Reliability/Persistency** – Must survive power failure, crashes, or device failure.

---

## 2. Types of Storage

### 2.1 Volatile Storage

* **CPU Registers**: Few, very fast
* **Cache**:

  * Fastest memory (excluding registers)
  * Small and expensive
* **Main Memory (RAM)**:

  * Fast (nanoseconds)
  * Volatile
  * Too small/expensive for full DB storage

### 2.2 Non-Volatile Storage (Persistent)

#### Flash Memory

* Non-volatile: data survives power failure
* A location can be written only once after each erase; it must be erased (a whole block at a time) before it can be rewritten
* Limited erase-write cycles (\~10,000 to 1M)

#### Magnetic Disks

* **Primary medium for DB storage**
* Uses spinning platters and magnetic heads
* **Random/Direct Access** possible (like arrays)
* Capacity: roughly 16–32 TB per drive at the high end; has historically doubled every 2–3 years
* Data survives power failures and system crashes; a disk failure can destroy data, but such failures are rare

#### Optical Storage

* CD-ROM, DVD, Blu-ray
* Mostly write-once, read-many (WORM); rewritable variants also exist
* Ideal for archival (not live DB storage)

#### Magnetic Tapes

* **Sequential access** (slow)
* **Very large capacity** (hundreds of TB or PB)
* **Very cheap**
* Ideal for **archival/tertiary storage**

---

## 3. Storage Hierarchy Overview

| Type          | Speed     | Cost     | Capacity   | Volatility                      |
| ------------- | --------- | -------- | ---------- | ------------------------------- |
| Cache         | Fastest   | Highest  | Small      | Volatile                        |
| Main Memory   | Fast      | High     | Small      | Volatile                        |
| Flash Memory  | Moderate  | Medium   | Moderate   | Non-Volatile (limited lifespan) |
| Magnetic Disk | Slower    | Low      | Large      | Non-Volatile                    |
| Optical Disk  | Very Slow | Very Low | Medium     | Non-Volatile                    |
| Magnetic Tape | Very Slow | Lowest   | Very Large | Non-Volatile                    |

* **Primary Storage**: Cache, Main Memory (fast, volatile)
* **Secondary Storage**: Flash Memory, Magnetic Disks (online storage for the live database)
* **Tertiary Storage**: Optical Disks, Magnetic Tapes (archival)

---

## 4. Magnetic Disk Structure

### Components:

* **Platters**: Disks stacked vertically on a spindle
* **Read/Write Heads**: One for each surface
* **Arm Assembly**: All heads move together

### Terminologies:

* **Track**: Concentric circles on a platter
* **Sector**: Segment of a track (e.g., 512 bytes)
* **Block/Cluster**: Group of contiguous sectors
* **Cylinder**: Collection of corresponding tracks (across platters)
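
As a rough sketch, these terms combine into a capacity estimate: surfaces × tracks per surface × sectors per track × bytes per sector. The geometry below is made up for illustration, and the function name is hypothetical. Real drives typically place more sectors on outer tracks, so `sectors_per_track` should be read as an average.

```python
def disk_capacity_bytes(platters: int,
                        tracks_per_surface: int,
                        sectors_per_track: int,
                        bytes_per_sector: int = 512,
                        surfaces_per_platter: int = 2) -> int:
    """Estimate raw capacity from a simplified, uniform disk geometry."""
    surfaces = platters * surfaces_per_platter
    return surfaces * tracks_per_surface * sectors_per_track * bytes_per_sector


# Hypothetical drive: 4 platters, 65,536 tracks per surface, 1,024 sectors per track.
capacity = disk_capacity_bytes(platters=4,
                               tracks_per_surface=65_536,
                               sectors_per_track=1_024)
print(capacity // 2**30, "GiB")  # 256 GiB

# A block made of 8 contiguous 512-byte sectors.
block_bytes = 8 * 512
print(block_bytes)  # 4096

# Each cylinder groups one track from every surface.
tracks_per_cylinder = 4 * 2
print(tracks_per_cylinder)  # 8
```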

### Disk Controller:

* Manages disk operations (seek, read, write)
* Handles **checksums** to detect errors
* Maps **bad sectors** to usable ones
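
The checksum and remapping duties can be illustrated with a toy model. This is only a sketch under stated assumptions: the `ToyController` class is hypothetical, CRC-32 from Python's `zlib` stands in for whatever error-detecting code a real controller uses, and sectors are modeled as dictionary entries.

```python
import zlib


class ToyController:
    """Toy model of a disk controller (hypothetical, for illustration only)."""

    def __init__(self, num_sectors: int, num_spares: int) -> None:
        self.store = {}  # physical sector number -> (data, checksum)
        self.remap = {}  # logical sector number -> spare physical sector
        # Spare sectors sit just past the normally addressable range.
        self.spares = list(range(num_sectors, num_sectors + num_spares))

    def _physical(self, logical: int) -> int:
        """Translate a logical sector number, following any remapping."""
        return self.remap.get(logical, logical)

    def write(self, logical: int, data: bytes) -> None:
        """Store the data together with a checksum computed at write time."""
        self.store[self._physical(logical)] = (data, zlib.crc32(data))

    def read(self, logical: int) -> bytes:
        """Recompute the checksum on read and reject corrupted data."""
        data, checksum = self.store[self._physical(logical)]
        if zlib.crc32(data) != checksum:
            raise IOError(f"checksum mismatch in logical sector {logical}")
        return data

    def mark_bad(self, logical: int) -> None:
        """Redirect a bad logical sector to the next free spare sector."""
        self.remap[logical] = self.spares.pop(0)


ctrl = ToyController(num_sectors=100, num_spares=4)
ctrl.write(7, b"payload")
print(ctrl.read(7))  # b'payload'

# Simulate corruption: flip one bit without updating the stored checksum.
data, checksum = ctrl.store[7]
ctrl.store[7] = (bytes([data[0] ^ 0x01]) + data[1:], checksum)
try:
    ctrl.read(7)
except IOError as err:
    print("detected:", err)

# Remap the bad sector to a spare and rewrite the data.
ctrl.mark_bad(7)
ctrl.write(7, b"payload")
print(ctrl.read(7))  # b'payload'
```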

---

## 5. Disk Performance Metrics

### 1. **Access Time**

* Time from request to start of data transfer
* Components:

  * **Seek Time**: Time to position head over correct track (avg \~4-11 ms)
  * **Rotational Latency**: Time for the desired sector to rotate under the head (on average, half a rotation)
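
A small worked example of how the two components add up. The spindle speed (7,200 RPM) and the 8 ms average seek time are illustrative assumptions, and the function names are hypothetical.

```python
def avg_rotational_latency_ms(rpm: float) -> float:
    """Average rotational latency: half of one full rotation, in milliseconds."""
    ms_per_rotation = 60_000 / rpm
    return 0.5 * ms_per_rotation


def avg_access_time_ms(avg_seek_ms: float, rpm: float) -> float:
    """Average access time = average seek time + average rotational latency."""
    return avg_seek_ms + avg_rotational_latency_ms(rpm)


# Assumed drive: 7,200 RPM with an 8 ms average seek time.
print(f"{avg_rotational_latency_ms(7200):.2f} ms")  # 4.17 ms
print(f"{avg_access_time_ms(8.0, 7200):.2f} ms")    # 12.17 ms
```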

### 2. **Data Transfer Rate**

* \~25–100 MB/s (depends on track position and controller)
* Outer tracks transfer faster due to more data per track

### 3. **Reliability**

* **MTTF (Mean Time To Failure)**:

  * Average time a disk is expected to run continuously before failing
  * New disk: \~1.2 million hours (statistically, among 1,000 new disks, one fails every 1,200 hours on average)
  * Decreases as disks age
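
The MTTF figure is a population statistic, which a short calculation makes concrete. This sketch assumes independent failures at a constant rate, which holds only approximately for new disks; the function name is hypothetical.

```python
def hours_between_failures(mttf_hours: float, num_disks: int) -> float:
    """Expected time between failures somewhere in a population of disks."""
    return mttf_hours / num_disks


mttf = 1_200_000  # hours, the figure quoted for a new disk
gap = hours_between_failures(mttf, 1000)
print(gap)       # 1200.0 hours between failures among 1,000 disks
print(gap / 24)  # 50.0 days

# Read literally as a single-disk lifetime, the MTTF would be about 137 years,
# which is why it is interpreted statistically across many disks instead.
print(f"{mttf / (24 * 365):.0f} years")  # 137 years
```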

---

## 6. Tertiary Storage: Magnetic Tapes

* Linear, sequential access
* High transfer rate
* Extremely cheap, large capacity (TBs to PBs)
* Ideal for backup, not live usage
* Formats: DAT, DDS, etc.

---

## 7. Modern Alternatives: Cloud Storage

### Features:

* **Low upfront cost**
* **Highly reliable** (RAID, error recovery, backups)
* **Accessible** from anywhere
* **Managed backups**

### Examples:

* Google Drive, OneDrive, Amazon Drive, Dropbox

### Suitable For:

* Small-scale or personal DB applications
* Slower to access than local storage (network latency), but cost-effective

---

## 8. Other Storage Media

### 1. Flash Drives (USB, Pen Drives)

* Fast, non-volatile
* No moving parts
* Limited erase cycles

### 2. SD Cards (Secure Digital)

* Used in mobile devices
* Based on NAND flash
* Non-volatile

### 3. Solid State Drives (SSD)

* Flash-based, no moving parts
* Very fast access
* Larger capacity than main memory
* More expensive per GB than HDDs

| Feature     | HDD                      | SSD                      |
| ----------- | ------------------------ | ------------------------ |
| Speed       | Slower                   | Much faster              |
| Reliability | Lower (mechanical parts) | Higher (no moving parts) |
| Cost        | Lower                    | Higher                   |
| Power Usage | More                     | Less                     |
| Noise       | Noisy                    | Silent                   |

---

## 9. Emerging Storage Technologies

### DNA-Based Storage

* Uses nucleotide strands for storage (A, T, G, C)
* Ultra-high density (theoretical estimates are on the order of an exabyte per cubic millimeter)
* Still expensive and slow

### Quantum Storage

* Uses **qubits** instead of bits
* Early-stage development
* Potentially revolutionary in speed and capacity

---

## Summary

* DBMS needs both in-memory structures and persistent storage.
* Physical storage varies by **speed**, **cost**, **capacity**, and **reliability**.
* Magnetic disks are the primary DB storage medium.
* Tertiary storage (tape, optical) is used for backups.
* SSDs and Flash are newer, faster alternatives.
* Cloud storage and emerging tech (DNA, Quantum) are promising future trends.