# Chapter 15: Sorting Algorithms

## Concept: Importance of Sorting and Stability

### Why Sorting is Important:
Sorting organizes data to enable efficient searching, analysis, and decision-making. Applications include:
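- **Databases**: Sorted indexes and sort-based joins speed up queries, and `ORDER BY` results must be sorted.
- **Scheduling**: Ordering tasks or events by priority, deadline, or start time.
- **Data Visualization**: Ranking and ordering values so that trends are easy to see.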
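
As a small illustration of the "efficient searching" point, the sketch below uses Python's standard `bisect` module to look up values in a sorted list with binary search. The helper name `contains` is made up for this example; the lookup is only valid if the list is already sorted.

```python
import bisect

def contains(sorted_items, target):
    """Binary search: O(log n) per lookup, valid only if sorted_items is sorted."""
    i = bisect.bisect_left(sorted_items, target)
    return i < len(sorted_items) and sorted_items[i] == target

data = [64, 34, 25, 12, 22, 11, 90]
ordered = sorted(data)          # sort once, then search many times

print(contains(ordered, 25))    # True
print(contains(ordered, 26))    # False
```

Sorting costs O(n log n) once, after which each lookup takes O(log n) instead of the O(n) needed to scan an unsorted list.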

### Stability in Sorting:
A sorting algorithm is **stable** if it preserves the relative order of elements with equal values.

For example, a stable sort of `[2a, 1, 2b]` always yields `[1, 2a, 2b]`, so `2a` and `2b` retain their original order. An unstable sort may instead yield `[1, 2b, 2a]`.
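
The difference can be observed by sorting `(key, label)` pairs and comparing on the key only. The following is a minimal sketch, not a definitive implementation: the function names and the sample `records` are chosen here for illustration, and the input is ordered so that selection sort's swap visibly reorders the two equal keys.

```python
def insertion_sort_by_key(items):
    """Stable: an element only moves past strictly larger keys."""
    out = list(items)
    for i in range(1, len(out)):
        cur = out[i]
        j = i - 1
        while j >= 0 and out[j][0] > cur[0]:
            out[j + 1] = out[j]
            j -= 1
        out[j + 1] = cur
    return out

def selection_sort_by_key(items):
    """Not stable: the swap can carry an element past an equal key."""
    out = list(items)
    for i in range(len(out)):
        m = i
        for j in range(i + 1, len(out)):
            if out[j][0] < out[m][0]:
                m = j
        out[i], out[m] = out[m], out[i]
    return out

# Each record is (key, label); the labels tell equal keys apart.
records = [(2, "a"), (2, "b"), (1, "c")]

print(insertion_sort_by_key(records))  # [(1, 'c'), (2, 'a'), (2, 'b')]
print(selection_sort_by_key(records))  # [(1, 'c'), (2, 'b'), (2, 'a')]
```

Both results are correctly ordered by key; only the stable version keeps `(2, "a")` ahead of `(2, "b")`.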


### Visual Representation: Sorting Algorithms

The diagram below summarizes the time and space complexities of common sorting algorithms:

![Sorting Algorithms Overview](https://upload.wikimedia.org/wikipedia/commons/8/8c/Sorting_algorithm_complexity.png)

## Implementation: Sorting Algorithms

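We will implement six popular sorting algorithms: Bubble Sort, Selection Sort, Insertion Sort, Merge Sort, Quick Sort, and Heap Sort. Each function returns the sorted list; `quick_sort` builds a new list, while the other five sort `arr` in place.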

# Bubble Sort: repeatedly swap adjacent out-of-order pairs
# (in place, stable, O(n²))
def bubble_sort(arr):
    n = len(arr)
    for i in range(n):
        for j in range(0, n-i-1):
            if arr[j] > arr[j+1]:
                arr[j], arr[j+1] = arr[j+1], arr[j]
    return arr

# Selection Sort: swap the minimum of the unsorted suffix into position i
# (in place, not stable, O(n²))
def selection_sort(arr):
    n = len(arr)
    for i in range(n):
        min_idx = i
        for j in range(i+1, n):
            if arr[j] < arr[min_idx]:
                min_idx = j
        arr[i], arr[min_idx] = arr[min_idx], arr[i]
    return arr

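# Insertion Sort: insert each element into the sorted prefix to its left
# (in place, stable, O(n²) worst case, O(n) on already sorted input)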
def insertion_sort(arr):
    for i in range(1, len(arr)):
        key = arr[i]
        j = i-1
        while j >= 0 and key < arr[j]:
            arr[j+1] = arr[j]
            j -= 1
        arr[j+1] = key
    return arr

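# Merge Sort: split in half, sort each half recursively, then merge
# (stable, O(n log n), O(n) extra space for the copies L and R)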
def merge_sort(arr):
    if len(arr) > 1:
        mid = len(arr) // 2
        L = arr[:mid]
        R = arr[mid:]

        merge_sort(L)
        merge_sort(R)

        i = j = k = 0
        while i < len(L) and j < len(R):
            if L[i] <= R[j]:
                arr[k] = L[i]
                i += 1
            else:
                arr[k] = R[j]
                j += 1
            k += 1

        while i < len(L):
            arr[k] = L[i]
            i += 1
            k += 1

        while j < len(R):
            arr[k] = R[j]
            j += 1
            k += 1
    return arr

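# Quick Sort: partition around a pivot, then sort each side recursively
# (O(n log n) on average, O(n²) worst case; this version returns a new list
# instead of sorting in place)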
def quick_sort(arr):
    if len(arr) <= 1:
        return arr
    pivot = arr[len(arr) // 2]
    left = [x for x in arr if x < pivot]
    middle = [x for x in arr if x == pivot]
    right = [x for x in arr if x > pivot]
    return quick_sort(left) + middle + quick_sort(right)

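# Heap Sort: build a max-heap, then repeatedly move the maximum to the end
# (in place, not stable, O(n log n))
def heapify(arr, n, i):
    # Sift arr[i] down until the subtree rooted at i is a max-heap of size n.
    largest = i
    left = 2 * i + 1
    right = 2 * i + 2

    if left < n and arr[left] > arr[largest]:
        largest = left
    if right < n and arr[right] > arr[largest]:
        largest = right
    if largest != i:
        arr[i], arr[largest] = arr[largest], arr[i]
        heapify(arr, n, largest)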

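def heap_sort(arr):
    n = len(arr)
    # Build a max-heap, starting from the last non-leaf node.
    for i in range(n // 2 - 1, -1, -1):
        heapify(arr, n, i)
    # Move the current maximum to position i, then restore the heap on arr[:i].
    for i in range(n-1, 0, -1):
        arr[i], arr[0] = arr[0], arr[i]
        heapify(arr, i, 0)
    return arr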

# Example Usage
unsorted = [64, 34, 25, 12, 22, 11, 90]
print("Bubble Sort:", bubble_sort(unsorted[:]))
print("Selection Sort:", selection_sort(unsorted[:]))
print("Insertion Sort:", insertion_sort(unsorted[:]))
print("Merge Sort:", merge_sort(unsorted[:]))
print("Quick Sort:", quick_sort(unsorted[:]))
print("Heap Sort:", heap_sort(unsorted[:]))
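
A single example list says little about correctness. One possible way to gain more confidence is to compare each function against Python's built-in `sorted` on fixed edge cases and random inputs. The helper `check_sort` and its parameters are assumptions made for this sketch; it accepts any function that takes a list and returns the sorted list, such as `bubble_sort` or `quick_sort` from this chapter.

```python
import random

def check_sort(sort_fn, trials=200, max_len=20, seed=0):
    """Return True if sort_fn agrees with sorted() on fixed and random inputs."""
    rng = random.Random(seed)
    # Fixed edge cases: empty, single element, duplicates, sorted, reversed.
    cases = [[], [1], [2, 1], [3, 1, 2], [5, 5, 5], [1, 2, 3], [3, 2, 1]]
    for _ in range(trials):
        n = rng.randint(0, max_len)
        cases.append([rng.randint(-50, 50) for _ in range(n)])
    for case in cases:
        # Pass a copy, because most of the chapter's functions sort in place.
        if sort_fn(case[:]) != sorted(case):
            return False
    return True

print(check_sort(sorted))         # True: the built-in agrees with itself
print(check_sort(lambda a: a))    # False: returning the input unchanged fails
```

With the chapter's functions defined, a call such as `check_sort(heap_sort)` would be expected to return `True`.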


## Quiz

1. Which sorting algorithm uses divide and conquer to split the array?
   - A. Bubble Sort
   - B. Merge Sort
   - C. Selection Sort

2. Which sorting algorithm is not stable by default?
   - A. Quick Sort
   - B. Bubble Sort
   - C. Merge Sort

3. What is the average time complexity of Quick Sort?
   - A. O(n)
   - B. O(n log n)
   - C. O(n²)

### Answers:
1. B. Merge Sort
2. A. Quick Sort (the usual in-place partitioning can swap an element past another with an equal value)
3. B. O(n log n)
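
Answer 2 refers to the in-place formulation of Quick Sort. The chapter's `quick_sort` builds new lists and happens to keep equal values in their original order, so the sketch below uses a different variant to show the effect: an in-place Quick Sort with Lomuto partitioning and the last element as pivot. Both choices, and the names `partition` and `quick_sort_in_place`, are assumptions made for illustration.

```python
def partition(a, lo, hi):
    """Lomuto partition on the key a[k][0], using a[hi] as the pivot."""
    pivot = a[hi][0]
    i = lo
    for j in range(lo, hi):
        if a[j][0] < pivot:
            a[i], a[j] = a[j], a[i]
            i += 1
    # Placing the pivot can move it in front of an element with an equal key.
    a[i], a[hi] = a[hi], a[i]
    return i

def quick_sort_in_place(a, lo=0, hi=None):
    if hi is None:
        hi = len(a) - 1
    if lo < hi:
        p = partition(a, lo, hi)
        quick_sort_in_place(a, lo, p - 1)
        quick_sort_in_place(a, p + 1, hi)
    return a

# (key, label) pairs mirroring the [2a, 1, 2b] example.
records = [(2, "a"), (1, ""), (2, "b")]
print(quick_sort_in_place(records[:]))
# [(1, ''), (2, 'b'), (2, 'a')]
```

The keys come out in order, but `2b` now precedes `2a`, which is exactly what stability rules out.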


## Exercise: Identify the Best Sorting Algorithm

### Problem Statement
Given the following datasets, identify the best sorting algorithm to use and justify your choice:

1. A dataset with 10 elements and many duplicates.
2. A large dataset (1,000,000 elements) that requires stable sorting.
3. A dataset with 10,000 elements that is nearly sorted.

### Solution:
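No single algorithm is best in every case. Judged by time complexity, stability, and dataset characteristics, reasonable choices are:

1. **Insertion Sort.** With only 10 elements the O(n²) cost is negligible, the algorithm has very little overhead, and it is stable, so duplicates keep their original order.
2. **Merge Sort.** It is stable and runs in O(n log n) even in the worst case, at the cost of O(n) extra memory. Quick Sort and Heap Sort are not stable in their usual in-place forms.
3. **Insertion Sort.** On nearly sorted input each element moves only a short distance, so the running time approaches O(n).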
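
For the nearly sorted case, one way to see how much the input order matters to Insertion Sort is to count how many elements it shifts. This is a rough sketch under stated assumptions: the function name `insertion_sort_shifts`, the input size of 1000, and the two test inputs are all chosen here for illustration.

```python
def insertion_sort_shifts(arr):
    """Sort a copy of arr with insertion sort; return (sorted_list, shift_count)."""
    out = list(arr)
    shifts = 0
    for i in range(1, len(out)):
        key = out[i]
        j = i - 1
        while j >= 0 and key < out[j]:
            out[j + 1] = out[j]
            shifts += 1
            j -= 1
        out[j + 1] = key
    return out, shifts

n = 1000
# Sorted input with one adjacent pair swapped.
nearly_sorted = list(range(n))
nearly_sorted[10], nearly_sorted[11] = nearly_sorted[11], nearly_sorted[10]
# Fully reversed input: the worst case.
reversed_input = list(range(n, 0, -1))

print(insertion_sort_shifts(nearly_sorted)[1])   # 1
print(insertion_sort_shifts(reversed_input)[1])  # 499500
```

The shift count equals the number of out-of-order pairs in the input, so it is 1 for the nearly sorted list and n(n-1)/2 for the reversed one.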
