# Heaps

Heaps are useful in optimizing the insert, find min and delete min operations

Heaps can be implemented as a binary tree

Binary heaps are complete binary trees: each node has at most 2 child nodes, every level except possibly the last is full, and the nodes of the last level are as far left as possible

# Array Representation of a heap

A heap can be represented as an array (0-based): if a parent is at index i, its left child is at index 2*i+1 and its right child is at index 2*i+2

To get the parent of the node at index i, use floor((i-1)/2)

A complete binary tree is required because any other shape would leave gaps in the array
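A minimal sketch of the index arithmetic, assuming 0-based indexing. The helper names `parent`, `left` and `right` are illustrative and not part of any library.

```python
def parent(i):
    # Parent index of node i; only meaningful for i > 0
    return (i - 1) // 2

def left(i):
    # Index of the left child of node i
    return 2 * i + 1

def right(i):
    # Index of the right child of node i
    return 2 * i + 2

heap = [12, 11, 10, 9, 5, 1, 3, 2]  # level-order layout of a max heap
i = 1
print(heap[i], heap[left(i)], heap[right(i)], heap[parent(i)])  # 11 9 5 12
```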

# Properties of Complete Binary Tree

<strong>Height of a node</strong> = Number of edges on the longest path from that node down to a leaf

<strong>Height of a Tree</strong> = Height of the root node

<strong>Given height of a tree, max number of nodes </strong> = 2<sup>h+1</sup>-1

<strong>Given n nodes, minimum height of a tree</strong> = floor(log<sub>2</sub>n)
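The two formulas above can be checked with a few lines of Python. This is only a sketch; `max_nodes` and `min_height` are hypothetical helper names, and `min_height` uses `int.bit_length()` to compute floor(log<sub>2</sub>n) without floating point.

```python
def max_nodes(height):
    # A binary tree of height h has at most 2^(h+1) - 1 nodes
    return 2 ** (height + 1) - 1

def min_height(n):
    # floor(log2(n)) for n >= 1, using integer arithmetic only
    return n.bit_length() - 1

for n in (1, 2, 7, 8, 13):
    print(n, min_height(n))
```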

# Build a heap

One way is to sort the array in descending order (for a max heap), which takes O(nlogn) time

Another property of a complete binary tree is that the leaves occupy indices floor(n/2) to n-1 (0-based, with n being the number of nodes). We are interested in the leaves because every leaf is a tree with 1 node, and a tree with 1 node already satisfies both the min heap and the max heap property

Since the leaves are already heaps, we start at the largest non-leaf index, floor(n/2)-1, and heapify every node from there back to the root
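A quick sketch to confirm where the leaves sit in the array, assuming 0-based indexing. `leaf_indices` is a hypothetical helper written only for this check.

```python
def leaf_indices(n):
    # Node i is a leaf when its left child index 2*i + 1 falls outside the array
    return [i for i in range(n) if 2 * i + 1 >= n]

print(leaf_indices(8))  # [4, 5, 6, 7]
```

So for n = 8 the last non-leaf index is 3, which is where the bottom-up build starts.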

In [3]:
#Building a maxheap using the array
def maxHeapify(arr,i):
    # Sift arr[i] down until the subtree rooted at index i is a max heap
    heapsize=len(arr)
    l=2*i+1
    r=2*i+2
    # Pick the largest among the node and its two children
    if l<heapsize and arr[i]<arr[l]:
        largest=l
    else:
        largest=i
    if r<heapsize and arr[r]>arr[largest]:
        largest=r
    if largest!=i:
        arr[i],arr[largest]=arr[largest],arr[i]
        maxHeapify(arr,largest)

def buildHeap(arr,n):
    # Leaves are already heaps, so start from the last non-leaf index n//2-1
    for i in range(n//2-1,-1,-1):
        maxHeapify(arr,i)

if __name__ == '__main__':
    arr=[9,5,10,11,12,1,3,2]
    buildHeap(arr,len(arr))
    print(arr)


[12, 11, 10, 9, 5, 1, 3, 2]


Space Complexity -> O(logn) for the recursion stack: maxHeapify makes at most one recursive call per level, and a subtree of n nodes rooted at i has about logn levels. So the space used depends on the node at which maxHeapify is called
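The recursion is not essential. As a sketch, the same sift-down can be written as a loop, which brings the extra space down to O(1). The name `max_heapify_iterative` and its optional `heapsize` parameter are illustrative choices, not part of the cell above.

```python
def max_heapify_iterative(arr, i, heapsize=None):
    # Same logic as the recursive version, with the tail call turned into a loop
    if heapsize is None:
        heapsize = len(arr)
    while True:
        l, r = 2 * i + 1, 2 * i + 2
        largest = i
        if l < heapsize and arr[l] > arr[largest]:
            largest = l
        if r < heapsize and arr[r] > arr[largest]:
            largest = r
        if largest == i:
            return  # heap property holds at i, nothing left to fix
        arr[i], arr[largest] = arr[largest], arr[i]
        i = largest  # continue sifting down from the child we swapped with

arr = [9, 5, 10, 11, 12, 1, 3, 2]
for i in range(len(arr) // 2 - 1, -1, -1):
    max_heapify_iterative(arr, i)
print(arr)  # [12, 11, 10, 9, 5, 1, 3, 2]
```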

Time Complexity -> Building a heap takes O(n) time overall, and a single heapify call takes O(logn) time
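A sketch of the usual argument for the O(n) bound, assuming a complete tree with n nodes has at most ceil(n/2^(h+1)) nodes at height h and that heapifying a node at height h costs O(h):

```latex
\sum_{h=0}^{\lfloor \log_2 n \rfloor} \left\lceil \frac{n}{2^{h+1}} \right\rceil O(h)
= O\!\left( n \sum_{h=0}^{\infty} \frac{h}{2^{h}} \right)
= O(n), \qquad \text{since } \sum_{h=0}^{\infty} \frac{h}{2^{h}} = 2 .
```

Most nodes are near the bottom of the tree, where heapify is cheap, which is why the total is linear rather than O(nlogn).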

<pre>
Heaps can be of two types:

<strong>Max-Heap:</strong> In a Max-Heap the key present at the root node must be the greatest among the keys present at all of its children. The same property must be recursively true for all sub-trees in that Binary Tree.

<strong>Min-Heap:</strong> In a Min-Heap the key present at the root node must be the minimum among the keys present at all of its children. The same property must be recursively true for all sub-trees in that Binary Tree.
</pre>
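On the array representation, both properties reduce to comparing every node with its parent. A minimal sketch, with `is_max_heap` and `is_min_heap` as hypothetical helper names:

```python
def is_max_heap(arr):
    # Every node must be <= its parent at index (i - 1) // 2
    return all(arr[(i - 1) // 2] >= arr[i] for i in range(1, len(arr)))

def is_min_heap(arr):
    # Every node must be >= its parent at index (i - 1) // 2
    return all(arr[(i - 1) // 2] <= arr[i] for i in range(1, len(arr)))

print(is_max_heap([12, 11, 10, 9, 5, 1, 3, 2]))  # True
print(is_max_heap([9, 5, 10, 11, 12, 1, 3, 2]))  # False
print(is_min_heap([0, 1, 10, 9, 5]))             # True
```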

The traversal method used to obtain the array representation is level order

# Applications of Heaps:

Heap Sort-> Uses binary heap to sort the array in O(nlogn) time

Priority Queue: Priority queues can be efficiently implemented using a Binary Heap because it supports insert(), delete(), extractMax() and decreaseKey() operations in O(logn) time. Binomial Heap and Fibonacci Heap are variations of Binary Heap that also perform union efficiently.

Graph Algorithms: The priority queues are especially used in Graph Algorithms like Dijkstra’s Shortest Path and Prim’s Minimum Spanning Tree.
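A minimal sketch of Dijkstra's algorithm with `heapq` as the priority queue. The adjacency-list format (`graph[u]` is a list of `(neighbour, weight)` pairs) and the sample graph are assumptions made for illustration. `heapq` has no decreaseKey(), so this version pushes a new entry and skips stale ones when they are popped.

```python
import heapq

def dijkstra(graph, source):
    # dist holds the best known distance to each reached vertex
    dist = {source: 0}
    pq = [(0, source)]
    while pq:
        d, u = heapq.heappop(pq)
        if d > dist.get(u, float("inf")):
            continue  # stale entry: a shorter path to u was already found
        for v, w in graph.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(pq, (nd, v))
    return dist

graph = {
    "A": [("B", 4), ("C", 1)],
    "B": [("D", 1)],
    "C": [("B", 2), ("D", 5)],
    "D": [],
}
print(dijkstra(graph, "A"))
```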

Order Statistics: A heap can be used to efficiently find the kth smallest (or largest) element in an array.

# Operations on Min Heap:

getMin(): It returns the root element of Min Heap. Time Complexity of this operation is O(1).

extractMin(): Removes the minimum element from MinHeap. Time Complexity of this operation is O(Logn), as it needs to restore the heap property (by calling heapify()) after removing the root.

decreaseKey(): Decreases the value of a key. The time complexity of this operation is O(Logn). If the decreased key of a node is still greater than or equal to the key of its parent, we don’t need to do anything. Otherwise, we need to traverse up to fix the violated heap property.

insert(): Inserting a new key takes O(Logn) time. We add the new key at the end of the tree. If the new key is greater than or equal to its parent, we don’t need to do anything. Otherwise, we need to traverse up to fix the violated heap property.

delete(): Deleting a key also takes O(Logn) time. We replace the key to be deleted with minus infinity by calling decreaseKey(). After decreaseKey(), the minus infinity value must reach the root, so we call extractMin() to remove the key.

In [5]:
from heapq import heapify,heappop,heappush

class MinHeap:
    def __init__(self):
        self.heap=[]
        heapify(self.heap)

    def getMin(self):
        return self.heap[0]

    def insertKey(self,item):
        heappush(self.heap,item)

    def parent(self,i):
        return (i-1)//2

    def extractMin(self):
        return heappop(self.heap)

    def decreaseKey(self,i,newVal):
        # Assumes newVal is not larger than the current key at index i
        self.heap[i]=newVal
        # Sift the decreased key up while it is smaller than its parent
        while (i!=0 and self.heap[self.parent(i)]>self.heap[i]):
            parent=self.parent(i)
            self.heap[i],self.heap[parent]=self.heap[parent],self.heap[i]
            i=parent

    def deleteKey(self,i):
        # Move the key to the root by making it -infinity, then pop it
        self.decreaseKey(i,float('-inf'))
        self.extractMin()

if __name__ == '__main__':
    heapObj=MinHeap()
    heapObj.insertKey(1)
    heapObj.insertKey(9)
    heapObj.insertKey(10)
    heapObj.insertKey(5)
    heapObj.insertKey(200)
    # print(heapObj.heap)
    heapObj.decreaseKey(4,0)
    # heapObj.deleteKey(2)
    #print(heapObj.getMin())
    #print(heapObj.heap)
    print(heapObj.heap)


[0, 1, 10, 9, 5]
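The class above delegates insert and extractMin to `heapq`. As a sketch of what happens underneath, here is a from-scratch version of the "traverse up" and heapify steps described under the operations. `ArrayMinHeap` and its method names are hypothetical and chosen only for this example.

```python
class ArrayMinHeap:
    def __init__(self):
        self.heap = []

    def _sift_up(self, i):
        # Move heap[i] up while it is smaller than its parent
        while i > 0:
            p = (i - 1) // 2
            if self.heap[p] <= self.heap[i]:
                break
            self.heap[p], self.heap[i] = self.heap[i], self.heap[p]
            i = p

    def _sift_down(self, i):
        # Move heap[i] down while one of its children is smaller
        n = len(self.heap)
        while True:
            smallest = i
            for c in (2 * i + 1, 2 * i + 2):
                if c < n and self.heap[c] < self.heap[smallest]:
                    smallest = c
            if smallest == i:
                return
            self.heap[i], self.heap[smallest] = self.heap[smallest], self.heap[i]
            i = smallest

    def insert(self, key):
        # Add at the end of the tree, then fix the path to the root
        self.heap.append(key)
        self._sift_up(len(self.heap) - 1)

    def extract_min(self):
        if not self.heap:
            raise IndexError("extract_min from an empty heap")
        root = self.heap[0]
        last = self.heap.pop()
        if self.heap:
            # Move the last leaf to the root and sift it down
            self.heap[0] = last
            self._sift_down(0)
        return root

h = ArrayMinHeap()
for key in (1, 9, 10, 5, 200):
    h.insert(key)
print([h.extract_min() for _ in range(5)])  # [1, 5, 9, 10, 200]
```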


# Binomial Heap

The main application of Binary Heap is in implementing priority queue.

Binomial Heap is an extension of Binary Heap that provides faster union or merge operation together with other operations provided by Binary Heap.

A Binomial Heap is a collection of Binomial Trees

<pre>
What is a Binomial Tree?
A Binomial Tree of order 0 has 1 node. A Binomial Tree of order k can be constructed by taking two binomial trees of order k-1 and making one the leftmost child of the other.
A Binomial Tree of order k has following properties.
a) It has exactly 2^k nodes.
b) It has depth as k.
c) There are exactly kCi nodes at depth i for i = 0, 1, . . . , k.
d) The root has degree k and children of root are themselves Binomial Trees with order k-1, k-2,.. 0 from left to right.
</pre>

<pre>
k = 0 (Single Node)

 o

k = 1 (2 nodes) 
[We take two k = 0 order Binomial Trees, and
make one as child of other]
 o
   \
     o

k = 2 (4 nodes)
[We take two k = 1 order Binomial Trees, and
make one as child of other]
   o
 /   \
o     o
       \
        o

k = 3 (8 nodes)
[We take two k = 2 order Binomial Trees, and
make one as child of other]
    o   
 /  |  \ 
o   o    o
    |   /  \
    o  o    o
             \
              o
</pre>
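The properties listed above can be checked on small trees. In this sketch a node is represented simply as the list of its child nodes, which is an assumption made to keep the example short. The new child is appended at the end of the list, so children appear in increasing order, the mirror image of the left-to-right order described above.

```python
from math import comb

def binomial_tree(k):
    # Order 0 is a single node with no children
    if k == 0:
        return []
    # Order k: take two order k-1 trees and make one a child of the other's root
    a = binomial_tree(k - 1)
    b = binomial_tree(k - 1)
    a.append(b)
    return a

def nodes_per_depth(tree, depth=0, counts=None):
    # Count how many nodes sit at each depth
    if counts is None:
        counts = {}
    counts[depth] = counts.get(depth, 0) + 1
    for child in tree:
        nodes_per_depth(child, depth + 1, counts)
    return counts

k = 3
counts = nodes_per_depth(binomial_tree(k))
print(sorted(counts.items()))  # [(0, 1), (1, 3), (2, 3), (3, 1)]
print(sum(counts.values()))    # 8
```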

<strong>Binomial Heap</strong>

A Binomial Heap is a set of Binomial Trees where each Binomial Tree follows Min Heap property. And there can be at most one Binomial Tree of any degree.

<pre>
12------------10--------------------20
             /  \                 /  | \
           15    50             70  50  40
           |                  / |    |     
           30               80  85  65 
                            |
                           100
A Binomial Heap with 13 nodes. It is a collection of 3 
Binomial Trees of orders 0, 2 and 3 from left to right. 
</pre>

<strong>Binary Representation of a number and Binomial Heaps</strong>

A Binomial Heap with n nodes has as many Binomial Trees as there are set bits in the binary representation of n. For example, let n be 13: there are 3 set bits in the binary representation of n (00001101), hence 3 Binomial Trees. The positions of the set bits also give the orders of these Binomial Trees. From this relation, we can conclude that there are O(Logn) Binomial Trees in a Binomial Heap with ‘n’ nodes.
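The relation between set bits and tree orders can be sketched in a couple of lines. `binomial_tree_orders` is a hypothetical helper name.

```python
def binomial_tree_orders(n):
    # Bit position b is set in n exactly when the heap contains a tree of order b
    return [bit for bit in range(n.bit_length()) if (n >> bit) & 1]

print(binomial_tree_orders(13))  # [0, 2, 3]
```

This matches the 13-node heap drawn earlier, which has trees of orders 0, 2 and 3.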

<strong>Operations of Binomial Heap</strong>

The main operation in Binomial Heap is union(), all other operations mainly use this operation. The union() operation is to combine two Binomial Heaps into one.

insert(H, k): Inserts a key ‘k’ to Binomial Heap ‘H’. This operation first creates a Binomial Heap with single key ‘k’, then calls union on H and the new Binomial heap.

getMin(H): A simple way to getMin() is to traverse the list of roots of the Binomial Trees and return the minimum key. This implementation requires O(Logn) time (because there are at most about logn binomial trees for n nodes, so at most about logn roots to check). It can be optimized to O(1) by maintaining a pointer to the minimum key root.

extractMin(H): This operation also uses union(). We first call getMin() to find the minimum key Binomial Tree, then we remove the node and create a new Binomial Heap by connecting all subtrees of the removed minimum node. Finally, we call union() on H and the newly created Binomial Heap. This operation requires O(Logn) time.

delete(H): Like Binary Heap, the delete operation first reduces the key to minus infinity, then calls extractMin().

decreaseKey(H): decreaseKey() is also similar to Binary Heap. We compare the decreased key with its parent, and if the parent’s key is greater, we swap keys and recur for the parent. We stop when we either reach a node whose parent has a smaller key or we hit the root node. Time complexity of decreaseKey() is O(Logn).

<strong>Union operation in Binomial Heap</strong>

<pre>
Given two Binomial Heaps H1 and H2, union(H1, H2) creates a single Binomial Heap.

1) The first step is to simply merge the two Heaps in non-decreasing order of degrees. In the following diagram, figure(b) shows the result after merging.

2) After the simple merge, we need to make sure that there is at most one Binomial Tree of any order. To do this, we combine Binomial Trees of the same order. We traverse the list of merged roots, keeping track of three pointers: prev, x and next-x. There are 4 possible cases as we traverse the list of roots.
Case 1: Orders of x and next-x are not the same; we simply move ahead.
In the following 3 cases the orders of x and next-x are the same.
Case 2: If the order of next-next-x is also the same, move ahead.
Case 3: If the key of x is smaller than or equal to the key of next-x, make next-x a child of x by linking it with x.
Case 4: If the key of x is greater, make x a child of next-x.
</pre>

![](https://media.geeksforgeeks.org/wp-content/uploads/Bionomial_tree_2.png)

# Fibonacci Heap

In terms of amortized time complexity, Fibonacci Heap beats both Binary and Binomial Heaps.

<pre>
1) Find Min:      Θ(1)     [Same as both Binary and Binomial]
2) Delete Min:    O(Log n) [Θ(Log n) in both Binary and Binomial]
3) Insert:        Θ(1)     [Θ(Log n) in Binary and Θ(1) in Binomial]
4) Decrease-Key:  Θ(1)     [Θ(Log n) in both Binary and Binomial]
5) Merge:         Θ(1)     [Θ(m Log n) or Θ(m+n) in Binary and
                            Θ(Log n) in Binomial]
</pre>

Like Binomial Heap, Fibonacci Heap is a collection of trees with min-heap or max-heap property. In a Fibonacci Heap, trees can have any shape; all trees can even be single nodes (this is unlike Binomial Heap, where every tree has to be a Binomial Tree).

![](https://media.geeksforgeeks.org/wp-content/uploads/Fibonacci-Heap.png)

Fibonacci Heap maintains a pointer to minimum value (which is root of a tree). All tree roots are connected using circular doubly linked list, so all of them can be accessed using single ‘min’ pointer.

<pre>
Facts about Fibonacci Heap

The reduced time complexity of Decrease-Key has importance in Dijkstra and Prim algorithms. With Binary Heap, time complexity of these algorithms is O(VLogV + ELogV). If Fibonacci Heap is used, then time complexity is improved to O(VLogV + E)

Although Fibonacci Heap looks promising time complexity wise, it has been found slow in practice as hidden constants are high.

Fibonacci heaps are so called mainly because Fibonacci numbers are used in the running time analysis. Also, every node in a Fibonacci Heap has degree at most O(log n), and the size of a subtree rooted at a node of degree k is at least F<sub>k+2</sub>, where F<sub>k</sub> is the kth Fibonacci number.
</pre>

# Leftist Tree / Leftist Heap

# K-ary Heap

# HeapSort

In [11]:
def maxHeapify(arr,n,i):
    # Sift arr[i] down within the first n elements of arr
    left=2*i+1
    right=2*i+2
    if left<n and arr[i]<arr[left]:
        largest=left
    else:
        largest=i
    if right<n and arr[right]>arr[largest]:
        largest=right
    if largest!=i:
        arr[largest],arr[i]=arr[i],arr[largest]
        maxHeapify(arr,n,largest)

def buildMaxHeap(arr,n):
    # Heapify every non-leaf node, from the last one back to the root
    for i in range(n//2-1,-1,-1):
        maxHeapify(arr,n,i)

def heapSort(arr):
    n=len(arr)
    buildMaxHeap(arr,n)
    # Move the current maximum to the end, then re-heapify the root over the shrunken heap of size i
    for i in range(n-1,0,-1):
        arr[0],arr[i]=arr[i],arr[0]
        maxHeapify(arr,i,0)

if __name__ == '__main__':
    arr=[ 12, 11, 13, 5, 6, 7]
    heapSort(arr)
    print(arr)


[5, 6, 7, 11, 12, 13]


Heap sort is an in-place algorithm.

Time Complexity: Time complexity of maxHeapify() is O(Logn). Time complexity of buildMaxHeap() is O(n), and the overall time complexity of Heap Sort is O(nLogn).

Heap sort is not stable: swapping the root with the last element of the heap can change the relative order of equal keys.
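A small demonstration that heap sort is not stable. This is a sketch that sorts records by a key function; `heap_sort_by_key` is a hypothetical variant of the heap sort above, written iteratively, and the two-record input is chosen only because it is easy to trace by hand.

```python
def heap_sort_by_key(records, key):
    n = len(records)

    def sift_down(i, size):
        # Standard max-heap sift-down, comparing records by key(record)
        while True:
            largest = i
            for c in (2 * i + 1, 2 * i + 2):
                if c < size and key(records[c]) > key(records[largest]):
                    largest = c
            if largest == i:
                return
            records[i], records[largest] = records[largest], records[i]
            i = largest

    # Build the max heap bottom-up
    for i in range(n // 2 - 1, -1, -1):
        sift_down(i, n)
    # Repeatedly move the maximum to the end of the shrinking heap
    for end in range(n - 1, 0, -1):
        records[0], records[end] = records[end], records[0]
        sift_down(0, end)

records = [(1, "first"), (1, "second")]
heap_sort_by_key(records, key=lambda r: r[0])
print(records)  # [(1, 'second'), (1, 'first')]
```

Both records have the same key, yet they come out in the opposite order, which a stable sort would never do.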

# Iterative HeapSort

HeapSort is a comparison-based sorting technique: we first build a max heap, then repeatedly swap the root element with the last element of the heap (size times), restoring the heap property each time, until the array is sorted.

In [12]:
def buildHeap(arr,n):
    for i in range(n):
        if arr[i]>arr[int((i-1)/2)]:
            j=i
            while arr[j]>arr[int((j-1)/2)]:
                arr[j],arr[int((j-1)/2)]=arr[int((j-1)/2)],arr[j]
                j=int((j-1)/2)

def heapSort(arr,n):
    buildHeap(arr,n)
    # print(arr)
    for i in range(n-1,0,-1):
        arr[0],arr[i]=arr[i],arr[0]
        j,index=0,0
        while True:
            index=2*j+1
            # if left child is less than right child
            if index<i-1 and arr[index]<arr[index+1]:
                index+=1
            # if parent is less than the child
            if index<i and arr[index]>arr[j]:
                arr[index],arr[j]=arr[j],arr[index]

            j=index
            if index>=i:
                break

if __name__ == '__main__':
    arr=[10,20,15,17,9,21]
    n=len(arr)
    heapSort(arr,n)
    print(arr)


[9, 10, 15, 17, 20, 21]


Time Complexity - O(nlogn)

# K largest(or smallest) elements in an array

Approach 1-> Use Max heap

In [13]:
from heapq import _heapify_max, _heappop_max

def getKLargest(arr,k):
    n=len(arr)
    _heapify_max(arr)
    for i in range(k):
        value=_heappop_max(arr)
        print(value,end=" ")

if __name__ == '__main__':
    arr=[1, 23, 12, 9, 30, 2, 50]
    k=3
    result=getKLargest(arr,k)


50 30 23 

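Time Complexity -> O(n + klogn) (O(n) to build the max heap, plus k extractions at O(logn) each). Note that _heapify_max and _heappop_max are underscore-prefixed private helpers of heapq, so they are not part of its documented interface.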

Approach 2-> Use Min heap. Maintain a min heap of size k; for each remaining element greater than the root of the heap, replace the root with that element and restore the heap property

In [14]:
from heapq import heapify,heapreplace

def getKLargest(arr,k):
    # Min heap holding the k largest elements seen so far
    heapArr=[]
    for i in range(k):
        heapArr.append(arr[i])
    heapify(heapArr)
    for i in range(k,len(arr)):
        if arr[i]>heapArr[0]:
            # Replace the smallest of the k candidates in O(logk)
            heapreplace(heapArr,arr[i])
    print(" ".join(map(str,heapArr)))

if __name__ == '__main__':
    arr=[1, 23, 12, 9, 30, 2, 50]
    k=3
    result=getKLargest(arr,k)


23 50 30


Time Complexity-> O(k + (n-k)logk), provided each root replacement costs O(logk) (a single sift-down, as heapreplace() does). This does not give the result in sorted order. If the results are required in sorted order, O(klogk) extra time is needed
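For everyday use, the standard library already provides this: `heapq.nlargest` and `heapq.nsmallest` return the k largest or smallest elements in sorted order. A short usage sketch on the same input:

```python
import heapq

arr = [1, 23, 12, 9, 30, 2, 50]
print(heapq.nlargest(3, arr))   # [50, 30, 23]
print(heapq.nsmallest(3, arr))  # [1, 2, 9]
```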

# K’th Smallest/Largest Element in Unsorted Array

In [16]:
from heapq import heapify,heappop

def getKSmallestElement(arr,k):
    heapify(arr)
    for i in range(k):
        value=heappop(arr)

    return value

if __name__ == '__main__':
    arr=[7, 10, 4, 3, 20, 15]
    k=3
    result=getKSmallestElement(arr,k)
    print(result)


7


Time Complexity -> O(n +klogn)

Another Approach -> Using max heap

In [17]:
from heapq import _heapify_max, _heappop_max

def getKSmallestElement(arr,k):

    heapArr=[]
    for i in range(k):
        heapArr.append(arr[i])
    _heapify_max(heapArr)

    for i in range(k,len(arr)):
        if arr[i]<heapArr[0]:
            heapArr[0]=arr[i]
            _heapify_max(heapArr)

    return heapArr[0]


if __name__ == '__main__':
    arr=[7, 10, 4, 3, 20, 15]
    k=3
    result=getKSmallestElement(arr,k)
    print(result)


7


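Time Complexity -> O(k + (n-k)logk) when each root replacement is a single O(logk) sift-down. The code above calls _heapify_max() again after every replacement, which costs O(k) each time, so as written it is O(k + (n-k)k) in the worst case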

# Sort a nearly sorted (or K sorted) array

Given an array of n elements, where each element is at most k positions away from its target position, devise an algorithm that sorts it in O(n log k) time. For example, if k is 2, the element at index 7 in the sorted array can be at index 5, 6, 7, 8 or 9 in the given array.
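To make the definition concrete, this sketch computes the smallest k for which an array is k sorted. It assumes the elements are distinct, so that every value has exactly one target position; `max_displacement` is a hypothetical helper name.

```python
def max_displacement(arr):
    # Map each value to its index in the sorted array (assumes distinct values)
    target = {value: idx for idx, value in enumerate(sorted(arr))}
    # The array is k sorted for any k >= the largest distance to the target index
    return max((abs(i - target[v]) for i, v in enumerate(arr)), default=0)

arr = [6, 5, 3, 2, 8, 10, 9]
print(max_displacement(arr))  # 3
```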

In [18]:
from heapq import heapify,heappop,heappush

def sortNearlySortedArray(arr,k):
    heapArr=[]

    # size k+1 because each element is at most k positions away from its sorted position
    # (min() guards against k+1 being larger than the array)
    for i in range(min(k+1,len(arr))):
        heapArr.append(arr[i])

    heapify(heapArr)

    index=0
    for i in range(k+1,len(arr)):
        arr[index]=heappop(heapArr)
        heappush(heapArr,arr[i])
        index+=1

    while heapArr:
        arr[index]=heappop(heapArr)
        index+=1

if __name__ == '__main__':
    arr=[6, 5, 3, 2, 8, 10, 9]
    k=3
    sortNearlySortedArray(arr,k)
    print(arr)


[2, 3, 5, 6, 8, 9, 10]


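Time Complexity -> O(k + (n-k)logk) for building the heap and the main loop, plus O(klogk) to drain the heap at the end, i.e. O(nlogk) overall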

Another Approach-> Using insertion sort

In [19]:
def sortNearlySortedArray(arr,k):
    for i in range(len(arr)):
        j=i-1
        key=arr[i]
        # for a k sorted array this loop runs at most k times
        while j>=0 and arr[j]>key:
            arr[j+1]=arr[j]
            j-=1
        arr[j+1]=key


if __name__ == '__main__':
    arr=[6, 5, 3, 2, 8, 10, 9]
    k=3
    sortNearlySortedArray(arr,k)
    print(arr)


[2, 3, 5, 6, 8, 9, 10]


Time Complexity-> O(nk): the inner loop runs at most k times and the outer loop runs n times

We can also use a Balanced Binary Search Tree instead of a Heap to store K+1 elements. The insert and delete operations on a Balanced BST also take O(Logk) time, so the Balanced BST based method also takes O(nLogk) time. The Heap based method tends to be more efficient in practice, as the minimum element is always at the root. Also, a Heap doesn’t need extra space for left and right pointers.

# Tournament Tree

# Check if a given Binary Tree is Heap

<pre>
Given a binary tree, we need to check whether it has the heap property or not. A binary tree needs to fulfill the following two conditions to be a heap:

It should be a complete tree (i.e. all levels except the last should be full, and the last level should be filled from the left).
Every node’s value should be greater than or equal to the values of its child nodes (considering max-heap).
</pre>

In [1]:
class Node:
    def __init__(self,data):
        self.data=data
        self.left=None
        self.right=None

def isComplete(root,index,total):
    if root is None:
        return True
    if index>=total:
        return False
    return isComplete(root.left,2*index+1,total) and isComplete(root.right,2*index+2,total)

def countNodes(root):
    if root is None:
        return 0
    return 1+countNodes(root.left)+countNodes(root.right)

def doesSatisfyProperty(root):
    if root.left is None and root.right is None:
        return True
    if root.right is None:
        # In a complete tree a node with only a left child has a leaf as that child
        return root.data>=root.left.data
    else:
        if root.data>=root.left.data and root.data>=root.right.data:
            return doesSatisfyProperty(root.left) and doesSatisfyProperty(root.right)
        else:
            return False

def isHeap(root):
    if root is None:
        return True
    n=countNodes(root)
    return isComplete(root,0,n) and doesSatisfyProperty(root)


if __name__ == '__main__':
    root = Node(5)
    root.left = Node(2)
    root.right = Node(3)
    root.left.left = Node(1)
    print(isHeap(root))


True


Time Complexity ->O(n)
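An alternative sketch that checks both conditions in a single level order pass instead of counting nodes first. It redefines a small `Node` class so the block is self-contained; `is_max_heap_bfs` is a hypothetical name. The idea is that in a complete tree, once a missing child has been seen in level order, no later child may exist.

```python
from collections import deque

class Node:
    def __init__(self, data):
        self.data = data
        self.left = None
        self.right = None

def is_max_heap_bfs(root):
    if root is None:
        return True
    queue = deque([root])
    seen_gap = False  # becomes True once a missing child is encountered
    while queue:
        node = queue.popleft()
        for child in (node.left, node.right):
            if child is None:
                seen_gap = True
            else:
                # A child after a gap breaks completeness;
                # a child larger than its parent breaks the max heap property
                if seen_gap or child.data > node.data:
                    return False
                queue.append(child)
    return True

root = Node(5)
root.left = Node(2)
root.right = Node(3)
root.left.left = Node(1)
print(is_max_heap_bfs(root))  # True
```

This also runs in O(n) time, using a queue instead of the recursion stack.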