# Heap

A heap is a complete binary tree that satisfies the heap property. In a min-heap, every node's value is less than or equal to the values of its children; in a max-heap, the inequality is reversed. Heaps are commonly used to implement priority queues, where the smallest (or largest) element is always at the root of the tree.


Heap data structure

![Min-heap and max-heap](MinHeapAndMaxHeap1.png)
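
As a small illustration of the heap property, the sketch below uses Python's standard `heapq` module, which maintains a min-heap on a plain list. The helper `is_min_heap` and the sample `values` are made up for this example; the max-heap is simulated by storing negated values, the same trick used in the solutions further down.

```python
import heapq


def is_min_heap(items):
    """Check the min-heap property on the list layout (children of i are at 2i+1 and 2i+2)."""
    n = len(items)
    return all(
        items[i] <= items[child]
        for i in range(n)
        for child in (2 * i + 1, 2 * i + 2)
        if child < n
    )


values = [5, 1, 8, 3, 2]  # sample data, chosen only for illustration

# heapq keeps the smallest item at index 0.
min_heap = []
for v in values:
    heapq.heappush(min_heap, v)

# Simulate a max-heap by negating values on the way in and on the way out.
max_heap = [-v for v in values]
heapq.heapify(max_heap)

smallest = min_heap[0]   # peek at the root without removing it
largest = -max_heap[0]

heap_ok = is_min_heap(min_heap)

# Popping repeatedly yields the items in ascending order.
ascending = [heapq.heappop(min_heap) for _ in range(len(values))]
```

Peeking at the root is O(1), while each push or pop costs O(log n).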

# Find Median From Data Stream

The median is the middle value in an ordered integer list. If the size of the list is even, there is no middle value, and the median is the mean of the two middle values.

- For example, for arr = [2,3,4], the median is 3.
- For example, for arr = [2,3], the median is (2 + 3) / 2 = 2.5.
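
The definition above can be checked with a brute-force baseline that sorts the whole list on every call. This is only a reference sketch (the function name `median_of` is hypothetical), useful for testing faster solutions, not the approach the problem is asking for.

```python
def median_of(values):
    """Return the median of a non-empty list by sorting it: O(n log n) per call."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd length: a single middle value exists.
        return float(ordered[mid])
    # Even length: mean of the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2


odd_case = median_of([2, 3, 4])
even_case = median_of([2, 3])
```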

Implement the MedianFinder class:

- MedianFinder() initializes the MedianFinder object.
- void addNum(int num) adds the integer num from the data stream to the data structure.
- double findMedian() returns the median of all elements so far. Answers within 10^-5 of the actual answer will be accepted.
 

**Example 1:**

**Input**
["MedianFinder", "addNum", "addNum", "findMedian", "addNum", "findMedian"]
[[], [1], [2], [], [3], []]
**Output**
[null, null, null, 1.5, null, 2.0]

**Explanation**
MedianFinder medianFinder = new MedianFinder();
medianFinder.addNum(1);    // arr = [1]
medianFinder.addNum(2);    // arr = [1, 2]
medianFinder.findMedian(); // return 1.5 (i.e., (1 + 2) / 2)
medianFinder.addNum(3);    // arr = [1, 2, 3]
medianFinder.findMedian(); // return 2.0
 

**Constraints:**

- -10^5 <= num <= 10^5
- There will be at least one element in the data structure before calling findMedian.
- At most 5 * 10^4 calls will be made to addNum and findMedian.
 

**Follow up:**

- If all integer numbers from the stream are in the range [0, 100], how would you optimize your solution?
- If 99% of all integer numbers from the stream are in the range [0, 100], how would you optimize your solution?
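
One possible answer to the first follow-up, sketched under the assumption that every `num` is an integer in [0, 100], is to replace the heaps with a fixed array of 101 counters. The class name `BucketMedianFinder` and the helper `_kth` are hypothetical names used only for this illustration.

```python
class BucketMedianFinder:
    """Counting-based variant; assumes every num is an integer in [0, 100]."""

    def __init__(self) -> None:
        self.counts = [0] * 101  # counts[v] = how many times v has been added
        self.total = 0

    def addNum(self, num: int) -> None:  # O(1)
        self.counts[num] += 1
        self.total += 1

    def _kth(self, k: int) -> int:
        """Return the k-th smallest value seen so far (0-indexed)."""
        seen = 0
        for value, count in enumerate(self.counts):
            seen += count
            if seen > k:
                return value
        raise IndexError("k is out of range")

    def findMedian(self) -> float:  # scans at most 101 buckets, so O(1)
        mid = self.total // 2
        if self.total % 2 == 1:
            return float(self._kth(mid))
        return (self._kth(mid - 1) + self._kth(mid)) / 2


finder = BucketMedianFinder()
finder.addNum(1)
finder.addNum(2)
first = finder.findMedian()
finder.addNum(3)
second = finder.findMedian()
```

Here both operations run in constant time and the memory use does not grow with the stream. For the second follow-up, a commonly suggested extension is to keep separate storage or counters for values outside [0, 100]; that case is not covered by this sketch.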

In [None]:
import unittest
from typing import *
from heapq import heappush, heappop


class MedianFinder:

    def __init__(self):
        self.left_heap = []
        self.right_heap = []

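    def addNum(self, num: int) -> None:  # time complexity: O(log n)
        # left_heap is a max-heap (values stored negated) holding the lower half;
        # right_heap is a min-heap holding the upper half.

        # Anything larger than the smallest upper-half value belongs on the right.
        if self.right_heap and self.right_heap[0] < num:
            heappush(self.right_heap, num)
        # Otherwise it goes to the lower half, negated for max-heap ordering.
        else:
            heappush(self.left_heap, -num)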

        self.rebalance()

    def findMedian(self) -> float:  # time complexity: O(1), only the heap tops are read
        if len(self.right_heap) == len(self.left_heap):
            return (-self.left_heap[0] + self.right_heap[0]) / 2

        if len(self.right_heap) > len(self.left_heap):
            return self.right_heap[0]

        return -self.left_heap[0]

    def rebalance(self) -> None:
        if len(self.left_heap) > len(self.right_heap) + 1:
            left_value = heappop(self.left_heap) * -1
            heappush(self.right_heap, left_value)

        if len(self.right_heap) > len(self.left_heap) + 1:
            # Move the smallest upper-half value to the lower half (negated).
            right_value = heappop(self.right_heap)
            heappush(self.left_heap, -right_value)


# Solution 2: optimization

- The idea is to split the numbers into two balanced halves: a low half that stores the smaller numbers and a high half that stores the larger ones. To access the median in O(1), we need a data structure that gives us the maximum of the low half and the minimum of the high half in O(1). That is where maxHeap and minHeap come into play.

- We use maxHeap to store the low half; the top of maxHeap is the highest number among the low numbers.

- We use minHeap to store the high half; the top of minHeap is the lowest number among the high numbers.

- We need to keep the sizes of maxHeap and minHeap balanced while processing. After adding k elements:
  - If k = 2 * i, then maxHeap.size = minHeap.size = i.
  - If k = 2 * i + 1, maxHeap stores one more element than minHeap, so maxHeap.size = minHeap.size + 1.

- When adding a new number num to our MedianFinder:
  - First, add num to maxHeap. maxHeap may now contain a large element that belongs in minHeap, so we rebalance by removing the highest element from maxHeap and pushing it onto minHeap.
  - Now minHeap might hold more elements than maxHeap. In that case, we restore the size balance by removing the lowest element from minHeap and pushing it back onto maxHeap.

- When calling findMedian():
  - If maxHeap.size > minHeap.size, return the top of maxHeap, which is the highest number among the low numbers.
  - Else, if maxHeap.size == minHeap.size, return (maxHeap.top() + minHeap.top()) / 2.

  ![solution](image-solution-heap-find-median.png)
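
To make the size invariant concrete, the following sketch applies the two-step insertion described above to a sample stream and records the heap sizes after each number. The function `trace_balance` and the input stream are hypothetical and exist only for this illustration.

```python
from heapq import heappush, heappop


def trace_balance(stream):
    """Insert each number with the two-step procedure and record (maxHeap size, minHeap size)."""
    max_heap, min_heap = [], []  # max_heap stores negated values
    sizes = []
    for num in stream:
        # Step 1: route the number through maxHeap, then hand its largest value to minHeap.
        heappush(max_heap, -num)
        heappush(min_heap, -heappop(max_heap))
        # Step 2: if minHeap became larger, give its smallest value back to maxHeap.
        if len(min_heap) > len(max_heap):
            heappush(max_heap, -heappop(min_heap))
        sizes.append((len(max_heap), len(min_heap)))
    low_top = -max_heap[0]
    high_top = min_heap[0] if min_heap else None
    return sizes, low_top, high_top


sizes, low_top, high_top = trace_balance([5, 1, 8, 3, 2])
```

For this stream the recorded sizes are (1, 0), (1, 1), (2, 1), (2, 2), (3, 2): maxHeap is never smaller than minHeap and never ahead by more than one element. After the last insertion the top of maxHeap is 3, which is the median of [1, 2, 3, 5, 8].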

In [28]:
# Space complexity: O(N)
# Time complexity: addNum is O(log N), findMedian is O(1)
class MedianFinder:
    def __init__(self): #O(1)
        self.minHeap = []
        self.maxHeap = []

    def addNum(self, num: int) -> None:  # O(log N): a constant number of heap pushes and pops
        heappush(self.maxHeap, -num)
        heappush(self.minHeap, -heappop(self.maxHeap))
        if len(self.minHeap) > len(self.maxHeap):
            heappush(self.maxHeap, -heappop(self.minHeap))

    def findMedian(self) -> float: #O(1)
        if len(self.maxHeap) > len(self.minHeap):
            return -self.maxHeap[0]
        return (-self.maxHeap[0] + self.minHeap[0]) / 2

In [22]:

# Your MedianFinder object will be instantiated and called as such:
obj = MedianFinder()
# obj.addNum(num)
# param_2 = obj.findMedian()

In [23]:
obj.addNum(1)


In [24]:
obj.addNum(2)

In [25]:
obj.findMedian()

1.5

In [26]:
obj.addNum(3)

In [27]:
obj.findMedian()

2