# Time/Space Complexity - Intro to Data Structures (User Defined)

### Topics to discuss today:

<ul>
    <li>Time and Space Complexity - What is it/How do we measure it</li>
    <li>Asymptotic Analysis</li>
    <li><strong>Data Structures</strong></li>
    <li>Some of the popular sorting algorithms</li>
</ul>

### Data Structures to discuss:
- Arrays
- Stacks
- Queues
- Linked Lists
    - Singly Linked Lists
    - Traversing A Linked List
    - Finding a node in a linked list
    - Adding to a linked list
- Binary Search Trees
    - Construction
    - Traversal


## Time and Space Complexity

#### What is it?

Time complexity measures how the running time of an operation (a function) grows with the size of its input. In the same fashion, space complexity measures how much memory an algorithm or data structure needs. A problem can have multiple solutions, so finding the optimal one means analyzing each candidate in both time and space.

#### How do we measure Time and Space Complexity?

To measure time and space complexity we use asymptotic analysis, because we need a mathematical way to compare different algorithms (functions) based on the size of their inputs. For example, one function might take on the order of n steps and another on the order of n^2 steps. With everything else held constant, the only thing that changes is the size of the input. The chart below lists the common complexity classes, from slowest-growing to fastest-growing.

<table style="text-align:center;" class="table table-bordered">
<tbody><tr>
<td>constant</td>
<td>−</td>
<td>Ο(1)</td>
</tr>
<tr>
<td>logarithmic</td>
<td>−</td>
<td>Ο(log n)</td>
</tr>
<tr>
<td>linear</td>
<td>−</td>
<td>Ο(n)</td>
</tr>
<tr>
<td>linear logarithmic</td>
<td>−</td>
<td>Ο(n log n)</td>
</tr>
<tr>
<td>quadratic</td>
<td>−</td>
<td>Ο(n<sup>2</sup>)</td>
</tr>
<tr>
<td>cubic</td>
<td>−</td>
<td>Ο(n<sup>3</sup>)</td>
</tr>
<tr>
<td>polynomial</td>
<td>−</td>
<td>n<sup>Ο(1)</sup></td>
</tr>
<tr>
<td>exponential</td>
<td>−</td>
<td>2<sup>Ο(n)</sup></td>
</tr>
</tbody></table>
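
To make the gap between two rows of the chart concrete, here is a minimal sketch that counts the comparisons made by a linear search, O(n), and a binary search, O(log n), on the same sorted list. The function names `linear_search_steps` and `binary_search_steps` are made up for this illustration and are not part of the lesson code.

```python
def linear_search_steps(sorted_items, target):
    """Count comparisons made by a front-to-back scan - O(n)."""
    steps = 0
    for item in sorted_items:
        steps += 1
        if item == target:
            break
    return steps


def binary_search_steps(sorted_items, target):
    """Count comparisons made by halving the search range - O(log n)."""
    steps = 0
    low, high = 0, len(sorted_items) - 1
    while low <= high:
        steps += 1
        mid = (low + high) // 2
        if sorted_items[mid] == target:
            break
        elif sorted_items[mid] < target:
            low = mid + 1   # target can only be in the upper half
        else:
            high = mid - 1  # target can only be in the lower half
    return steps


numbers = list(range(1000))
print(linear_search_steps(numbers, 999))  # worst case: looks at every item
print(binary_search_steps(numbers, 999))  # roughly log2(1000) comparisons
```

Making the list ten times longer adds ten times as many steps to the linear scan, but only three or four steps to the binary search.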

## Arrays

In Python we benefit from the dynamic array (the built-in `list`), which means the block of memory expands as needed to hold the items added to it. In a traditional array, the elements sit in consecutive blocks of memory, each typically 4 or 8 bytes wide depending on the element type and the platform (for example, 32-bit versus 64-bit).

## Which in Python looks like this:

In [1]:
list_of_ten = []
for i in range(1,11):
    list_of_ten.append(i)
print(list_of_ten)


list_of_ten_comp = [i for i in range(1,11)]
print(list_of_ten_comp)

[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
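
The expanding block of memory can be observed indirectly. The sketch below assumes CPython, where `sys.getsizeof` reports the memory currently allocated for a list object; other interpreters may behave differently or not support the call. The helper name `count_resizes` is hypothetical.

```python
import sys


def count_resizes(n_appends):
    """Count how often the list's allocated size changes during n appends."""
    items = []
    last_size = sys.getsizeof(items)
    resizes = 0
    for i in range(n_appends):
        items.append(i)
        size = sys.getsizeof(items)
        if size != last_size:
            # The list grew its underlying block to make room
            resizes += 1
            last_size = size
    return resizes


# Far fewer resizes than appends: the list over-allocates each time it grows
print(count_resizes(1000))
```

Because the list reserves extra room each time it grows, most appends do not trigger a reallocation, which is why appending is treated as constant time on average.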


### Let's take a look at some of the time and space analysis of arrays

In [2]:
import random

my_arr = [random.randint(1,100) for _ in range(10)]
print(my_arr)

[48, 91, 23, 58, 19, 14, 48, 60, 14, 60]


In [8]:
# Indexing a list
# Constant Time and Space - O(1)

indexing = my_arr[4]
print(indexing)


# Searching through an array
# Linear Time - O(n) and Constant Space - O(1)
print(600 in my_arr)


# Copying a list
# Linear Time - O(n) and Linear Space - O(n)
copied_arr = my_arr[:]
print(copied_arr)


# Assigning an index value to a list
# Constant Time - O(1) and Constant Space - O(1)
my_arr[4] = 1000
print(my_arr)

19
False
[48, 91, 23, 58, 19, 14, 48, 60, 14, 60]
[48, 91, 23, 58, 1000, 14, 48, 60, 14, 60]


In [9]:
def some_func(arr):
    for x in range(len(arr)): # O(n) - Linear Time
        arr[x] = arr[x]**2   # O(1) - Constant Time
    for i in arr:  # O(n) - Linear Time
        for j in arr: # O(n) - Linear Time
            print(i*j) # O(1) - Constant Time
    for r in arr: # O(n) - Linear Time
        print(r) # O(1) - Constant Time
    return arr # O(1) - Constant Time

some_func([1, 2, 3]) # O((n * 1) + (n * n * 1) + (n * 1) + (1))

# O(n + n**2 + n + 1)
# O(n**2 + 2n + 1)
# Drop the coefficients:
# O(n**2 + n + 1)
# Drop the lower-order terms:
# O(n**2)

1
4
9
4
16
36
9
36
81
1
4
9


[1, 4, 9]
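
As a rough check on that analysis, the sketch below mirrors the three loops of `some_func` but counts iterations instead of printing. `count_operations` is a hypothetical helper written for this note.

```python
def count_operations(n):
    """Count loop iterations for a function shaped like some_func."""
    arr = list(range(n))
    ops = 0
    for _ in range(len(arr)):  # first loop: n iterations
        ops += 1
    for _ in arr:              # nested loops: n * n iterations
        for _ in arr:
            ops += 1
    for _ in arr:              # last loop: n iterations
        ops += 1
    return ops


print(count_operations(10))   # 10 + 100 + 10 = 120
print(count_operations(100))  # 100 + 10000 + 100 = 10200
```

Growing the input by a factor of 10 grows the count by roughly a factor of 100, which is the n**2 term dominating.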

In [13]:
def median(lst): # O(1) - Constant Time (assumes lst is already sorted)
    if len(lst) % 2 == 0: # O(1) - len() on a list is constant time
        left_mid = len(lst) // 2 - 1 # O(1)
        right_mid = len(lst) // 2 # O(1)
        return (lst[left_mid] + lst[right_mid]) / 2 # O(1)
    else:
        mid = len(lst) // 2 # O(1)
        return lst[mid] # O(1)
    
    
print(median([10, 12, 13, 14, 17, 19, 22]))
print(median([10, 12, 13, 14, 17, 19, 22, 24]))

14
15.5
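
The `median` function above indexes straight into the middle of the list, so it only gives the right answer when the list is already sorted. A minimal sketch of a version for unsorted input follows. The name `median_unsorted` is an assumption for illustration, and the sort step changes the cost to O(n log n) time and O(n) extra space.

```python
def median_unsorted(values):
    """Return the median of a list that may not be sorted."""
    if not values:
        raise ValueError("median of an empty list is undefined")
    ordered = sorted(values)  # O(n log n) time, O(n) extra space for the copy
    mid = len(ordered) // 2
    if len(ordered) % 2 == 0:
        # Even length: average the two middle values
        return (ordered[mid - 1] + ordered[mid]) / 2
    # Odd length: the single middle value
    return ordered[mid]


print(median_unsorted([22, 10, 17, 13, 12, 19, 14]))      # 14
print(median_unsorted([24, 22, 10, 17, 13, 12, 19, 14]))  # 15.5
```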


In [19]:
def print_all_elements(a_list): # O(n)
    num_operations = 0
    for item in a_list:
        print(item)
        num_operations += 1
    print('Num Operations:', num_operations)
        
        
def print_first_item(a_list): # O(1)
    num_operations = 0
    print(a_list[0])
    num_operations += 1
    print('Num Operations:', num_operations)
    
    
    
print_all_elements([i for i in range(1,11)])
print_first_item([i for i in range(1,11)])   

1
2
3
4
5
6
7
8
9
10
Num Operations: 10
1
Num Operations: 1


## Stacks and Queues (Review)

**Stacks**, as the name suggests, are a data structure that follows the Last In First Out (LIFO) principle. Think of a stack of pancakes, for example: the last pancake added sits on top, so it is the first one you take off, and you reach the first pancake made only by working down from the top.

##### Searching through a stack will be Linear Time O(n) - Constant Space O(1)
##### Selecting the first item added (the bottom of the stack) will be done in Linear Time O(n) - Constant Space O(1)
##### Selecting the last item added (the top of the stack) will be done in Constant Time O(1) - Constant Space O(1)
##### Adding to the stack will be done in Constant Time O(1) - Constant Space O(1)

**Queues** are similar, but follow the First In First Out (FIFO) principle. Think of the line at a Black Friday sale: the first person camped out for the big-screen TV is the first to get it.

##### Searching through a queue will be Linear Time O(n) - Constant Space O(1)
##### Selecting the first item added (the front of the queue) will be done in Constant Time O(1) - Constant Space O(1)
##### Selecting the last item added (the back of the queue) will be done in Linear Time O(n) - Constant Space O(1)
##### Adding to the queue should take Constant Time O(1) - Constant Space O(1)
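
One way to picture the LIFO and FIFO rules is to wrap `collections.deque` in two small classes that expose only the constant-time operations. This is a minimal sketch, not part of the lesson code: the class and method names (`Stack`, `Queue`, `push`, `enqueue`, `dequeue`, `peek`) are assumptions chosen for illustration.

```python
from collections import deque


class Stack:
    """LIFO: the most recently added item comes out first."""

    def __init__(self):
        self._items = deque()

    def push(self, item):
        self._items.append(item)      # O(1) - add to the top

    def pop(self):
        return self._items.pop()      # O(1) - remove from the top

    def peek(self):
        return self._items[-1]        # O(1) - look at the top


class Queue:
    """FIFO: the earliest added item comes out first."""

    def __init__(self):
        self._items = deque()

    def enqueue(self, item):
        self._items.append(item)      # O(1) - add to the back

    def dequeue(self):
        return self._items.popleft()  # O(1) - remove from the front

    def peek(self):
        return self._items[0]         # O(1) - look at the front


stack, queue = Stack(), Queue()
for value in (10, 20, 30):
    stack.push(value)
    queue.enqueue(value)

print(stack.pop())      # 30 - last in, first out
print(queue.dequeue())  # 10 - first in, first out
```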

In [20]:
# https://docs.python.org/3/library/collections.html#collections.deque
from collections import deque

In [21]:
print('Stack:')

stack = deque([])

stack.append(10)
stack.append(20)
stack.append(30)
stack.append(40)
stack.append(50)

print(stack)

while stack:
    print(stack.pop())
    print(stack)

Stack:
deque([10, 20, 30, 40, 50])
50
deque([10, 20, 30, 40])
40
deque([10, 20, 30])
30
deque([10, 20])
20
deque([10])
10
deque([])


In [23]:
print('Queue:')

queue = deque([])

queue.append('Brian')
queue.append('Sarah')
queue.append('Kevin')
queue.append('Sam')
queue.append('Aften')

print(queue)

while queue:
    print(queue.popleft())
    print(queue)

Queue:
deque(['Brian', 'Sarah', 'Kevin', 'Sam', 'Aften'])
Brian
deque(['Sarah', 'Kevin', 'Sam', 'Aften'])
Sarah
deque(['Kevin', 'Sam', 'Aften'])
Kevin
deque(['Sam', 'Aften'])
Sam
deque(['Aften'])
Aften
deque([])


## Linked List (Data Structure)

A linked list is built from nodes. We create a Node class, then create another class that uses Node objects. Each node stores a value and a reference that points to the next node in the list.

There are some advantages and disadvantages with this data structure. **Advantages:** Linked lists are flexible with memory management because nodes are allocated one at a time as needed, so the list never needs one large contiguous block. **Disadvantages:** Finding a node, or adding one to the end of the list, requires traversing the list from the head.

In [24]:
class LinkedListNode:
    def __init__(self, value):
        self.value = value
        self.next = None
        
    def traverse_list(self):
        node = self
        while node is not None:
            print(node.value)
            node = node.next
            
            
node1 = LinkedListNode('Monday')
node2 = LinkedListNode('Tuesday')
node3 = LinkedListNode('Wednesday')

node1.next = node2
node2.next = node3

node1.traverse_list()

Monday
Tuesday
Wednesday


In [59]:
# Complete Implementation of Linked List
# - Add new node to the front of the Linked List
# - Add new node to the end of the Linked List
# - Get a node by its value
# - Insert new node after a particular node
# - Traverse through the Linked List and print values

# 2 Classes - Node Class and Linked List Class

class Node:
    def __init__(self, value):
        self.value = value
        self.next = None
        
    def __str__(self):
        return str(self.value)
    
    def __repr__(self):
        return f"<Node|{self.value}>"

class LinkedList:
    def __init__(self, head_node=None):
        # head attribute will point to the first node of the Linked List
        self.head = head_node
        
    # Method to add a new node to the front of the linked list
    def push_on(self, new_value): # O(1) - Constant Time
        # Create a new node with the value passed in
        new_node = Node(new_value)
        # Set the new node's next attribute to be the current head
        new_node.next = self.head
        # Set the new node to the front of the list (aka the head)
        self.head = new_node
        
    # Method to print out all of the nodes in the linked list in order
    def traverse_list(self):
        # Start at the beginning of the list
        node = self.head
        # While the node is not None, continue to loop
        while node is not None:
            # Print the node (which will call the Node __str__ method)
            print(node)
            # Set the node to the next node in the list
            node = node.next
            
    # Method to add a new node to the end of the linked list
    def append(self, new_value):
        # Create a new node with the value passed in
        new_node = Node(new_value)
        # Check if the linked list is empty
        if self.head is None:
            # Set the head to the new node
            self.head = new_node
        # If not empty
        else:
            # Traverse to the last node in the linked list (aka the node.next is None)
            node = self.head
            while node.next is not None:
                # move to the next node
                node = node.next
            # set the last node's next attribute to the new node
            node.next = new_node
            
    # Method that will return a node based on the value or return None if not in the list
    def get_node(self, value_to_get):
        # Start with the first node
        node_to_check = self.head
        # while the node to check is still a node
        while node_to_check is not None:
            # if the value of the node to check is equal to the value to get
            if node_to_check.value == value_to_get:
                # return that node
                return node_to_check
            # if not, move on to the next node
            node_to_check = node_to_check.next
        # Once the node to check is None, we know that the value to get is not in the linked list
        return None
    
    # Method to insert a new node in the linked list after a certain node (by value)
    def insert_after(self, prev_value, new_value):
        # Get the previous node by its value
        prev_node = self.get_node(prev_value)
        # Check if the previous node exists
        if prev_node is None:
            print(f"{prev_value} is not in the linked list.")
        else:
            # Create a new node with the new value passed in
            new_node = Node(new_value)
            # point the new_node's next attribute to the prev_node's next
            new_node.next = prev_node.next
            # point the previous node's .next to the new node
            prev_node.next = new_node
            
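    # Method that will return the node just before the node with the given value (or None)
    def find_before(self, value_to_get):
        node = self.head
        # Guard against an empty list, where self.head is None
        while node is not None and node.next is not None:
            if node.next.value == value_to_get:
                return node
            node = node.next
        return None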
            

        
        
months = LinkedList()
months.append('August')
months.push_on('July')
months.push_on('April')
months.push_on('January')
months.append('September')
months.insert_after('April', 'May')
months.insert_after('January', 'February')
# months.insert_after('October', 'November')
months.traverse_list()

January
February
April
May
July
August
September


In [60]:
months.find_before('April')

<Node|February>

In [38]:
import time

In [47]:
# Adding a new node to the end of the list - O(n) - Linear Time
a_linked_list = LinkedList()

start = time.time()

for i in range(1000):
    a_linked_list.append(i)

end = time.time()

print(end - start)

# Adding a new node to the beginning of the list - O(1) - Constant Time
b_linked_list = LinkedList()

start = time.time()

for i in range(1000):
    b_linked_list.push_on(i)

end = time.time()

print(end - start)

0.5616040229797363
0.0061266422271728516
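
The slow timing for adding to the end comes from walking the whole list on every call. A common variation, sketched here as an assumption rather than as part of the lesson's `LinkedList`, keeps a second reference to the last node so that appending no longer needs a traversal. The names `TailNode` and `TailLinkedList` are hypothetical.

```python
class TailNode:
    def __init__(self, value):
        self.value = value
        self.next = None


class TailLinkedList:
    """Singly linked list that also remembers its last node."""

    def __init__(self):
        self.head = None
        self.tail = None

    def append(self, new_value):  # O(1) - no traversal needed
        new_node = TailNode(new_value)
        if self.head is None:
            # Empty list: the new node is the head
            self.head = new_node
        else:
            # Link the old last node to the new one
            self.tail.next = new_node
        # Either way, the new node is now the tail
        self.tail = new_node

    def to_list(self):
        """Collect the values in order (O(n)), handy for checking results."""
        values = []
        node = self.head
        while node is not None:
            values.append(node.value)
            node = node.next
        return values


numbers = TailLinkedList()
for i in range(5):
    numbers.append(i)
print(numbers.to_list())  # [0, 1, 2, 3, 4]
```

The trade-off is one extra attribute to keep in sync: any method that removes or inserts at the end would also have to update `tail`.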


In [55]:
# Adding to the end of Python's built-in list - O(1) - Constant Time (amortized)
normal_list_a = []

start = time.time()

for i in range(1000):
    normal_list_a.append(i)

end = time.time()

print(end - start)

# Adding to the beginning of Python's built-in list - O(n) - Linear Time
normal_list_b = []

start = time.time()

for i in range(1000):
    normal_list_b.insert(0, i)

end = time.time()

print(end - start)

0.0009753704071044922
0.0031566619873046875
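
With only 1000 items the gap between the two timings is small. The sketch below repeats the front-insert experiment with more items and compares the built-in list against `collections.deque`, whose `appendleft` is a constant-time operation. The helper `time_front_inserts` is hypothetical, and actual timings will vary from machine to machine, so no expected numbers are shown.

```python
import time
from collections import deque


def time_front_inserts(make_container, insert_front, n):
    """Time n inserts at the front; return (seconds, filled container)."""
    container = make_container()
    start = time.perf_counter()
    for i in range(n):
        insert_front(container, i)
    return time.perf_counter() - start, container


# list.insert(0, x) shifts every existing element - O(n) per insert
list_seconds, as_list = time_front_inserts(
    list, lambda c, value: c.insert(0, value), 20000)

# deque.appendleft(x) does not shift anything - O(1) per insert
deque_seconds, as_deque = time_front_inserts(
    deque, lambda c, value: c.appendleft(value), 20000)

print('list.insert(0, x): ', list_seconds)
print('deque.appendleft(x):', deque_seconds)
```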


## Binary Search Trees

In [72]:
class BST:
    def __init__(self, value):
        self.value = value
        self.left = None
        self.right = None
        
    def __repr__(self):
        return f"<BST|{self.value}>"
    
    # Method to add a new node to the tree
    def insert(self, new_value):
        # if the new value is less than the current node's value
        if new_value < self.value:
            # if the current node has no left subtree
            if self.left is None:
                # Set the left subtree to be a new instance of BST
                self.left = BST(new_value)
            # if the current node does have a left subtree
            else:
                # call the insert method from the left subtree
                self.left.insert(new_value)
        # if the new value is greater than or equal to the current node's value
        else:
            # if the current node has no right subtree
            if self.right is None:
                # Set the right subtree to be a new instance of BST
                self.right = BST(new_value)
            # if the current node does have a right subtree
            else:
                # call the insert method from the right subtree
                self.right.insert(new_value)
      
    # Method to find a node based on value - will either return node or None
    def find_node(self, target):
        # if target is equal to self.value, we found our node
        if target == self.value:
            return self
        # if not, check if our target is less than the self value
        elif target < self.value:
            # if node's left subtree is empty (None)
            if self.left is None:
                # We know the target is not in the tree because it would be here
                return None
            # if the node does have a left subtree
            else:
                # call the find_node method from the left subtree and return that value
                return self.left.find_node(target)
        # if the target is greater than the self value
        elif target > self.value:
            # if node's right subtree is empty (None)
            if self.right is None:
                # We know the target is not in the tree because it would be here
                return None
            # if the node does have a right subtree
            else:
                # call the find_node method from the right subtree and return that value
                return self.right.find_node(target)
            
    # Method to get the max value in a tree
    def get_max_value(self):
        if self.right is None:
            return self.value
        else:
            return self.right.get_max_value()
        
    # Method to get the min value in a tree
    def get_min_value(self):
        if self.left is None:
            return self.value
        else:
            return self.left.get_min_value()
        
    # Remove a node from the tree (by value)
    def remove(self, value_to_remove, parent=None):
        # move left or right to find the node to delete
        if value_to_remove < self.value:
            if self.left is not None:
                self.left.remove(value_to_remove, self)
        elif value_to_remove > self.value:
            if self.right is not None:
                self.right.remove(value_to_remove, self)
        # When we finally find the node to delete
        else:
            # if the node to delete has both a left and right subtree - node has two children
            if self.left is not None and self.right is not None:
                # Find the largest value in the left subtree and copy the value to the current node
                self.value = self.left.get_max_value()
                # Remove the node from which we copied
                self.left.remove(self.value, self)
            # if the left or right is None but node has no parent - node has at most one child
            elif parent is None:
                # if left side is not empty
                if self.left is not None:
                    # Set root node to current node's left
                    self.value = self.left.value
                    self.right = self.left.right
                    self.left = self.left.left
                # if right side is not empty
                elif self.right is not None:
                    self.value = self.right.value
                    self.left = self.right.left
                    self.right = self.right.right
                # if both are empty
                else:
                    self.value = None
            # if the node to delete is to the left of its parent node
            elif parent.left == self:
                # Set the parent node's left attribute to the node to delete's left
                if self.left is not None:
                    parent.left = self.left
                else:
                    # or right if it doesn't have a left
                    parent.left = self.right
            elif parent.right == self:
                if self.left is not None:
                    parent.right = self.left
                else:
                    parent.right = self.right
            
                
tree = BST(50)
tree.insert(25)
tree.insert(10)
tree.insert(75)
tree.insert(64)
tree.insert(60)

In [73]:
print(tree.find_node(10))

<BST|10>


In [74]:
print(tree.find_node(99))

None


In [76]:
print(tree.get_max_value())
print(tree.get_min_value())

75
10
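
The topic list at the top mentions traversal. As a minimal sketch, an in-order traversal visits the left subtree, then the node itself, then the right subtree, which yields the values in sorted order. To stay self-contained, the block defines its own small `TreeNode` class, a hypothetical stand-in that uses the same insert rule as the `BST` class above; `in_order` is likewise an assumed name.

```python
class TreeNode:
    def __init__(self, value):
        self.value = value
        self.left = None
        self.right = None

    def insert(self, new_value):
        # Smaller values go left; equal or larger values go right
        if new_value < self.value:
            if self.left is None:
                self.left = TreeNode(new_value)
            else:
                self.left.insert(new_value)
        else:
            if self.right is None:
                self.right = TreeNode(new_value)
            else:
                self.right.insert(new_value)


def in_order(node):
    """Return the values of the tree rooted at node in ascending order."""
    if node is None:
        return []
    # Left subtree, then this node, then right subtree
    return in_order(node.left) + [node.value] + in_order(node.right)


root = TreeNode(50)
for value in [25, 10, 75, 64, 60]:
    root.insert(value)

print(in_order(root))  # [10, 25, 50, 60, 64, 75]
```

Every node is visited exactly once, so the traversal makes O(n) visits. Note that the list concatenation used here for brevity adds copying overhead on top of that.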


# Homework

#### Problem 1: Add a .remove method to the LinkedList

Complete the `.remove` method of the LinkedList class so that it removes a node from the list.

The method should take in the value to remove and remove the node with that value from the LinkedList.

In [3]:
class Node:
    def __init__(self, value):
        self.value = value
        self.next = None
        
    def __str__(self):
        return str(self.value)
    
    def __repr__(self):
        return f"<Node|{self.value}>"

class LinkedList:
    def __init__(self, head_node=None):
        self.head = head_node
        
    def push_on(self, new_value):
        new_node = Node(new_value)
        new_node.next = self.head
        self.head = new_node
        
    def traverse_list(self):
        node = self.head
        while node is not None:
            print(node)
            node = node.next
            
    def append(self, new_value):
        new_node = Node(new_value)
        if self.head is None:
            self.head = new_node
        else:
            node = self.head
            while node.next is not None:
                node = node.next
            node.next = new_node
            
    def get_node(self, value_to_get):
        node_to_check = self.head
        while node_to_check is not None:
            if node_to_check.value == value_to_get:
                return node_to_check
            node_to_check = node_to_check.next
        return None
    
    def insert_after(self, prev_value, new_value):
        prev_node = self.get_node(prev_value)
        if prev_node is None:
            print(f"{prev_value} is not in the linked list.")
        else:
            new_node = Node(new_value)
            new_node.next = prev_node.next
            prev_node.next = new_node
            
    def find_before(self, value_to_get):
        node = self.head
        # Guard against an empty list, where self.head is None
        while node is not None and node.next is not None:
            if node.next.value == value_to_get:
                return node
            node = node.next
        return None
            
    def remove(self, value_to_remove):
        pass
    

weekdays = LinkedList()
list_of_days = ['Sunday', 'Monday', 'Tuesday', 'Wednesday', 'Thursday', 'Friday', 'Saturday']
for day in list_of_days:
    weekdays.append(day)

weekdays.remove('Tuesday')

weekdays.traverse_list()

Sunday
Monday
Wednesday
Thursday
Friday
Saturday
