# Time/Space Complexity - Intro to Data Structures (User Defined)

### Topics to discuss today:

<ul>
    <li>Time and Space Complexity - What is it/How do we measure it</li>
    <li>Asymptotic Analysis</li>
    <li><strong>Data Structures</strong></li>
    <li>Some of the popular sorting algorithms</li>
</ul>

### Data Structures to discuss:
- Arrays
- Stacks
- Queues
- Linked Lists
    - Singly Linked Lists
    - Traversing A Linked List
    - Finding a node in a linked list
    - Adding to a linked list
- Binary Search Trees
    - Construction
    - Traversal


## Time and Space Complexity

#### What is it?

Time complexity measures how the running time of a given action (function) grows as the size of its input grows. In the same fashion, space complexity measures how much memory an algorithm or data structure needs. A problem can have multiple solutions, so finding the optimal one means analyzing each candidate in both time and space.

#### How do we measure Time and Space Complexity?

To measure time and space complexity we use asymptotic analysis. We need a mathematical way to compare different algorithms (functions) based on the size of their inputs. For example, one function might take a number of steps proportional to n, while another takes a number of steps proportional to n^2. With everything else held constant, the only thing that changes is the size of the input, so we compare how quickly each cost grows as n increases. Below is a chart of the most common growth rates, written in Big O notation.

<table style="text-align:center;" class="table table-bordered">
<tbody><tr>
<td>constant</td>
<td>−</td>
<td>Ο(1)</td>
</tr>
<tr>
<td>logarithmic</td>
<td>−</td>
<td>Ο(log n)</td>
</tr>
<tr>
<td>linear</td>
<td>−</td>
<td>Ο(n)</td>
</tr>
<tr>
<td>linear logarithmic</td>
<td>−</td>
<td>Ο(n log n)</td>
</tr>
<tr>
<td>quadratic</td>
<td>−</td>
<td>Ο(n<sup>2</sup>)</td>
</tr>
<tr>
<td>cubic</td>
<td>−</td>
<td>Ο(n<sup>3</sup>)</td>
</tr>
<tr>
<td>polynomial</td>
<td>−</td>
<td>n<sup>Ο(1)</sup></td>
</tr>
<tr>
<td>exponential</td>
<td>−</td>
<td>2<sup>Ο(n)</sup></td>
</tr>
</tbody></table>
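
To make a few of the growth rates in the chart concrete, here is a minimal sketch that counts loop steps instead of measuring wall-clock time. The function names (`count_logarithmic`, `count_linear`, `count_quadratic`) are made up for this illustration and are not part of the lesson code.

```python
def count_logarithmic(n):
    """O(log n): halve n until it reaches 1, counting the halvings."""
    steps = 0
    while n > 1:
        n //= 2
        steps += 1
    return steps


def count_linear(n):
    """O(n): one step per item."""
    steps = 0
    for _ in range(n):
        steps += 1
    return steps


def count_quadratic(n):
    """O(n^2): one step per pair of items (a loop nested inside a loop)."""
    steps = 0
    for _ in range(n):
        for _ in range(n):
            steps += 1
    return steps


# doubling n adds 1 logarithmic step, doubles the linear steps,
# and quadruples the quadratic steps
for n in (8, 16, 32):
    print(n, count_logarithmic(n), count_linear(n), count_quadratic(n))
```

Doubling the input is a quick way to spot which row of the chart a function belongs to: watch whether the step count barely moves, doubles, or quadruples.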

## Arrays

In Python, a list is a dynamic array, which means the block of memory expands as needed when items are added. In a traditional fixed-size array, the elements are stored next to each other in one consecutive block of memory, with each element typically taking 4 or 8 bytes (depending on the element type and the platform). Below is a diagram of how that looks under the hood:

<img src="http://www.mathcs.emory.edu/~cheung/Courses/170/Syllabus/09/FIGS/array02x.gif" style="height:250px; width:350px;">

## Which in Python looks like this:

In [2]:
array = [23, 45, 4, 6, 7]

print(array)

[23, 45, 4, 6, 7]
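
The "expands as needed" behaviour can be observed directly. The sketch below assumes CPython, where `sys.getsizeof` reports the bytes used by the list object itself, including any spare slots it has reserved. The exact numbers vary by Python version and platform, so treat the output as illustrative only.

```python
import sys

numbers = []
sizes = []

# append 20 items and record the list's size in bytes after each append
for i in range(20):
    numbers.append(i)
    sizes.append(sys.getsizeof(numbers))

print(sizes)

# the size only changes on some appends, because the list grabs
# extra room each time it has to grow
distinct_sizes = sorted(set(sizes))
print(f"{len(numbers)} appends, {len(distinct_sizes)} distinct sizes")
```

Because the list over-allocates, most appends just fill a spare slot, which is why appending is usually treated as constant time on average.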


### Let's take a look at some of the time and space analysis of arrays

In [37]:
# indexing a list
# constant time and space - O(1)
indexing = array[2]
print(indexing)

# search through a list
# linear time - O(n) and constant space - O(1)
for i in array:
    print(i)
    
# copy a list
# linear time and space - O(n)
copied_array = array[:]
print(copied_array)

# setting an index on a list
# constant time and space - O(1)
array[2] = 66
print(array)

4
23
45
4
6
7
[23, 45, 4, 6, 7]
[23, 45, 66, 6, 7]


## Stacks and Queues (Review)

**Stacks**, as the name suggests, are a data structure that follows the Last In First Out (LIFO) principle. Think of a stack of pancakes: the last pancake added sits on top and is the first one you take, so to reach the first pancake you start at the top and work your way down.

##### Searching through a stack will be Linear Time O(n) - Constant Space O(1)
##### Selecting the last item will be done in Constant Time O(1) - Constant Space O(1)
##### Adding to the stack should take Constant Time O(1) - Constant Space O(1)

**Queues** are similar, but they follow the First In First Out (FIFO) principle. Think of a line at a Black Friday sale: the first person camped out for the big-screen TV is the first to get it.

##### Searching through a queue will be Linear Time O(n) - Constant Space O(1)
##### Selecting the first item will be done in Constant Time O(1) - Constant Space O(1)
##### Adding to the queue should take Constant Time O(1) - Constant Space O(1)

In [44]:
from collections import deque

print('STACK')
stack = deque([10, 20, 30])

stack.append(40)
stack.append(50)

print(stack)

while stack:
    print(stack.pop())
    
print('='*45)

print('QUEUE')

queue = deque([])

queue.append('bob')
queue.append('ally')
queue.append('mark')
queue.append('fred')

print(queue)

while queue:
    print(queue.popleft())
    


STACK
deque([10, 20, 30, 40, 50])
50
40
30
20
10
=============================================
QUEUE
deque(['bob', 'ally', 'mark', 'fred'])
bob
ally
mark
fred
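
The "selecting" operations listed above can also be done without removing anything. This is a small sketch using the same `deque` type; the variable names `top` and `front` are just for illustration.

```python
from collections import deque

stack = deque([10, 20, 30])
queue = deque(['bob', 'ally', 'mark'])

# peek at the item pop() would return, without removing it - O(1)
top = stack[-1]

# peek at the item popleft() would return, without removing it - O(1)
front = queue[0]

print(top, len(stack))
print(front, len(queue))

# searching still has to look at items one by one - O(n)
print(20 in stack)
```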


## Linked List (Data Structure)

A linked list is built from nodes. Each node is an instance of a Node class that stores a value and a reference to the next node. A second class, the linked list itself, keeps a reference to the first node (the head) and provides methods that follow the `next` references from node to node.

There are some advantages and disadvantages with this data structure. **Advantages** Linked Lists are flexible with memory: nodes are allocated one at a time and do not need one consecutive block, and adding a node at the front takes constant time. **Disadvantages** There is no indexing, so finding a value or adding to the end requires traversing the list from the head, which takes linear time.

In [49]:
# Traversing a linked list takes linear time and constant extra space - O(n) time, O(1) space

class LinkedListNode:
    def __init__(self, value):
        self.value = value
        self.next = None

    def traverse_list(self):
        node = self
        while node is not None:
            print(node.value)
            node = node.next


node1 = LinkedListNode('Monday')
node2 = LinkedListNode('Tuesday')
node3 = LinkedListNode('Wednesday')

node1.next = node2
node2.next = node3

node1.traverse_list()

Monday
Tuesday
Wednesday


In [15]:
# complete implementation of linked list

# 2 classes - node class and linked list class

class Node:
    def __init__(self, value):
        self.value = value
        self.next = None
        
    def __str__(self): # called when printed
        return str(self.value) # __str__ must return a string, even when value is a number
    
    def __repr__(self): # called when only class instance is input w/o print
        return f'<Node|{self.value}>'
    
    
class LinkedList:
    def __init__(self):
        # head attr will point to the first node of the linked list
        self.head = None
        
    # method that will return a node with value or None if no node w that value
    def _get_node(self, value_to_get):
        # start w the first node in our linked list
        check = self.head
        # while check is still a node
        while check is not None:
            # if the value of the node we are checking is equal to the value for which we are searching
            if check.value == value_to_get:
                # return that node
                return check
            # if not. move onto the next node
            check = check.next
        # once check is None, we know that the value to get is not in linked list, return None
        return None
        
    # add a new node to the front of the linked list
    def push_on(self, new_value):
        # create a new node w the value passed in
        new_node = Node(new_value)
        # set the new node's next attr to be the current head
        new_node.next = self.head
        # set the new node to the front of the list (aka the head)
        # both lines are needed: the line above links the new node to the old list, this one makes it the new head
        self.head = new_node
        
    # add a new node to the end of the linked list
    def append(self, new_value):
        # create a new node w the value passed in
        new_node = Node(new_value)
        
        # check if the linked list is empty
        if self.head is None:
            # set the head to our new node
            self.head = new_node
            # no .next assignment is needed here: new_node.next is already None, which marks it as the last node
        # if not empty
        else:
            # traverse to the end of the list (aka the node.next is None)
            node = self.head
            # while the node we are looking at has a .next attr
            while node.next is not None:
                # move on to the next node
                node = node.next
            # set the last node's .next attr to our new node
            node.next = new_node
            
    # add a new node to our linked list after a certain node (by value)
    def insert_after(self, prev_value, new_value):
        prev_node = self._get_node(prev_value)
        # check if the prev_node exists
        if prev_node is None:
            print(f"{prev_value} is not in linked list")
            return
        
        # create a new node w the new value passed in
        new_node = Node(new_value)
        # point the new node's .next attr to the prev_node's .next attr
        new_node.next = prev_node.next
        # point the prev_node's .next attr to the new node, which places it directly after prev_node
        prev_node.next = new_node
        
    def traverse_list(self):
        node = self.head
        # while node is a node and not None
        while node:
            print(node) # print node (calls Node.__str__ method)
            node = node.next # go to the next node
    
    
# node1 = Node('Monday')
# print(node1)
# node1

weekdays = LinkedList()
weekdays.push_on('Wednesday')
weekdays.push_on('Monday')
weekdays.append('Thursday')
weekdays.append('Friday')
weekdays.insert_after('Monday', 'Tuesday')

weekdays.traverse_list()

Monday
Tuesday
Wednesday
Thursday
Friday
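
The `append` method above walks the whole list to find the last node, so it is O(n). One common variation, which is not part of the class code above, keeps an extra `tail` reference so appending becomes O(1). The class name `TailedLinkedList` and the helper `to_list` are hypothetical names for this sketch.

```python
class Node:
    def __init__(self, value):
        self.value = value
        self.next = None


class TailedLinkedList:
    """Linked list that remembers its last node (illustrative variant)."""

    def __init__(self):
        self.head = None
        self.tail = None

    def append(self, new_value):
        new_node = Node(new_value)
        if self.head is None:
            # empty list: the new node is the first node
            self.head = new_node
        else:
            # no traversal needed: link directly from the current last node
            self.tail.next = new_node
        # either way, the new node is now the last node
        self.tail = new_node

    def to_list(self):
        # collect the values into a regular list so they are easy to print
        values = []
        node = self.head
        while node is not None:
            values.append(node.value)
            node = node.next
        return values


days = TailedLinkedList()
for day in ('Monday', 'Tuesday', 'Wednesday'):
    days.append(day)

print(days.to_list())
```

The trade-off is bookkeeping: any other method that changes the end of the list would also have to keep `tail` up to date.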


#### Problem 1: Linked Lists

Using the above examples as a guide, create your own interpretation of a Linked List class. You cannot use the code above exactly, but again it can be used as a guide. This problem requires you to think about how a linked list works and create one using your own logic.

*Remember* A Linked List is a chain of Nodes in which each node points to the next node. The head of an empty list starts out as None, and the last node's next reference is None.

Your Linked List should have a traverse method and the ability to add a new node.

In [78]:
# NOT GOING TO BE GRADED. JUST TRY BUILDING IT AND PLAYING AROUND

# need to test and play around with the concept of linked list first.
# linked list: list of Nodes that POINT to the next node in the chain. first node starts out empty and each node after 
#    points to the next

# so, need to create a NODE CLASS and test that first w other inputs...

# class should contain self and a value for the node...planets
class LList:
    def __init__(self, planet):
        self.planet = planet
        self.next = None # this indicates the default for this instance is None...
        
    def traverse(self): # only self is needed: it is the starting node of the walk
        node = self # walk with a local variable instead of reassigning self
        while node is not None: # keep going until we step past the last node
            print(f'I am currently {node.planet}')
            node = node.next # move to the next planet (None once we pass the last one)
            if node is not None: # guard: after the last node, node is None and has no .planet
                print(f'''I am now {node.planet} {node} and will next turn to {node.next}\n{"="*50}''')
            
    def __str__(self):
        return f'{self.planet}'
            
mercury = LList('mercury')
venus = LList('venus')
earth = LList('earth')
mars = LList('mars')
jupiter = LList('jupiter')

mercury.next = venus
venus.next = earth
earth.next = mars
mars.next = jupiter

mercury.traverse()

I am currently mercury
I am now venus venus and will next turn to earth
==================================================
I am currently venus
I am now earth earth and will next turn to mars
==================================================
I am currently earth
I am now mars mars and will next turn to jupiter
==================================================
I am currently mars
I am now jupiter jupiter and will next turn to None
==================================================
I am currently jupiter

In [25]:
import time

big_set = {x for x in range(1000000)}

big_list = [v for v in range(1000000)]

start = time.time()

print(999999 in big_set)

end = time.time()

print(end - start)


start = time.time()

print(999999 in big_list)

end = time.time()

print(end - start)

True
0.0
True
0.015956878662109375
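
A reading of `0.0` most likely means the set lookup finished faster than the clock behind `time.time()` could resolve on that machine. As a hedged alternative, the sketch below uses the standard-library `timeit` module, which repeats the lookup and uses a higher-resolution timer. The size (200,000) and repeat count (20) are arbitrary choices for this example, and the actual timings will differ from machine to machine.

```python
import timeit

size = 200_000
big_set = set(range(size))
big_list = list(range(size))

# the last value is the worst case for the list: every item is checked first
target = size - 1

# a set lookup hashes the value and jumps to it - O(1) on average
set_seconds = timeit.timeit(lambda: target in big_set, number=20)

# a list lookup compares items one by one from the front - O(n)
list_seconds = timeit.timeit(lambda: target in big_list, number=20)

print(f"set:  {set_seconds:.6f} seconds for 20 lookups")
print(f"list: {list_seconds:.6f} seconds for 20 lookups")
```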


## Binary Search Trees

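<p>The advantage of a binary search tree is that a search does not go through every item. Each comparison discards one whole subtree, so in a reasonably balanced tree the remaining candidates are roughly halved at every step and a lookup takes logarithmic time, O(log n).</p>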
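
How close a lookup gets to logarithmic time depends on the shape of the tree. The sketch below is a stripped-down tree written only for this illustration (the names `TreeNode`, `height`, and `build` are hypothetical). It compares the height of a tree built from well-mixed values with one built from already sorted values.

```python
class TreeNode:
    def __init__(self, value):
        self.value = value
        self.left = None
        self.right = None

    def insert(self, new_value):
        # smaller values go left, greater-or-equal values go right
        if new_value < self.value:
            if self.left is None:
                self.left = TreeNode(new_value)
            else:
                self.left.insert(new_value)
        else:
            if self.right is None:
                self.right = TreeNode(new_value)
            else:
                self.right.insert(new_value)


def height(node):
    # number of nodes on the longest path from this node down to a leaf
    if node is None:
        return 0
    return 1 + max(height(node.left), height(node.right))


def build(values):
    # insert the values one at a time, in the order given
    root = TreeNode(values[0])
    for value in values[1:]:
        root.insert(value)
    return root


# 15 values inserted middle-first: every level fills up completely
balanced = build([8, 4, 12, 2, 6, 10, 14, 1, 3, 5, 7, 9, 11, 13, 15])

# the same 15 values inserted in sorted order: every node goes to the right
skewed = build(list(range(1, 16)))

print(height(balanced))  # 4
print(height(skewed))    # 15
```

A search visits at most one node per level, so the first tree needs at most 4 comparisons while the second can need all 15, which is the same as scanning a linked list.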

In [28]:
class BST: # BinarySearchTree
    def __init__(self, value):
        self.value = value
        self.left = None
        self.right = None
        
    # add a new node to the tree
    def insert(self, new_value):
        # if the new value is < the current node's value
        if new_value < self.value:
            # if the current node has no left subtree
            if self.left is None:
                # set the left subtree to be a new instance of BST w the new value
                self.left = BST(new_value)
            # if the node does have a left subtree
            else:
                self.left.insert(new_value)
        # if new value is >= current node's value
        else:
            if self.right is None:
                self.right = BST(new_value)
            else:
                self.right.insert(new_value)
                
    # return True or False if value is in tree
    def contains(self, target):
        # if target is < node value
        if target < self.value:
            # if node's left value is empty
            if self.left is None:
                # we know the target is not in the tree bc it would be here
                return False
            else:
                return self.left.contains(target)
        # if target value is > the node's value
        elif target > self.value:
            if self.right is None:
                return False
            else:
                return self.right.contains(target)
        # if target's value is equal to the node's value
        else:
            return True
    
    # method to get the max value of a tree
    def get_max_value(self):
        # if the node has no right value, we know it's the largest in the tree
        if self.right is None:
            return self.value
        # otherwise, move to the right node and check again
        else:
            return self.right.get_max_value()
    
    # method to get the min value of a tree
    def get_min_value(self):
        # if the node has no left value, we know it's the smallest in the tree
        if self.left is None:
            return self.value
        # otherwise, move to the left node and check again
        else:
            return self.left.get_min_value()
        
    # remove a node from the tree
    def remove(self, value_to_remove, parent=None):
        # move left or right until we find the node to delete
        if value_to_remove < self.value:
            # move left
            if self.left is not None: # if self.left is something (aka a node and not Nonetype)
                # call the remove method w the left node as self and current node as parent
                self.left.remove(value_to_remove, self) # THIS SETS THE PARENT TO SELF
        elif value_to_remove > self.value:
            # move right
            if self.right is not None:
                self.right.remove(value_to_remove, self) # THIS SETS THE PARENT TO SELF
        # when we finally find the node to delete
        else:
            # if the node to delete has both a left and right subtree
            if self.left is not None and self.right is not None:
                # find the largest value in the left subtree and copy value to the current node
                self.value = self.left.get_max_value()
                # then delete that copied value from the left subtree so it is not in the tree twice
                self.left.remove(self.value, self)
            # if the left or right is None but node has no parent (aka it is the root)
            elif parent is None:
                # if the right side is blank
                if self.left is not None:
                    # pull the left child up into the root (set right before left, since both read from self.left)
                    self.value = self.left.value
                    self.right = self.left.right
                    self.left = self.left.left
                # if the left side is blank
                elif self.right is not None:
                    # pull the right child up into the root (set left before right, since both read from self.right)
                    self.value = self.right.value
                    self.left = self.right.left
                    self.right = self.right.right
                # if both left and right side are None
                else:
                    self.value = None
            # if the node to delete is to the left of the parent
            elif parent.left is self:
                # replace it with its only subtree (or None if it has no subtree)
                if self.left is not None:
                    parent.left = self.left
                else:
                    parent.left = self.right
            # if the node to delete is to the right of the parent
            elif parent.right is self:
                # replace it with its only subtree (or None if it has no subtree)
                if self.left is not None:
                    parent.right = self.left
                else:
                    parent.right = self.right
                

my_tree = BST(50)
my_tree.insert(25)
my_tree.insert(35)
my_tree.insert(15)
my_tree.insert(75)
print(my_tree.contains(35))
print(my_tree.contains(5))
print(my_tree.get_max_value())
print(my_tree.get_min_value())

                

True
False
75
15
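
Traversal is on the topic list but is not implemented in the class above. As a sketch, an in-order traversal (left subtree, then the node, then right subtree) visits the values of a binary search tree in sorted order. The names `TreeNode` and `in_order` are hypothetical, and the tree is wired up by hand to mirror the shape that inserting 50, 25, 35, 15, 75 produces.

```python
class TreeNode:
    def __init__(self, value):
        self.value = value
        self.left = None
        self.right = None


def in_order(node):
    # an empty subtree contributes no values
    if node is None:
        return []
    # everything smaller first, then this node, then everything larger
    return in_order(node.left) + [node.value] + in_order(node.right)


root = TreeNode(50)
root.left = TreeNode(25)
root.right = TreeNode(75)
root.left.left = TreeNode(15)
root.left.right = TreeNode(35)

print(in_order(root))  # [15, 25, 35, 50, 75]
```

Every node is visited exactly once, so the traversal takes linear time, O(n).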


# Homework

#### Problem 2: Binary Search Tree

Using the above examples as a guide, create your own interpretation of a Binary Search Tree class. You cannot use the code above exactly, but again it can be used as a guide. This problem requires you to think about how a Binary Search Tree works and create one using your own logic.

*Remember* Binary Search Trees start with a root node. Every value in a node's left subtree is smaller than the node's value, and every value in its right subtree is greater (or equal, if duplicates are sent to the right). The far-left node (if one exists) holds the lowest value in the tree, and the far-right node (if one exists) holds the greatest.

In [79]:
# NOT GOING TO BE GRADED. JUST TRY BUILDING IT AND PLAYING AROUND

class BST:
    
    def __init__(self, value):
        # each time a new node is created, it needs a value (int); its left and right children are empty (None) by default
        self.value = value   # current node's value
        self.left = None     # will hold another BST instance (the child node down-left) once a smaller value is added; None means no left child
        self.right = None    # will hold another BST instance (the child node down-right) once a greater-or-equal value is added; None means no right child
        
        
    # add a new node to the tree
    def add_node(self, node_to_be):
        # if the new_node's value is less than current node value, and node.left is None, add new node to current left value
        if node_to_be < self.value:
            if self.left is None:
                self.left = BST(node_to_be)
            else:
                self.left.add_node(node_to_be) # recursion: this will keep comparing the value through left and right until a node's self.left/right is None
        else: # node_to_be >= self.value, so duplicates go to the right
            if self.right is None:
                self.right = BST(node_to_be)
            else:
                self.right.add_node(node_to_be)

    # return True or False if value is in tree
    

    # method to get the max value of a tree
    

    # method to get the min value of a tree
    

    # remove a node from the tree
    

root = BST(50)
root.add_node(25)
root.add_node(49)
root.add_node(10)
root.add_node(25)
root.add_node(20)