# Time/Space Complexity - Intro to Data Structures (User Defined)

### Topics to discuss today:

<ul>
    <li>Time and Space Complexity - What is it/How do we measure it</li>
    <li>Asymptotic Analysis</li>
    <li><strong>Data Structures</strong></li>
    <li>Some of the popular sorting algorithms</li>
</ul>

### Data Structures to discuss:
- Arrays
- Stacks
- Queues
- Linked Lists
    - Singly Linked Lists
    - Traversing A Linked List
    - Finding a node in a linked list
    - Adding to a linked list


## Time and Space Complexity

#### What is it?

Time complexity measures how the running time of a given action (function) grows as the size of its input grows. Space complexity measures, in the same fashion, how much memory an algorithm or data structure needs. A problem can have multiple solutions, so finding the optimal one means analyzing each candidate in terms of both time and space.

#### How do we measure Time and Space Complexity?

To measure time and space complexity we use asymptotic analysis. We need a mathematical way to compare different algorithms (functions) based on the size of their inputs. For example, one function might take a number of steps proportional to n, while another takes a number of steps proportional to n^2. With everything else held constant, the only thing that changes is the size of the input. The chart below lists the common complexity classes used in asymptotic analysis.

In [None]:
# asymptotic analysis focuses on analyzing our time/space complexity as our number of inputs scales toward infinity
    # aka we only care about really large numbers of inputs
    # the benefit of this is that we can do some hand-wavy simplification
    # constant factors and lower-order terms are ignored - only the fastest-growing term matters
        # we're following the idea/principle that infinity * infinity = infinity^2
        # but infinity + infinity or 2*infinity = infinity
        
        # remember back to the example of 2 separate for loops vs. nested for loops
            # 2 separate for loops always add 2 steps for each additional input
            # whereas with nested for loops, each additional input has an ever-increasing impact
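
The comparison in the comments above between two separate loops and nested loops can be made concrete by counting steps. This is only an illustrative sketch; the function names `count_separate_loops` and `count_nested_loops` are made up for this example.

```python
def count_separate_loops(n):
    """Count the steps taken by two loops that run one after the other."""
    steps = 0
    for _ in range(n):   # first loop: n steps
        steps += 1
    for _ in range(n):   # second loop: n more steps
        steps += 1
    return steps         # 2n in total, which simplifies to O(n)


def count_nested_loops(n):
    """Count the steps taken by one loop nested inside another."""
    steps = 0
    for _ in range(n):       # outer loop: n iterations
        for _ in range(n):   # inner loop: n steps per outer iteration
            steps += 1
    return steps             # n * n in total, which is O(n^2)


for n in (10, 100, 1000):
    print(n, count_separate_loops(n), count_nested_loops(n))
```

Each extra input adds exactly 2 steps to the separate loops, while the cost of an extra input in the nested version keeps growing with n.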

<table style="text-align:center;" class="table table-bordered">
<tbody><tr>
<td>constant</td>
<td>−</td>
<td>Ο(1)</td>
</tr>
<tr>
<td>logarithmic</td>
<td>−</td>
<td>Ο(log n)</td>
</tr>
<tr>
<td>linear</td>
<td>−</td>
<td>Ο(n)</td>
</tr>
<tr>
<td>linearithmic</td>
<td>−</td>
<td>Ο(n log n)</td>
</tr>
<tr>
<td>quadratic</td>
<td>−</td>
<td>Ο(n<sup>2</sup>)</td>
</tr>
<tr>
<td>cubic</td>
<td>−</td>
<td>Ο(n<sup>3</sup>)</td>
</tr>
<tr>
<td>polynomial</td>
<td>−</td>
<td>n<sup>Ο(1)</sup></td>
</tr>
<tr>
<td>exponential</td>
<td>−</td>
<td>2<sup>Ο(n)</sup></td>
</tr>
</tbody></table>
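
To get a feel for how differently the classes in the chart grow, the sketch below evaluates a representative function from each class for a few input sizes. The helper name `growth` is an assumption for this example, and base-2 logarithms are used only for convenience (the base does not change the complexity class). The general polynomial row is left out because quadratic and cubic already represent it.

```python
import math


def growth(n):
    """Return the value of one representative function per complexity class."""
    return {
        "O(1)": 1,
        "O(log n)": math.log2(n),
        "O(n)": n,
        "O(n log n)": n * math.log2(n),
        "O(n^2)": n ** 2,
        "O(n^3)": n ** 3,
        "O(2^n)": 2 ** n,
    }


for n in (10, 20, 40):
    print(f"n = {n}")
    for name, value in growth(n).items():
        print(f"    {name:<11}{value:,.0f}")
```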

## Arrays

In Python we benefit from the dynamic array, which means the block of memory expands as needed when items are added to the array. In a traditional array, each element occupies a fixed number of bytes (commonly 4 or 8, depending on the element type and platform), and the elements are stored in consecutive memory locations. Below is a diagram of how that looks under the hood:

<img src="http://www.mathcs.emory.edu/~cheung/Courses/170/Syllabus/09/FIGS/array02x.gif" style="height:250px; width:350px;">

## Which in Python looks like this:

In [1]:
mylist = ['Fennec Fox', 'Arctic Fox', 'Tibetan Fox', 'Red Fox', 'Grey Fox']

# dynamic increases in array size are allowed,
# and different datatypes are allowed in the same array
mylist.append(8)
print(mylist)

['Fennec Fox', 'Arctic Fox', 'Tibetan Fox', 'Red Fox', 'Grey Fox', 8]
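
One way to watch the dynamic array expand is to check the size of the list object while appending. The sketch below assumes CPython, where `sys.getsizeof` reports the memory reserved by the list object itself (not the items it refers to). The exact byte counts depend on the Python version and platform, so no output is shown here.

```python
import sys

numbers = []
previous_size = sys.getsizeof(numbers)
resize_count = 0

for i in range(64):
    numbers.append(i)
    current_size = sys.getsizeof(numbers)
    if current_size != previous_size:
        # the list object grew: extra capacity was reserved for future appends
        resize_count += 1
        print(f"length {len(numbers):>2}: {previous_size} -> {current_size} bytes")
        previous_size = current_size

print(f"{len(numbers)} appends, {resize_count} size changes")
```

The size changes on only some of the appends, because each expansion reserves room for several future items.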


### Let's take a look at some of the time and space analysis of arrays

In [7]:
# creation of a list - linear process for the total number of values in the list - O(n) time and space
mylist = ['Fennec Fox', 'Arctic Fox', 'Tibetan Fox', 'Red Fox', 'Grey Fox']
# copying a list - linear process - O(n) time and space
listcopy = mylist[::]

# indexing into a list - O(1) constant time operation
mylist[3]

# searching a list/looping a list - O(n) linear time processes - we have to take a step for each item in the list
for i in mylist:
    pass
mylist.index('Red Fox')
mylist.count('Grey Fox')
if 'Panda' in mylist: # membership test in a list O(n)
    pass

# adding a value to a list - efficiency depends on where you are adding the value
mylist.insert(1, 'Desert Fox') # O(n) linear time process unless adding to the end of the list
#print(mylist)
mylist.append('Panda') # O(1)* usually a constant time process to add to the end of the list
    # technically called an amortized O(1) process
    # meaning it is usually a constant time O(1) process, but an occasional append is linear O(n) when the list must be resized

# removing a value from a list
# .remove() - removes based on a value - O(n) -> must search for the value then move other values
mylist.remove('Panda')
# .pop() - removes based on an index number -> no searching required
# the default behavior of .pop() is to remove the last value in the list - O(1) constant time
mylist.pop()
# mylist.pop(0) - popping from the front of the list is a O(n) linear process - must move all the other values

# list comprehensions - why are they the preferred method for creating a list from another iterable?
# the answer comes down to speed and readability, not to a different big-O

# non-list comprehension - taking advantage of the dynamic array sizing
# aka the list is getting larger each time we append
# and the list occasionally needs to be resized (and possibly moved in memory) as it grows
newlist = [] # O(1)
for i in mylist: # O(n)
    newlist.append(i) # O(1)*
print(newlist)

# a list comprehension builds the same list, and in CPython it also grows the list as it goes
# it does not reserve the final size ahead of time
# but it avoids looking up and calling the .append method on every iteration, so it is typically faster
# and it expresses the whole transformation in a single readable expression
newlist = [x for x in mylist]
# that's why list comprehensions are considered best practice for creating a list from another iterable
# list comprehension (depending on your transformation or conditional) - O(n) linear time and space


['Fennec Fox', 'Desert Fox', 'Arctic Fox', 'Tibetan Fox', 'Red Fox']
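
The "amortized O(1)" label on `.append()` can be illustrated with a toy dynamic array that counts how many element copies its resizes cost. This is a teaching sketch, not how Python lists are implemented: the class name `DynamicArray`, its attributes, and the capacity-doubling rule are all assumptions chosen to keep the arithmetic simple (CPython uses its own growth pattern).

```python
class DynamicArray:
    """A toy dynamic array that doubles its capacity when it runs out of room."""

    def __init__(self):
        self.capacity = 1                      # slots currently reserved
        self.length = 0                        # slots actually in use
        self.slots = [None] * self.capacity    # stands in for a fixed-size block of memory
        self.copies = 0                        # element copies caused by resizing so far

    def append(self, value):
        if self.length == self.capacity:
            self._resize(2 * self.capacity)    # the occasional O(n) step
        self.slots[self.length] = value        # the usual O(1) step
        self.length += 1

    def _resize(self, new_capacity):
        new_slots = [None] * new_capacity
        for i in range(self.length):           # copy every existing value to the new block
            new_slots[i] = self.slots[i]
            self.copies += 1
        self.slots = new_slots
        self.capacity = new_capacity


arr = DynamicArray()
for i in range(1000):
    arr.append(i)

print(arr.length, arr.capacity, arr.copies)    # 1000 1024 1023
```

Across 1000 appends the resizes cost 1023 copies in total, which is about one extra copy per append on average. That average is what "amortized constant time" refers to.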


## Stacks and Queues

**Stacks**, as the name suggests, are a data structure in which data follows the Last In First Out (LIFO) principle. Think of a stack of pancakes, for example. The last pancake placed on the stack is the first one taken off, so to get to the first pancake you would start at the top and work your way down.

##### Searching through a stack will be Linear Time O(n) - Constant Space O(1)
##### Selecting the last item will be done in Constant Time O(1) - Constant Space O(1)
##### Adding to the stack should take Constant Time O(1) - Constant Space O(1)
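
A minimal sketch of the LIFO behavior described above, using a Python list as the underlying storage. The class name `Stack` and its method names are assumptions for this example rather than a standard library API.

```python
class Stack:
    """A minimal LIFO stack backed by a Python list."""

    def __init__(self):
        self._items = []

    def push(self, value):
        self._items.append(value)    # add to the top - O(1)*

    def pop(self):
        return self._items.pop()     # remove from the top - O(1)

    def peek(self):
        return self._items[-1]       # look at the top without removing it - O(1)

    def is_empty(self):
        return not self._items


pancakes = Stack()
for name in ('first pancake', 'second pancake', 'third pancake'):
    pancakes.push(name)

served = []
while not pancakes.is_empty():
    served.append(pancakes.pop())
print(served)   # ['third pancake', 'second pancake', 'first pancake']
```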

**Queues** are similar, but in this case they follow the First In First Out (FIFO) principle. Think of this as a line at a Black Friday sale. The first person camped out for the big-screen TV is the first to get it.

##### Searching through a queue will be Linear Time O(n) - Constant Space O(1)
##### Selecting the first item will be done in Constant Time O(1) - Constant Space O(1)
##### Adding to the queue should take Constant Time O(1) - Constant Space O(1)

In [10]:
# adding and removing from the end of a stack is a constant time O(1) operation
# python lists are an implementation of a stack - .append() and .pop()
mylist = ['a', 'b', 'c']
mylist.append('d') # adding to the end of the list O(1)*
mylist.pop() # removal from the end of list in O(1)

# by contrast, a queue needs to support constant time removal from the front of the queue
# and constant time adding to the end of the queue
# in other words, a python list is NOT an efficient queue -> python lists do not support O(1) constant time removal from the start
# python's built-in types do not include a queue, but the standard library does
# if you have a scenario where you need a queue, I personally use the collections module deque

from collections import deque

mydeque = deque()
print(mydeque)
mydeque.append('Fennec Fox')
mydeque.appendleft('Panda')
print(mydeque)
mydeque.popleft()
print(mydeque)

deque([])
deque(['Panda', 'Fennec Fox'])
deque(['Fennec Fox'])
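
To make the First In First Out order explicit, a `deque` can be wrapped in a small class that only exposes queue operations. This is a sketch; the class name `Queue` and the method names `enqueue` and `dequeue` are assumptions for this example.

```python
from collections import deque


class Queue:
    """A minimal FIFO queue backed by collections.deque."""

    def __init__(self):
        self._items = deque()

    def enqueue(self, value):
        self._items.append(value)        # join the back of the line - O(1)

    def dequeue(self):
        return self._items.popleft()     # leave from the front of the line - O(1)

    def __len__(self):
        return len(self._items)


line = Queue()
for shopper in ('Ana', 'Ben', 'Cho'):
    line.enqueue(shopper)

served = [line.dequeue() for _ in range(len(line))]
print(served)   # ['Ana', 'Ben', 'Cho']
```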


## Linked List (Data Structure)

A linked list is built from nodes. We create a Node class that holds a value and a reference to the next node, and then create another class that uses these Node objects. Each node's `next` attribute points to the next data element in the list.

There are some advantages and disadvantages with this data structure. **Advantages:** Linked lists are flexible with memory management: the nodes do not need to sit in one consecutive block of memory, and adding a node at a known position never requires moving the other elements. **Disadvantages:** Finding a value, or adding to the end of the list, requires traversing the list node by node.

In [3]:
# An implementation of a singly-linked list

# 2 components - a Node class and LinkedList class

class Node:
    def __init__(self, value):
        self.value = value
        self.next = None

class LinkedList:
    def __init__(self):
        self.head = None
    
    def pushOn(self, new_value):
        """
        O(1)
        adding a head/changing to a new head for the linked list
        """
        # create a new node
        new_node = Node(new_value)
        # set the new node's next to the current head
        new_node.next = self.head
        # set the head to the new node
        self.head = new_node
    
    def insertAfter(self, prev_node, new_value):
        """
        O(1)
        inserting a value after a specific node in our linked list
        """
        if not prev_node:
            print("The given previous node must exist!")
            return
        
        # if the previous node does exist, then we need to create the new node
        new_node = Node(new_value)
        
        # update the new node's next value
        new_node.next = prev_node.next
        # update the prev_node's next value
        prev_node.next = new_node
    
    def append(self, new_value):
        """
        O(n) Θ(n) Ω(n)
        adding a new value to the end of the linked list
        """
        new_node = Node(new_value)
        
        # check if the linkedList even has values - if there is no head, we just use this new node as the head
        if self.head is None:
            self.head = new_node
            return
        # BUT if the linkedList is not empty, we must traverse until we find the end
        last = self.head
        while last.next:
            last = last.next
        
        # once the loop is done, we've found the last node
        # change the current last node's next to the new node
        last.next = new_node
        
    def traverse(self):
        """
        one step for each item in the linked list - linear
        O(n) Θ(n) Ω(n)
        run through all of our values in the linked list
        """
        pointer = self.head
        while pointer:
            print(f'The day of the week is: {pointer.value}.')
            pointer = pointer.next
    
    def search(self, target):
        """
        O(n) Θ(n) Ω(1)
        determine if a value is in our linked list
        """
        pointer = self.head
        while pointer:
            if pointer.value == target:
                print(f'{target} was found in the LinkedList!')
                return
            pointer = pointer.next
        print(f'{target} was not in the LinkedList :(')
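
The `search` method above only prints a message. When the goal is to do something with the node, such as passing it to `insertAfter`, a variant that returns the node can be more convenient. The sketch below is self-contained (it repeats the same `Node` class), and the helper name `find_node` is an assumption for this example.

```python
class Node:
    def __init__(self, value):
        self.value = value
        self.next = None


def find_node(head, target):
    """Return the first node whose value equals target, or None - O(n)."""
    pointer = head
    while pointer:
        if pointer.value == target:
            return pointer
        pointer = pointer.next
    return None


# build Sunday -> Monday -> Wednesday by hand
head = Node('Sunday')
head.next = Node('Monday')
head.next.next = Node('Wednesday')

monday = find_node(head, 'Monday')
# splice Tuesday in after Monday, the same two steps insertAfter performs
tuesday = Node('Tuesday')
tuesday.next = monday.next
monday.next = tuesday

print(monday.next.value, monday.next.next.value)   # Tuesday Wednesday
print(find_node(head, 'Friday'))                   # None
```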

In [6]:
weekdays = LinkedList()
print(weekdays.head)
weekdays.pushOn('Monday')
print(weekdays.head.value)
weekdays.pushOn('Sunday') # change head to Sunday, Sunday node now points to the old head (Monday)
print(weekdays.head.value, weekdays.head.next.value)
# insert Tuesday after Monday
# the only access we have to the LinkedList is through the head
# so we know Monday as weekdays.head.next
weekdays.insertAfter(weekdays.head.next, 'Tuesday')
print(weekdays.head.value, weekdays.head.next.value, weekdays.head.next.next.value)
# let's try appending to the end of the linkedlist
weekdays.append('Wednesday')
print(weekdays.head.value, weekdays.head.next.value, weekdays.head.next.next.value, weekdays.head.next.next.next.value)

weekdays.traverse()

weekdays.search('Friday')

weekdays.append('Thursday')
weekdays.append('Friday')
weekdays.append('Saturday')
weekdays.traverse()
weekdays.search('Friday')

print(weekdays.head.next.next.next.next.next.value)

None
Monday
Sunday Monday
Sunday Monday Tuesday
Sunday Monday Tuesday Wednesday
The day of the week is: Sunday.
The day of the week is: Monday.
The day of the week is: Tuesday.
The day of the week is: Wednesday.
Friday was not in the LinkedList :(
The day of the week is: Sunday.
The day of the week is: Monday.
The day of the week is: Tuesday.
The day of the week is: Wednesday.
The day of the week is: Thursday.
The day of the week is: Friday.
The day of the week is: Saturday.
Friday was found in the LinkedList!
Friday
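
The `append` method above is O(n) because it walks the list to find the last node. One common variation, sketched here under the assumption that the list keeps an extra `tail` reference, makes appending O(1). The class name `TailLinkedList` is hypothetical, and the sketch repeats the `Node` class so that it runs on its own.

```python
class Node:
    def __init__(self, value):
        self.value = value
        self.next = None


class TailLinkedList:
    """A singly linked list that also remembers its last node."""

    def __init__(self):
        self.head = None
        self.tail = None

    def append(self, new_value):
        """O(1): no traversal, because the last node is already known."""
        new_node = Node(new_value)
        if self.head is None:
            # empty list: the new node is both the head and the tail
            self.head = new_node
        else:
            # link the current last node to the new node
            self.tail.next = new_node
        self.tail = new_node

    def __iter__(self):
        """Yield each value from head to tail - O(n)."""
        pointer = self.head
        while pointer:
            yield pointer.value
            pointer = pointer.next


days = TailLinkedList()
for day in ('Sunday', 'Monday', 'Tuesday'):
    days.append(day)

print(list(days))        # ['Sunday', 'Monday', 'Tuesday']
print(days.tail.value)   # Tuesday
```

The trade-off is a little extra bookkeeping: any other method that can change the last node would also need to keep `tail` up to date.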
