# Time/Space Complexity - Intro to Data Structures (User Defined)

### Topics to discuss today:

<ul>
    <li>Time and Space Complexity - What is it/How do we measure it</li>
    <li>Asymptotic Analysis</li>
    <li><strong>Data Structures</strong></li>
</ul>

### Data Structures to discuss:
- Arrays
- Stacks
- Queues
- Linked Lists
    - Singly Linked Lists
    - Traversing A Linked List
    - Finding a node in a linked list
    - Adding to a linked list


## Time and Space Complexity

#### What is it?

Time complexity measures how the running time of an action (function) grows as the size of its input grows. In the same fashion, space complexity measures how much memory an algorithm or data structure needs as the input grows. A problem can have multiple solutions, so finding the optimal one means analyzing each candidate in terms of both time and space.

#### How do we measure Time and Space Complexity?

To measure time and space complexity we use asymptotic analysis. We need a mathematical way to compare different algorithms (functions) based on the size of their inputs, independent of the machine they run on. For example, one function might take a number of steps proportional to n while another takes a number proportional to n^2. With everything else held constant, the only thing that changes is the size of the input, n. The chart below lists the most common growth classes in Big O notation.

<table style="text-align:center;" class="table table-bordered">
<tbody><tr>
<td>constant</td>
<td>−</td>
<td>Ο(1)</td>
</tr>
<tr>
<td>logarithmic</td>
<td>−</td>
<td>Ο(log n)</td>
</tr>
<tr>
<td>linear</td>
<td>−</td>
<td>Ο(n)</td>
</tr>
<tr>
<td>linearithmic</td>
<td>−</td>
<td>Ο(n log n)</td>
</tr>
<tr>
<td>quadratic</td>
<td>−</td>
<td>Ο(n<sup>2</sup>)</td>
</tr>
<tr>
<td>cubic</td>
<td>−</td>
<td>Ο(n<sup>3</sup>)</td>
</tr>
<tr>
<td>polynomial</td>
<td>−</td>
<td>n<sup>Ο(1)</sup></td>
</tr>
<tr>
<td>exponential</td>
<td>−</td>
<td>2<sup>Ο(n)</sup></td>
</tr>
</tbody></table>
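
To get a feel for how quickly these classes pull apart, the rough sketch below prints idealized step counts for a few input sizes. The `growth_table` helper is a hypothetical name introduced only for this illustration, and the numbers are counts of abstract "steps", not measured runtimes.

```python
import math

def growth_table(sizes):
    """Return idealized step counts per complexity class (hypothetical helper)."""
    rows = []
    for n in sizes:
        log_n = math.ceil(math.log2(n))  # base-2 log, rounded up to whole steps
        rows.append({
            'n': n,
            'O(1)': 1,
            'O(log n)': log_n,
            'O(n)': n,
            'O(n log n)': n * log_n,
            'O(n^2)': n ** 2,
        })
    return rows

# Compare a small, medium, and larger input size side by side
for row in growth_table([8, 64, 1024]):
    print(row)
```

Going from n = 8 to n = 1024 leaves the constant row unchanged and adds only a handful of logarithmic steps, while the quadratic row grows past a million.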

### Constant Time Example

In [10]:
# Example of Constant Time Function O(1)
# The runtime will stay the same regardless of input size
import time

def constant_algo(items):
    result = items[2] * items[3]
    return result

a_list = [0,1,2,3,4]
b_list = list(range(1000000))

start_time = time.time()

print(constant_algo(b_list))

elapsed_time = time.time() - start_time

print(f'Elapsed Time for this list {elapsed_time}')

6
Elapsed Time for this list 0.0004279613494873047


### Linear Time Example

In [27]:
# Example of Linear Time O(n)
# The runtime grows in direct proportion to the number of inputs
def linear_algo(items):
    for item in items:
        print(item)

l_1 = [0,1,2,3,4]
l_2 = list(range(50))

start_time = time.time()
linear_algo(l_1)
elapsed_time = time.time() - start_time
print(f'Elapsed Time for this list {elapsed_time}')

0
1
2
3
4
Elapsed Time for this list 0.0003790855407714844


### Quadratic Time Example

In [37]:
# Example of Quadratic Time: O(n^2)
# A function is quadratic when it uses 2 nested loops that each iterate
# over all n items, so the inner body runs n * n times

def quadratic_algo(items):
    count = 0
    for i in items:
        for j in items:
            print(f'i = {i}, j = {j}')
            count += 1
    print(count)
            
a_list = list(range(5))

quadratic_algo(a_list)

i = 0, j = 0
i = 0, j = 1
i = 0, j = 2
i = 0, j = 3
i = 0, j = 4
i = 1, j = 0
i = 1, j = 1
i = 1, j = 2
i = 1, j = 3
i = 1, j = 4
i = 2, j = 0
i = 2, j = 1
i = 2, j = 2
i = 2, j = 3
i = 2, j = 4
i = 3, j = 0
i = 3, j = 1
i = 3, j = 2
i = 3, j = 3
i = 3, j = 4
i = 4, j = 0
i = 4, j = 1
i = 4, j = 2
i = 4, j = 3
i = 4, j = 4
25
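
### Logarithmic Time Example

The chart above lists logarithmic time, but none of the examples so far show it. Binary search is a common illustration, sketched here under the assumption that the input list is already sorted. The `binary_search` function and its step counter are additions for this sketch, not part of the original lesson.

```python
# Example of Logarithmic Time O(log n)
# Each comparison throws away half of the remaining items
def binary_search(sorted_items, target):
    """Return (index, steps); index is -1 if target is not present."""
    low, high = 0, len(sorted_items) - 1
    steps = 0
    while low <= high:
        steps += 1
        mid = (low + high) // 2
        if sorted_items[mid] == target:
            return mid, steps
        if sorted_items[mid] < target:
            low = mid + 1   # target can only be in the upper half
        else:
            high = mid - 1  # target can only be in the lower half
    return -1, steps

big_list = list(range(1_000_000))
index, steps = binary_search(big_list, 999_999)
print(f'Found at index {index} after {steps} comparisons')
```

For a list of one million items the loop never needs more than 20 passes, because 2^20 is already larger than one million.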


## Arrays

In Python we benefit from the dynamic array (the built-in list), which means the block of memory expands as needed as items are added. In a traditional (static) array the size is fixed up front and the elements are stored in consecutive blocks of memory, each usually 4 or 8 bytes wide depending on the element type and the system architecture. Below is a diagram of how that looks under the hood:

<img src="http://www.mathcs.emory.edu/~cheung/Courses/170/Syllabus/09/FIGS/array02x.gif" style="height:250px; width:350px;">

## Which in python looks like this:

In [42]:
# Dynamic lists in python allow for O(1) (amortized) additions to the end of the list
alist = []
alist.append(5)
print(alist)

# Insertions/deletions NOT at the end cost more: every item after that position has to shift,
# so inserting at index 0 is O(n)
alist.insert(0,3)
print(alist)


# Overall python dynamic arrays benefit us since we do not need to pre-allocate memory before working with a list

# Static Array (maybe in a lower level language)
# "I am creating a new list with x amount of available spaces"
# var arr = new Array(10)

[5]
[3, 5]
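
To make the "expands as needed" idea concrete, here is a toy dynamic array built on a fixed-size block. It is a minimal sketch: the `DynamicArray` class, its doubling strategy, and the `copies` counter are assumptions made for illustration, and CPython's real list uses a different growth pattern.

```python
class DynamicArray:
    """Toy dynamic array that doubles its capacity when full (illustrative only)."""

    def __init__(self):
        self.capacity = 1
        self.length = 0
        self.slots = [None] * self.capacity  # stands in for a fixed block of memory
        self.copies = 0                      # total element copies caused by resizing

    def append(self, value):
        # Only resize when the current block is completely full
        if self.length == self.capacity:
            self._resize(self.capacity * 2)
        self.slots[self.length] = value
        self.length += 1

    def _resize(self, new_capacity):
        # Allocate a bigger block and copy every existing item across: O(n)
        new_slots = [None] * new_capacity
        for i in range(self.length):
            new_slots[i] = self.slots[i]
            self.copies += 1
        self.slots = new_slots
        self.capacity = new_capacity

arr = DynamicArray()
for n in range(1000):
    arr.append(n)
print(arr.length, arr.capacity, arr.copies)  # 1000 1024 1023
```

An individual append is occasionally O(n) when a resize happens, but 1000 appends cost only 1023 copies in total. Averaged over all appends that is a constant amount of work each, which is what "amortized O(1)" means.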


### Let's take a look at some of the time and space analysis of arrays, dictionaries, and sets

In [43]:
alist = ['Coding', 'temple', 'value']

In [47]:
# Indexing an Array(list)
# Constant Time and Space - O(1)

# Searching through an Array(list)
# Linear Time - O(n) and Constant Space O(1)

# Copying an Array(list)
# Linear Time and Space O(n)
rev = alist[::-1]

# Setting an index in an array
# Constant Time and Space - O(1)
alist[2] = 'python'


# Working with Python lists (Modified Array)
alist[0] # O(1) Accessing
alist.index('python') # O(n) Searching
'python' in alist # O(n) Membership check
alist.append('JavaScript') # O(1) Appending to the end / or popping
alist.remove('Coding') # O(n) removing
# removing a specific index or inserting at a specific index is roughly O(n)
alist.insert(1, 'Java')

# Dictionaries (Hash Maps)
adict = {'key': 'value'}
key = 'key'
adict['key'] # O(1) average - accessing a value given a key
adict['key'] = 'value' # O(1) average - inserting or overwriting an existing k,v pair
del adict['key'] # O(1) average
key in adict # O(1) average - testing membership
key in adict.keys() # also O(1) in Python 3 (keys() is a view, not a list), but `key in adict` is the idiomatic form
'value' in adict.values() # O(n) - a value search has to scan every value

# Sets ( Hash Tables )
aset = set()
num = 'value'
aset.add('value') # O(1) average - addition
aset.remove('value') # O(1) average - removal
num in aset # O(1) average - membership check



### Example of getting the same solution at a different time/space complexity
###### Two Sum Problem
Create a function that, given a sorted list of numbers and a target sum, returns the indices of the two numbers that add up to the target.
- Example Input: [2,7,11,15], target = 9
- Example Output: [0,1]

- Example Input: [4,7,8,9, 10, 15, 19, 20], target = 25
- Example Output: [4, 5]

In [59]:
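# Step 1: grab onto the first number and add it to every other number in the list
# Step 2: if the number and another number in the list add up to the target, return the location for both of them

# Quadratic O(n^2): two nested loops over the list; the O(n) .index() lookups run only once, at the return
# Note: x and y can be the same element here, which the index-based versions below avoid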
def twoSum(arr, target):
    for x in arr:
        for y in arr:
            if x+y == target:
                return [arr.index(x), arr.index(y)]
            
twoSum([2,7,11,15], 9)

# Refactoring to use indices in loop so I don't need to find them within my nested loops
# Still O(n^2): the inner loop starts at i+1, which halves the work but not the growth rate
def twoSum(arr, target):
    for i in range(len(arr)):
        for j in range(i+1,len(arr)):
            if arr[i] + arr[j] == target:
                return [i,j]

twoSum([2,7,11,15], 9)

# Refactor again with a while loop - same comparisons as above, so still O(n^2)
def twoSum(arr, target):
    for i in range(len(arr)):
        left = i
        right = i+1
        while right < len(arr):
            if arr[left] + arr[right] == target:
                return [left, right]
            else:
                right +=1
                
                
twoSum([2,7,11,15], 9)


# Linear Solution - O(n) time, at the cost of O(n) extra space for the dictionary
def twoSum(arr, target):
    seen = {}
    for i in range(len(arr)):
        if arr[i] in seen:
            return [seen[arr[i]], i]
        else:
            match = target - arr[i]
            seen[match] = i

        
twoSum([4,7,8,9,10,15,19,20],25)


[4, 5]
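
Because the problem statement says the list is sorted, there is one more trade-off worth sketching: a two-pointer scan that keeps the O(n) time of the dictionary version but needs only O(1) extra space. The name `two_sum_sorted` is hypothetical, and the approach only works if the input really is sorted in ascending order.

```python
# Two-pointer solution - O(n) time, O(1) space (requires sorted input)
def two_sum_sorted(arr, target):
    left, right = 0, len(arr) - 1
    while left < right:
        total = arr[left] + arr[right]
        if total == target:
            return [left, right]
        if total < target:
            left += 1    # sum too small: move toward larger numbers
        else:
            right -= 1   # sum too big: move toward smaller numbers
    return None          # no pair adds up to the target

print(two_sum_sorted([2, 7, 11, 15], 9))                  # [0, 1]
print(two_sum_sorted([4, 7, 8, 9, 10, 15, 19, 20], 25))   # [4, 5]
```

Each pass moves one of the two pointers inward, so the loop runs at most n - 1 times and never stores anything beyond the two indices.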

## Stacks and Queues

**Stacks**, as the name suggests, are a data structure that follows the Last In First Out principle (LIFO). Think of a stack of pancakes, for example. The last pancake added sits on top, so it is the first one you take off; to reach the first pancake you have to start at the top and work your way down.

##### Searching through a stack will be Linear Time O(n) - Constant Space O(1)
##### Selecting the last item will be done in Constant Time O(1) - Constant Space O(1)
##### Adding to the stack should take Constant Time O(1) - Constant Space O(1)

**Queues** are similar, but they follow the First In First Out principle (FIFO). Think of this as a line at a Black Friday sale. The first person camped out for the big screen TV is the first to get it.

##### Searching through a queue will be Linear Time O(n) - Constant Space O(1)
##### Selecting the first item will be done in Constant Time O(1) - Constant Space O(1)
##### Adding to the queue should take Constant Time O(1) - Constant Space O(1)

In [None]:
# Stacks looking for most recent addition to access quickly
# Queues looking for the least recent -- think of an IT help ticket... support wants to help the
# person who has been waiting the longest amount of time
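
The cell above only describes the two structures, so here is one possible way to try them out with what Python already provides. This sketch assumes a plain list is acceptable as a stack and uses `collections.deque` from the standard library as the queue; the pancake and ticket values are made up for the example.

```python
from collections import deque

# Stack (LIFO): a list works well because append/pop at the end are O(1)
stack = []
stack.append('first pancake')
stack.append('second pancake')
stack.append('third pancake')
top = stack.pop()          # removes the most recent addition
print(top)                 # third pancake

# Queue (FIFO): deque gives O(1) removal from the front
queue = deque()
queue.append('ticket 1')
queue.append('ticket 2')
queue.append('ticket 3')
oldest = queue.popleft()   # removes the item that has waited the longest
print(oldest)              # ticket 1
```

A plain list could also serve as a queue via `pop(0)`, but that shifts every remaining item and costs O(n) per removal, which is why `deque` is the usual choice.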

## Linked List (Data Structure)

A linked list is built from nodes. We create a Node class that holds a value and a pointer to the next node, then create a second class (the linked list itself) that uses these Node objects. Each node's pointer is set so that it points to the next data element in the list.

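There are some advantages and disadvantages with this data structure. **Advantages:** Linked lists are flexible with memory management: nodes are allocated one at a time, so the list does not need one large contiguous block of memory and never has to be resized or copied as it grows. **Disadvantages:** Finding a value, or adding to the end of the list as implemented below, requires traversing the list node by node, which is O(n).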

In [96]:
# Object with a value and a pointer

class Node:
    def __init__(self,value):
        self.value = value
        self.next = None

class LinkedList:
    def __init__(self):
        self.head = None
    
    # this method will append something onto the linked list (end)
    def append(self, new_value):
        # Create a new Node instance using my new value
        new_node = Node(new_value)
        
        # Check if LL is empty... if so.. place appended item at the head
        if self.head is None:
            self.head = new_node
        
        # BUT if the list is not empty - traverse until the end
        # and add the new value to the end of the list
        else:
            last = self.head
            while last.next:
                last = last.next
            # Change the final list item to be pointing to this appended item
            last.next = new_node
    
    def pushOn(self,new_value):
        new_node = Node(new_value)
        new_node.next = self.head
        self.head = new_node
    
    def insertAfter(self, prev_node, new_value):
        #Check if that previous node exists
        if prev_node is None:
            print('the given prev_node does not exist!')
            return
        new_node = Node(new_value)
        
        # Update the new_node to point where the prev_node used to point
        new_node.next = prev_node.next
        
        # Update the previous node to point at newly inserted node
        prev_node.next = new_node
        
    def traverse(self):
        temp = self.head
        # while temp is not None -- keep looking through these links until you reach a none value
        while temp:
            print(temp.value)
            temp = temp.next
        
weekdays = LinkedList()

In [97]:
weekdays.pushOn('Mon')
weekdays.append('Tue')
weekdays.append('Wed')
weekdays.insertAfter(weekdays.head.next, 'Tuesday.5')
weekdays.traverse()


Mon
Tue
Tuesday.5
Wed


In [98]:
weekdays.insertAfter(weekdays.head, 'monday')
weekdays.traverse()

Mon
monday
Tue
Tuesday.5
Wed
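
The outline at the top lists "Finding a node in a linked list", which the class above does not implement. One way it might look is sketched below. The `find` function is a hypothetical addition, and `Node` is repeated only so the block runs by itself.

```python
class Node:
    def __init__(self, value):
        self.value = value
        self.next = None

def find(head, target):
    """Walk the links from head and return the first node holding target, else None."""
    current = head
    while current:
        if current.value == target:
            return current
        current = current.next
    return None

# Build Mon -> Tue -> Wed by hand
head = Node('Mon')
head.next = Node('Tue')
head.next.next = Node('Wed')

found = find(head, 'Tue')
print(found.value if found else 'not found')   # Tue
print(find(head, 'Fri'))                       # None
```

In the worst case the search visits every node, so finding a value is O(n) time and O(1) space. The returned node can be handed straight to `insertAfter` as its `prev_node`.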


In [99]:
help(list)


Help on class list in module builtins:

class list(object)
 |  list(iterable=(), /)
 |  
 |  Built-in mutable sequence.
 |  
 |  If no argument is given, the constructor creates a new empty list.
 |  The argument must be an iterable if specified.
 |  
 |  Methods defined here:
 |  
 |  __add__(self, value, /)
 |      Return self+value.
 |  
 |  __contains__(self, key, /)
 |      Return key in self.
 |  
 |  __delitem__(self, key, /)
 |      Delete self[key].
 |  
 |  __eq__(self, value, /)
 |      Return self==value.
 |  
 |  __ge__(self, value, /)
 |      Return self>=value.
 |  
 |  __getattribute__(self, name, /)
 |      Return getattr(self, name).
 |  
 |  __getitem__(...)
 |      x.__getitem__(y) <==> x[y]
 |  
 |  __gt__(self, value, /)
 |      Return self>value.
 |  
 |  __iadd__(self, value, /)
 |      Implement self+=value.
 |  
 |  __imul__(self, value, /)
 |      Implement self*=value.
 |  
 |  __init__(self, /, *args, **kwargs)
 |      Initialize self.  See help(type(self))

In [101]:
names = []
names.append('Nate')
names

['Nate']

In [None]:
# Node(10, Node(80, Node(100, None)))