# Time/Space Complexity - Intro to Data Structures (User Defined)

### Topics to discuss today:

<ul>
    <li>Time and Space Complexity - What is it/How do we measure it</li>
    <li>Asymptotic Analysis</li>
    <li><strong>Data Structures</strong></li>
    <li>Some of the popular sorting algorithms</li>
</ul>

### Data Structures to discuss:
- Arrays
- Stacks
- Queues
- Linked Lists
    - Singly Linked Lists
    - Traversing A Linked List
    - Finding a node in a linked list
    - Adding to a linked list


## Time and Space Complexity

#### What is it?

Time complexity measures how the running time of a given action (function) grows as the size of its input grows. In the same fashion, space complexity measures how much memory an algorithm or data structure needs. A problem can have multiple solutions, so finding the optimal one means analyzing each candidate in terms of both time and space.

#### How do we measure Time and Space Complexity?

To measure time and space complexity we use asymptotic analysis. We need a mathematical way to compare different algorithms (functions) based on the size of their inputs. For example, one function might take a number of steps proportional to n, while another takes a number of steps proportional to n^2. With everything else held constant, the only thing that changes is the size of the input, n. Below is a chart of the common growth rates used in asymptotic analysis.

<table style="text-align:center;" class="table table-bordered">
<tbody><tr>
<td>constant</td>
<td>−</td>
<td>Ο(1)</td>
</tr>
<tr>
<td>logarithmic</td>
<td>−</td>
<td>Ο(log n)</td>
</tr>
<tr>
<td>linear</td>
<td>−</td>
<td>Ο(n)</td>
</tr>
<tr>
<td>linear logarithmic</td>
<td>−</td>
<td>Ο(n log n)</td>
</tr>
<tr>
<td>quadratic</td>
<td>−</td>
<td>Ο(n<sup>2</sup>)</td>
</tr>
<tr>
<td>cubic</td>
<td>−</td>
<td>Ο(n<sup>3</sup>)</td>
</tr>
<tr>
<td>polynomial</td>
<td>−</td>
<td>n<sup>Ο(1)</sup></td>
</tr>
<tr>
<td>exponential</td>
<td>−</td>
<td>2<sup>Ο(n)</sup></td>
</tr>
</tbody></table>
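
To make a few rows of the chart concrete, here is a minimal sketch that counts loop steps instead of measuring wall-clock time. The function names (`constant_steps`, `log_steps`, `linear_steps`, `quadratic_steps`) are made up for this illustration.

```python
def constant_steps(n):
    # O(1): one step no matter how large n is
    return 1

def log_steps(n):
    # O(log n): keep halving n until it reaches 1
    steps = 0
    while n > 1:
        n //= 2
        steps += 1
    return steps

def linear_steps(n):
    # O(n): one step per element
    steps = 0
    for _ in range(n):
        steps += 1
    return steps

def quadratic_steps(n):
    # O(n^2): one step per (i, j) pair
    steps = 0
    for _ in range(n):
        for _ in range(n):
            steps += 1
    return steps

# columns: n, constant, logarithmic, linear, quadratic
for n in (8, 16, 32):
    print(n, constant_steps(n), log_steps(n), linear_steps(n), quadratic_steps(n))
```

Doubling n leaves the constant column alone, adds one step to the logarithmic column, doubles the linear column, and quadruples the quadratic column.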

In [1]:
# 1 Google complexity of built-in functions

# .index() - O(n)
# sum() - O(n)
# list.remove() - O(n)

# 2 Look for nesting
# NESTING
# When an operation occurs inside of an iteration, the costs multiply
# (this is polynomial growth, not exponential growth)
# 1 for loop in another for loop = O(n*n) = O(n^2)
# the same goes for a linear operation inside a for loop (e.g. .remove() from a list)
nums = list(range(3))
for i in range(len(nums)):
    for j in range(len(nums)):
        print(i,j)
        
# O(log n) and O(n log n) are rarer and also harder to calculate
# same with factorial, polynomial, and exponential

0 0
0 1
0 2
1 0
1 1
1 2
2 0
2 1
2 2
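
Nesting is not always two visible `for` loops. Here is a small sketch of a linear built-in hiding inside a loop, next to a single-pass version. The names `remove_all_slow` and `remove_all_fast` are hypothetical, chosen just for this example.

```python
def remove_all_slow(items, target):
    # `target in result` scans the list: O(n)
    # `result.remove(target)` scans and shifts the list: O(n)
    # both happen once per match, so the worst case is O(n^2)
    result = list(items)
    while target in result:
        result.remove(target)
    return result

def remove_all_fast(items, target):
    # a single pass that builds a new list: O(n)
    return [item for item in items if item != target]

nums = [1, 2, 1, 3, 1]
print(remove_all_slow(nums, 1))
print(remove_all_fast(nums, 1))
```

Both calls print `[2, 3]`; the difference only shows up in how the work grows as the list gets longer.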


In [None]:
# sample data so the lines below actually run
alist = ['a', 'value', 'b']
adict = {'key': 'value'}
aset = {'other'}
key, val = 'key', 'val'

# lists (modified arrays)
alist[1] # O(1) accessing
alist.index('value') # O(n) searching
alist.append('value') # O(1) adding to the end (amortized)
# inserting at the start or at some index is more complicated and O(n)
alist.remove('value') # O(n)

# Dictionaries (Modified Hashmaps), O(1) here means O(1) on average
adict['key'] # O(1) accessing
adict['key'] = 'value' # O(1) inserting a value/updating a value
del adict['key'] # O(1) removing a key/value pair
key in adict # O(1) testing key membership
key in adict.keys() # redundant: O(1) in Python 3, but O(n) in Python 2; use the line above instead
# O(n) for searching for a value (.items()) (.values())
# No positional indexing (insertion order is preserved since Python 3.7)

# Sets (Modified Hashtable)
aset.add('val') # O(1) adding
aset.remove('val') # O(1) removal
# sets have no indexing, so you cannot access a value by position
val in aset # O(1) membership test (this is how you search a set)
# sets do not accept duplicates
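
As a quick illustration of why the O(1) membership test matters, here is a hedged sketch of a duplicate check that uses a set as a "seen" record. The function name `has_duplicates` is an assumption for this example, not something defined elsewhere in these notes.

```python
def has_duplicates(items):
    # one pass over the input: O(n) time, O(n) extra space for the set
    seen = set()
    for item in items:
        if item in seen:   # O(1) on average, because sets are hash-based
            return True
        seen.add(item)     # O(1) on average
    return False

print(has_duplicates(['a', 'b', 'c']))
print(has_duplicates(['a', 'b', 'a']))
```

Swapping the set for a list would still work, but `item in seen` would become an O(n) scan and the whole function would become O(n^2).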

## Arrays

In Python we benefit from the dynamic array: the list's block of memory expands as needed to fit whatever we put into it. In a traditional (static) array, the elements are stored in consecutive blocks of memory, usually 4 or 8 bytes each depending on the element type and whether the platform is 32-bit or 64-bit. Below is a diagram of how that looks under the hood:

<img src="http://www.mathcs.emory.edu/~cheung/Courses/170/Syllabus/09/FIGS/array02x.gif" style="height:250px; width:350px;">

## Which in Python looks like this:

In [None]:
# Python lists are dynamic arrays: the list expands its block of memory as needed
# this makes adding to the end of our array O(1) on average (amortized)
arr = []
arr.append(1) # would be constant time

# compare this to a language that has static (fixed-size) arrays
# we would need to set aside the memory of that array before adding
# anything to that array
# let arr = new Array(5);  // this says - I'm making an array of len 5 (JS syntax)
# arr[3] = 'hi'
# (real JS arrays can still grow, so read this as a stand-in for a fixed-size array)
# if we wanted to add a 6th element to a fixed-size array, we need to portion
# off another, larger section of memory and copy everything over, which is O(n)

# When we're talking about other languages, Java and JavaScript are not related
# Java:JavaScript == Ham:Hamster
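
To see why growing the array does not ruin the O(1) append, here is a toy sketch of a dynamic array that doubles its capacity when it fills up and counts how many element copies the resizing costs. The `DynamicArray` class and its `copies` counter are illustrative assumptions; CPython's list uses its own over-allocation strategy, not this exact doubling.

```python
class DynamicArray:
    """Toy dynamic array that doubles its capacity when full."""

    def __init__(self):
        self.capacity = 1
        self.size = 0
        self.slots = [None] * self.capacity
        self.copies = 0  # total element copies caused by resizing

    def append(self, value):
        # only a full array pays the O(n) resize cost
        if self.size == self.capacity:
            self._resize(2 * self.capacity)
        self.slots[self.size] = value
        self.size += 1

    def _resize(self, new_capacity):
        # allocate a bigger block and copy every element over
        new_slots = [None] * new_capacity
        for i in range(self.size):
            new_slots[i] = self.slots[i]
            self.copies += 1
        self.slots = new_slots
        self.capacity = new_capacity

arr = DynamicArray()
for n in range(1000):
    arr.append(n)
print(arr.size, arr.capacity, arr.copies)
```

After 1000 appends the array has done 1023 copies in total (1 + 2 + 4 + ... + 512), which is roughly one copy per append. That is what "amortized O(1)" means: an occasional append is expensive, but the average cost stays constant.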

### Let's take a look at some of the time and space analysis of arrays

In [None]:
# lists (modified arrays)
alist = ['a', 'value', 'b'] # sample data so the lines below actually run
alist[1] # O(1) accessing
alist.index('value') # O(n) searching
alist.append('value') # O(1) adding to the end (amortized)
# inserting at the start or at some index is more complicated and O(n)
alist.remove('value') # O(n)

## Stacks and Queues

**Stacks**, as the name suggests, are a data structure that follows the Last In First Out principle (LIFO). Think of a stack of pancakes, for example. The last pancake added sits on top, so it is the first one you take; to get to the first pancake you would start at the top and work your way down.

##### Searching through a stack will be Linear Time O(n) - Constant Space O(1)
##### Selecting the last item will be done in Constant Time O(1) - Constant Space O(1)
##### Adding to the stack should take Constant Time O(1) - Constant Space O(1)

**Queues** are similar, but they follow the First In First Out principle (FIFO). Think of this as a line at a Black Friday sale. The first person camped out for the big-screen TV is the first to get it.

##### Searching through a queue will be Linear Time O(n) - Constant Space O(1)
##### Selecting the first item will be done in Constant Time O(1) - Constant Space O(1)
##### Adding to the queue should take Constant Time O(1) - Constant Space O(1)

In [None]:
# Python lists function as stacks
alist = ['Dimitris', 'Ashley', 'Matt']
alist.append('Rashel') # O(1) to add to the end of the list
# Built-in pop function
print(alist.pop()) # O(1): removal from the end of a list is constant time
# pop() defaults to the end because removal there is constant time;
# pop(0) at the front is linear because every remaining item has to shift over

In [3]:
# Know about the existence of queues
# And know about whatever built-in queue functionality exists in the language you're working with
# In Python, I use the collections deque when I need to implement a queue
from collections import deque
q = deque()
q.append('Dimitris')
q.appendleft('Rashel')
q.append('Ashley')
print(q)
print(q.popleft())
print(q)

deque(['Rashel', 'Dimitris', 'Ashley'])
Rashel
deque(['Dimitris', 'Ashley'])
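
Since these notes are about user-defined data structures, here is a minimal sketch of wrapping the built-ins above in `Stack` and `Queue` classes. The class and method names (`push`, `pop`, `peek`, `enqueue`, `dequeue`, `is_empty`) are common conventions assumed for this example, not part of the Python standard library.

```python
from collections import deque

class Stack:
    """LIFO: items come off in the reverse of the order they went on."""

    def __init__(self):
        self._items = []

    def push(self, item):
        self._items.append(item)   # O(1) add to the end

    def pop(self):
        return self._items.pop()   # O(1) remove from the end

    def peek(self):
        return self._items[-1]     # O(1) look at the top without removing it

    def is_empty(self):
        return not self._items

class Queue:
    """FIFO: items come off in the same order they went on."""

    def __init__(self):
        self._items = deque()

    def enqueue(self, item):
        self._items.append(item)       # O(1) add to the back

    def dequeue(self):
        return self._items.popleft()   # O(1) remove from the front

    def is_empty(self):
        return not self._items

stack = Stack()
queue = Queue()
for name in ['Dimitris', 'Ashley', 'Matt']:
    stack.push(name)
    queue.enqueue(name)

print(stack.pop())      # Matt (last in, first out)
print(queue.dequeue())  # Dimitris (first in, first out)
```

The queue uses a `deque` rather than a list because `list.pop(0)` is O(n), while `deque.popleft()` is O(1).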


## Linked List (Data Structure)

A linked list is built from a node class. Each Node object stores a value and a reference to the next node, and a second class (the linked list itself) keeps track of the head node and works with these Node objects. We link the data elements together by setting each node's next reference to point to the next node.

There are some advantages and disadvantages with this data structure. **Advantages** Linked lists can be flexible with memory management: nodes are allocated one at a time as needed, so the list never has to reserve or copy one large contiguous block of memory. **Disadvantages** Finding a node, or adding to the end of the list as implemented below, requires traversing the list from the head.

In [8]:
# Implementation of a linked list

# 2 components to our solution - Node and LinkedList

class Node():
    def __init__(self, value):
        self.value = value
        self.next = None
    
class LinkedList():
    def __init__(self):
        self.head = None
    
    # Can be used for creating the head - but also to add something to the start of our linked list
    def pushOn(self, new_value):
        new_node = Node(new_value)
        new_node.next = self.head
        self.head = new_node
    
    # insert at a specific spot
    def insertAfter(self, prev_node, new_value):
        # check if its a valid prev_node
        if prev_node is None:
            print('Empty previous node')
            return
        # if the previous node is valid and not empty
        new_node = Node(new_value)
        
        # update the new node's next to be the prev_node's next
        new_node.next = prev_node.next
        
        # update the prev node's next to be the new node
        prev_node.next = new_node
        
    # add to the end
    def append(self, new_value):
        # create a new node
        new_node = Node(new_value)
        
        # check if linkedlist is currently empty -> if it is, this new node will be the head of our list
        if self.head is None:
            self.head = new_node
            return
        # if linkedlist is not empty -> then we have to find the tail of the linkedlist
        # means we have to traverse the list
        last = self.head
        
        # while last.next is not none -> continue our loop until we find a null value (aka we've found our tail node)
        while last.next: 
            last = last.next 
            
        # point the current last node's next at our new node
        last.next = new_node
        
            
    #look at our whole linked list
    def traverse(self):
        # similar to the tail-finding loop in append
        temp = self.head
        while temp:
            print(temp.value)
            temp = temp.next
            
weekdays_linked = LinkedList()

weekdays_linked.pushOn('Monday')
weekdays_linked.append('Tuesday')
weekdays_linked.append('Thursday')
weekdays_linked.insertAfter(weekdays_linked.head.next, 'Wednesday')
weekdays_linked.pushOn('Sunday')
weekdays_linked.traverse()


Sunday
Monday
Tuesday
Wednesday
Thursday
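
The topic list mentions finding a node in a linked list, so here is a hedged sketch of one way to do it. The standalone `find` function is an assumption for this example (it is not a method of the `LinkedList` class above), and `Node` is repeated so the snippet runs on its own.

```python
class Node:
    def __init__(self, value):
        self.value = value
        self.next = None

def find(head, target):
    # walk the list from the head: O(n) time, O(1) space
    current = head
    while current:
        if current.value == target:
            return current
        current = current.next
    # ran off the end of the list without a match
    return None

# build Monday -> Tuesday -> Wednesday by hand
head = Node('Monday')
head.next = Node('Tuesday')
head.next.next = Node('Wednesday')

print(find(head, 'Tuesday').value)  # Tuesday
print(find(head, 'Friday'))         # None
```

With the class above, the same function could be called as `find(weekdays_linked.head, 'Wednesday')`, and the node it returns could be passed to `insertAfter`.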


In [None]:
import re
a_text = 'In computing, a hash table hash map is a data structure which implements an associative array abstract data type, a structure that can map keys to values. A hash table uses a hash function to compute an index into an array of buckets or slots from which the desired value can be found'
def countme(stringy):
    wordpat = re.compile(r"\w+")                            # raw string avoids an invalid-escape warning
    wordy = sorted(wordpat.findall(stringy.lower()))        # O(n log n) for sorted
    wordict = {}
    for word in wordy:                                      # O(n)
        if word not in wordict:                             # O(1) because searching in a dict is constant
            wordict[word] = 1
        else:
            wordict[word] += 1
    return wordict
print(countme(a_text))

# Regular Expressions' time complexity is variable
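
If alphabetical order is not needed, the O(n log n) sort can be skipped. Here is a minimal sketch using `collections.Counter` from the standard library; the function name `count_words` and the sample sentence are made up for this example.

```python
import re
from collections import Counter

def count_words(text):
    # findall is one pass over the text; Counter is one pass over the words
    words = re.findall(r"\w+", text.lower())
    return Counter(words)

counts = count_words('A hash table uses a hash function')
print(counts['hash'])   # 2
print(counts['table'])  # 1
```

`Counter` is a dict subclass, so lookups and updates keep the same O(1) average cost as the hand-written dictionary version.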