# Self-Check

## 1.4 – Review of Basic Python

Taken from __Problem Solving with Algorithms and Data Structures__

### Task

Here is a self check that really covers everything so far. You may have heard of the infinite monkey theorem? The theorem states that a monkey hitting keys at random on a typewriter keyboard for an infinite amount of time will almost surely type a given text, such as the complete works of William Shakespeare. Well, suppose we replace a monkey with a Python function. How long do you think it would take for a Python function to generate just one sentence of Shakespeare? The sentence we’ll shoot for is:
> methinks it is like a weasel

You are not going to want to run this one in the browser, so fire up your favorite Python IDE. The way we will simulate this is to write a function that generates a string that is 28 characters long (the length of the goal sentence) by choosing random letters from the 26 letters in the alphabet plus the space. We will write another function that will score each generated string by comparing the randomly generated string to the goal.

A third function will repeatedly call generate and score, then if 100% of the letters are correct we are done. If the letters are not correct then we will generate a whole new string. To make it easier to follow your program’s progress this third function should print out the best string generated so far and its score every 1000 tries.

In [1]:
import random

def infiniteMonkey():
    # Pure random search: draw a fresh string on every try until it matches the target.
    iters = 0
    score = 0
    bestStr = ''
    bestScore = 0
    while score != 28:
        string = genStr()
        # calcScore returns (score, mismatched indices); only the score is needed here.
        score, _ = calcScore(string)
        if score > bestScore:
            bestStr = string
            bestScore = score
        if iters % 100000 == 0:
            print("Best string: {:s} with score {:d} ({:.1f}%).".format(bestStr, bestScore, bestScore*100/28.0))
        if score == 28:
            print("Finished in {} iterations.".format(iters))
            break
        iters += 1

def genStr():
    alphabet = 'abcdefghijklmnopqrstuvwxyz '
    string = ''
    for _ in range(28):
        string = string + alphabet[random.randrange(0,len(alphabet))]
    return string

def mutateStr(string, indices):
    # Re-roll the character at each listed index; all other positions are kept.
    alphabet = 'abcdefghijklmnopqrstuvwxyz '
    newString = string
    for idx in indices:
        newString = newString[:idx] + random.choice(alphabet) + newString[idx+1:]
    return newString

def calcScore(string):
    targetStr = 'methinks it is like a weasel'
    score = 0
    indices = []
    for i in range(28):
        if string[i] == targetStr[i]:
            score += 1
        else:
            indices.append(i)
    return score, indices

In [2]:
# infiniteMonkey()
# Left commented out: pure random search takes far too long to finish.
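
A rough estimate of why the random search is impractical, assuming every position is drawn independently and uniformly from the 27-symbol alphabet. The names `ALPHABET_SIZE`, `TARGET_LEN`, and `tries_per_second` are illustrative, and the rate of one million tries per second is an assumption, not a measurement.

```python
# Back-of-the-envelope estimate for the pure random search.
ALPHABET_SIZE = 27   # 26 letters plus the space
TARGET_LEN = 28      # len('methinks it is like a weasel')

# Each try matches the whole target with probability 1 / 27**28,
# so the expected number of tries is 27**28.
expected_tries = ALPHABET_SIZE ** TARGET_LEN

# A single random string matches each position with probability 1/27,
# so its expected score is 28/27, i.e. about one correct character.
expected_score = TARGET_LEN / ALPHABET_SIZE

# Assumed rate, chosen only for illustration.
tries_per_second = 1_000_000
seconds_per_year = 60 * 60 * 24 * 365
expected_years = expected_tries / tries_per_second / seconds_per_year

print("Expected tries: {:.2e}".format(expected_tries))
print("Expected score of one random string: {:.2f}".format(expected_score))
print("Expected years at the assumed rate: {:.2e}".format(expected_years))
```

The expected number of tries is on the order of 10^40, so no realistic speed-up of the loop makes the search finish.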

### Self-check Challenge

See if you can improve upon the program in the self check by keeping letters that are correct and only modifying one character in the best string so far. This is a type of algorithm in the class of “hill climbing” algorithms; that is, we only keep the result if it is better than the previous one.
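
A minimal sketch of the challenge as worded, changing exactly one character per step and keeping the candidate only if its score improves. This is not the notebook's own solution: the names `ALPHABET`, `TARGET`, `score_of`, and `hill_climb` are hypothetical, and picking the position to change from those still incorrect is one possible reading of "keeping letters that are correct".

```python
import random

ALPHABET = 'abcdefghijklmnopqrstuvwxyz '
TARGET = 'methinks it is like a weasel'

def score_of(candidate):
    # Number of positions where candidate agrees with TARGET.
    return sum(1 for a, b in zip(candidate, TARGET) if a == b)

def hill_climb(rng=random):
    # Start from a fully random string of the right length.
    best = ''.join(rng.choice(ALPHABET) for _ in range(len(TARGET)))
    best_score = score_of(best)
    tries = 0
    while best_score < len(TARGET):
        # Pick one position that is still wrong and re-roll only that character.
        wrong = [i for i in range(len(TARGET)) if best[i] != TARGET[i]]
        idx = rng.choice(wrong)
        candidate = best[:idx] + rng.choice(ALPHABET) + best[idx + 1:]
        candidate_score = score_of(candidate)
        # Hill climbing: keep the candidate only if it is strictly better.
        if candidate_score > best_score:
            best, best_score = candidate, candidate_score
        tries += 1
    return best, tries

# A seeded generator makes the run repeatable; the seed value is arbitrary.
result, tries = hill_climb(random.Random(0))
print("Reached '{}' after {} tries.".format(result, tries))
```

Because a correct character is never touched again, the score can only rise, and each step fixes the chosen position with probability 1/27.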

In [72]:
def infiniteMonkeyImproved():
    iters = 0
    score = 0
    bestStr = 'NONE'
    bestScore = 0
    
    string = genStr()
    indices = []
    print("Initial string: {} with score {}.".format(string, calcScore(string)[0]))
    while score != 28:
        string = mutateStr(string, indices)
        score, indices = calcScore(string)
        if score > bestScore:
            bestStr = string
            bestScore = score
        if score == 28:
            print("Best string: {} | Score: {} | Iteration: {}.".format(bestStr, bestScore, iters))
            print("====FINISHED====")
            break
        if iters % 2 == 0 and iters != 0:
            print("Best string: {} | Score: {} | Iteration: {}.".format(bestStr, bestScore, iters))
        iters += 1

In [73]:
infiniteMonkeyImproved()

Initial string: bfkftwqcsfjdkfbgyqeuufr mkea with score 2.
Best string: vhcow xshyrugbxnnjeochwelheb | Score: 5 | Iteration: 2.
Best string: iogsphxsxhxpar liuemgpwezueg | Score: 8 | Iteration: 4.
Best string: iogsphxsxhxpar liuemgpwezueg | Score: 8 | Iteration: 6.
Best string: iogsphxsxhxpar liuemgpwezueg | Score: 8 | Iteration: 8.
Best string: ya vvdis oudba lideqxqwebqeh | Score: 9 | Iteration: 10.
Best string: zhhjcnhs qm tk lije kywewneh | Score: 12 | Iteration: 12.
Best string: rglhpnss as pv liae ecweanee | Score: 14 | Iteration: 14.
Best string: rglhpnss as pv liae ecweanee | Score: 14 | Iteration: 16.
Best string: xnthsnps  e qg life hzweawet | Score: 15 | Iteration: 18.
Best string: jathsnws ss cu liee  eweasev | Score: 16 | Iteration: 20.
Best string: uothnnrs ih is life voweaseb | Score: 19 | Iteration: 22.
Best string: uothnnrs ih is life voweaseb | Score: 19 | Iteration: 24.
Best string: mythrnms it is liae ekweaser | Score: 21 | Iteration: 26.
Best string: mythrnms it is