## Problem 59: XOR decryption
Each character on a computer is assigned a unique code and the preferred standard is ASCII (American Standard Code for Information Interchange). For example, uppercase A = 65, asterisk (*) = 42, and lowercase k = 107.

A modern encryption method is to take a text file, convert the bytes to ASCII, then XOR each byte with a given value taken from a secret key. The advantage of the XOR function is that applying the same encryption key to the cipher text restores the plain text; for example, 65 XOR 42 = 107, then 107 XOR 42 = 65.
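
The round trip in that example can be checked directly. This is a minimal sketch using Julia's built-in `⊻` (XOR) operator on integers; the variable names are illustrative only.

```julia
# XOR is its own inverse: applying the same key byte twice restores the input.
plain = Int('A')           # 65
key = Int('*')             # 42
cipher = plain ⊻ key       # 107, the ASCII code of 'k'
restored = cipher ⊻ key    # back to 65
println(Char(cipher), " ", Char(restored))
```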

For unbreakable encryption, the key is the same length as the plain text message, and the key is made up of random bytes. The user would keep the encrypted message and the encryption key in different locations, and without both "halves", it is impossible to decrypt the message.

Unfortunately, this method is impractical for most users, so the modified method is to use a password as a key. If the password is shorter than the message, which is likely, the key is repeated cyclically throughout the message. The trade-off with this method is choosing a password that is long enough for security but short enough to be memorable.
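
As a rough illustration of a cyclically repeated key, the sketch below XORs each message byte with the key byte at the matching position modulo the key length. The helper name `xorcycle` and the sample strings are assumptions made for this example and do not appear in the problem statement.

```julia
# Hypothetical helper: XOR each byte of msg with the key, repeated cyclically.
function xorcycle(msg::AbstractVector{<:Integer}, key::AbstractVector{<:Integer})
    # mod1 maps positions 1, 2, 3, 4, ... onto 1, 2, 3, 1, ... for a 3-byte key.
    return [m ⊻ key[mod1(i, length(key))] for (i, m) in enumerate(msg)]
end

msg = Int.(collect("hello"))   # sample plain text as ASCII codes
key = Int.(collect("abc"))     # sample password, shorter than the message
enc = xorcycle(msg, key)       # encrypt
dec = xorcycle(enc, key)       # the same operation decrypts
println(join(Char.(dec)))
```

Because encryption and decryption are the same operation, a single function covers both directions.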

Your task has been made easy, as the encryption key consists of three lower case characters. Using p059_cipher.txt, a file containing the encrypted ASCII codes, and the knowledge that the plain text must contain common English words, decrypt the message and find the sum of the ASCII values in the original text.
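
One way to approach the task is to try all 26³ = 17576 three-letter keys and keep the one whose decryption contains the most common English words. The sketch below demonstrates the idea on a toy message rather than the real cipher file; the helper names (`xorcycle`, `countcommon`, `crack`), the sample sentence, and the sample key are all assumptions made for illustration.

```julia
# Hypothetical helper: XOR each byte with the cyclically repeated key.
xorcycle(msg, key) = [m ⊻ key[mod1(i, length(key))] for (i, m) in enumerate(msg)]

# Count how many of the given words occur as whole space-separated tokens.
function countcommon(text, words)
    tokens = split(text, " ")
    return count(w -> w in tokens, words)
end

# Try every three-letter lowercase key and return the best-scoring one.
function crack(cipher; words = ["the", "to", "be", "of", "and"])
    best, bestscore = "", -1
    for a in 'a':'z', b in 'a':'z', c in 'a':'z'
        key = [Int(a), Int(b), Int(c)]
        text = join(Char.(xorcycle(cipher, key)))
        score = countcommon(text, words)
        if score > bestscore
            best, bestscore = string(a, b, c), score
        end
    end
    return best
end

# Toy message encrypted with the assumed key "key".
secret = "the quick brown fox wants to be king of the hill and the dale"
cipher = xorcycle(Int.(collect(secret)), Int.(collect("key")))
found = crack(cipher)
println(found)
```

Scoring by word count rather than stopping at the first plausible key avoids depending on a fixed threshold, at the cost of always scanning the whole key space.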



In [1]:
function filetolist(filename)
    s = open(filename) do file
        read(file, String)
    end
    ss = split(s, ",")
    return [parse(Int, x) for x in ss]
end

function intlisttobytes(sl)
    return [reverse(digits(x, base=2, pad=8)) for x in sl]
end

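function pwdtokey(pwd, n)
    # Expand the password into a key of n bytes by repeating it cyclically.
    pwdb = intlisttobytes([Int(x) for x in pwd])
    nreps, nextra = divrem(n, length(pwdb))
    key = repeat(pwdb, nreps)
    # Append the leading password bytes needed to reach exactly n entries.
    for i = 1:nextra
        push!(key, pwdb[i])
    end
    return key
end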

function bytestoascii(bytes)
    return parse(Int, join([string(x) for x in bytes]), base=2)
end

function asciilisttotext(alist)
    # Convert ASCII codes to characters, then split the text into words.
    charlist = [Char(x) for x in alist]
    text = join(charlist)
    return split(text, " ")
end

function decodedwords(encodedbytes, pwd)
    key = pwdtokey(pwd, length(encodedbytes))
    decodedbytes = [x .⊻ y for (x,y) in zip(encodedbytes, key)]
    decodedascii = [bytestoascii(x) for x in decodedbytes];
    decodedtext = asciilisttotext(decodedascii)
    return decodedtext
end

decodedwords (generic function with 1 method)

In [2]:
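encodedbytes = intlisttobytes(filetolist("p059_cipher.txt"))
commonwords = ["the", "to", "be", "of", "and"]
pwdsave = "000"   # placeholder until a candidate password is found
bestcount = 2     # a candidate must contain more than two of the common words
for s1 = 97:122
    println("$s1")   # progress indicator: first key character being tried
    for s2 = 97:122
        for s3 = 97:122
            pwd = join([Char(s1), Char(s2), Char(s3)])
            text = decodedwords(encodedbytes, pwd)
            nfound = count(w -> w in text, commonwords)
            # Keep the password that decodes to the most common words.
            if nfound > bestcount
                pwdsave = pwd
                bestcount = nfound
            end
        end
    end
end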

97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122


In [16]:
text = decodedwords(encodedbytes, "exp");
textall = join(text, " ");
textascii = [Int(c) for c in textall];
textsum = sum(textascii);

In [17]:
println("Sum of ASCII characters: $textsum")

Sum of ASCII characters: 129448
