# Run Length Encoding

Implement run-length encoding and decoding.

Run-length encoding (RLE) is a simple form of data compression in which runs
(sequences of consecutive identical data elements) are replaced by a single data value and its count.

For example, the 53-character string below can be represented with only 13 characters.

```text
"WWWWWWWWWWWWBWWWWWWWWWWWWBBBWWWWWWWWWWWWWWWWWWWWWWWWB"  ->  "12WB12W3B24WB"
```
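
The character counts quoted above can be checked directly. The snippet below is only a sanity check: it rebuilds the original string from its runs instead of typing it out, and the variable names `original` and `encoded` are chosen here for illustration.

```julia
# Build the example string from its runs: 12 W, 1 B, 12 W, 3 B, 24 W, 1 B.
original = "W"^12 * "B" * "W"^12 * "B"^3 * "W"^24 * "B"
encoded  = "12WB12W3B24WB"

println(length(original))  # 53 characters before encoding
println(length(encoded))   # 13 characters after encoding
```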

RLE allows the original data to be perfectly reconstructed from
the compressed data, which makes it a lossless form of data compression.

```text
"AABCCCDEEEE"  ->  "2AB3CD4E"  ->  "AABCCCDEEEE"
```

For simplicity, you can assume that the unencoded string will only contain
the letters A through Z (either lower or upper case) and whitespace. This way
data to be encoded will never contain any numbers and numbers inside data to
be decoded always represent the count for the following character.
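
The assumption above is what keeps decoding unambiguous: if digits could appear in the data, a count could not be told apart from a literal character. A minimal sketch of a check for that assumption follows; `is_valid_input` is a hypothetical helper name and is not part of the exercise or its tests.

```julia
# Hypothetical helper (not part of the exercise): returns true when `s`
# contains only the letters A-Z / a-z and whitespace.
is_valid_input(s) = all(c -> c in 'A':'Z' || c in 'a':'z' || isspace(c), s)

is_valid_input("zzz ZZ  zZ")  # true
is_valid_input("AB3C")        # false: the digit would be read as a count after encoding
```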

## Source

Wikipedia [https://en.wikipedia.org/wiki/Run-length_encoding](https://en.wikipedia.org/wiki/Run-length_encoding)

## Version compatibility
This exercise has been tested on Julia versions >=1.0.

## Submitting Incomplete Solutions
It's possible to submit an incomplete solution so you can see how others have completed the exercise.

## Your solution

In [None]:
# first try
"""
    encode(s)

Performs Run-length encoding on string s.

"""
# function encode(s)
#     code = ""
#     letters = unique(s)
#     foreach(l -> code = join([code, count(c -> c == l, s), l]), letters)
#     return code
# end
function encode(s)
    length(s) == 0 && return ""
    code = ""
    letter_count = 0
    letter_change = s[begin]
    for l ∈ s
        if letter_change == l
            letter_count += 1
        else
            code = join([code, letter_count == 1 ? "" : letter_count, letter_change])
            letter_count = 1
            letter_change = l
        end
    end
    code = join([code, letter_count == 1 ? "" : letter_count, letter_change])
    return code
end


"""
    decode(s)

Performs Run-length decoding on string s.

"""
function decode(s)
    decoded = ""
    number = ""  # digits of the run length collected so far
    for c ∈ s
        if isdigit(c)
            number *= c
        else
            # a character with no preceding count is a run of length one
            decoded *= c^(isempty(number) ? 1 : parse(Int, number))
            number = ""
        end
    end
    return decoded
end

In [146]:
# submit
"""
    encode(s)

Performs Run-length encoding on string s.

"""
function encode(s)
    # replace each run of two or more identical chars by its length and the char;
    # single chars are not matched by the regex and pass through unchanged
    replace(s, r"(.)\1+" => x -> string(length(x), x[begin]))
end

"""
    decode(s)

Performs Run-length decoding on string s.

"""
function decode(s)
    # replace every number and char with char repeated number times
    replace(s, r"\d+." => x -> x[end] ^ parse(Int, x[begin:end-1]))
end

decode
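
To see what the two regular expressions in the cell above pick out, the matches can be listed with `eachmatch` from Base. This is an illustrative sketch only; the names `runs` and `tokens` are chosen here and do not appear in the solution.

```julia
# Runs of two or more identical characters, as matched by r"(.)\1+" in `encode`.
runs = [m.match for m in eachmatch(r"(.)\1+", "AABCCCDEEEE")]

# Count-plus-character tokens, as matched by r"\d+." in `decode`.
tokens = [m.match for m in eachmatch(r"\d+.", "2AB3CD4E")]

println(runs)    # the runs AA, CCC and EEEE; B and D are single characters and are not matched
println(tokens)  # the tokens 2A, 3C and 4E; B and D carry no count and are not matched
```

Because `replace` leaves unmatched text untouched, the single characters that neither pattern matches are copied through as they are, which is why no explicit count of 1 is needed.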

In [145]:
s = "AABWWFFFXXCCCCCY   YYWW"
replace(s, r"(.)\1+" => x -> string(length(x) == 1 ? "" : length(x)) * x[begin])
# string(3)

"2AB2W3F2X5CY3 2Y2W"

In [137]:
s = "AABWWFFFXXCCCCCY   YYWW"
encode(s)

decode(encode(s))

# s = "3W5B"
# letters = [l for l ∈ s if isletter(l)]
# decode(encode(s))
# num = split(join([isnumeric(n) ? n : ";" for n ∈ s]), ';')
# join([l^parse(Int, n) for (l,n) ∈ zip(['B', 'W'], ['4', '2'])])

"AABWWFFFXXCCCCCY   YYWW"

## Test suite

In [147]:
using Test

# include("run-length-encoding.jl")


# Tests adapted from `problem-specifications//canonical-data.json` @ v1.0.0
# Encode and decode the strings under the given specifications.

@testset "encode strings" begin
    @test encode("") == ""
    @test encode("XYZ") == "XYZ"
    @test encode("AABBBCCCC") == "2A3B4C"
    @test encode("WWWWWWWWWWWWBWWWWWWWWWWWWBBBWWWWWWWWWWWWWWWWWWWWWWWWB") == "12WB12W3B24WB"
    @test encode("aabbbcccc") == "2a3b4c"
    @test encode("  hsqq qww  ") == "2 hs2q q2w2 "
end

@testset "decode strings" begin
    @test decode("") == ""
    @test decode("XYZ") == "XYZ"
    @test decode("2A3B4C") == "AABBBCCCC"
    @test decode("12WB12W3B24WB") == "WWWWWWWWWWWWBWWWWWWWWWWWWBBBWWWWWWWWWWWWWWWWWWWWWWWWB"
    @test decode("2a3b4c") == "aabbbcccc"
    @test decode("2 hs2q q2w2 ") == "  hsqq qww  "
end

@testset "encode then decode" begin
    @test decode(encode("zzz ZZ  zZ")) == "zzz ZZ  zZ"
end

[37m[1mTest Summary:  | [22m[39m[32m[1mPass  [22m[39m[36m[1mTotal[22m[39m
encode strings | [32m   6  [39m[36m    6[39m
[37m[1mTest Summary:  | [22m[39m[32m[1mPass  [22m[39m[36m[1mTotal[22m[39m
decode strings | [32m   6  [39m[36m    6[39m
[37m[1mTest Summary:      | [22m[39m[32m[1mPass  [22m[39m[36m[1mTotal[22m[39m
encode then decode | [32m   1  [39m[36m    1[39m


Test.DefaultTestSet("encode then decode", Any[], 1, false)

## Prepare submission
To submit your exercise, you need to save your solution in a file called `run-length-encoding.jl` before using the CLI.
You can either create it manually or use the following functions, which will automatically write every notebook cell that starts with `# submit` to the file `run-length-encoding.jl`.


In [149]:
using Pkg; Pkg.add("Exercism")
using Exercism
Exercism.create_submission("run-length-encoding")

[32m[1m  Resolving[22m[39m package versions...
[32m[1mNo Changes[22m[39m to `~/.julia/environments/v1.5/Project.toml`
[32m[1mNo Changes[22m[39m to `~/.julia/environments/v1.5/Manifest.toml`


453