# String Compression

## Problem

Given a string such as 'AAAABBBBCCCCCDDEEEE', compress it to 'A4B4C5D2E4'. For this problem, you may "compress" runs of one or two letters even when the result is no shorter. For instance, it is fine for 'AAB' to return 'A2B1', even though this takes more space than the original.

The function should also be case sensitive, so that a string 'AAAaaa' returns 'A3a3'.

## Solution
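Python strings are immutable, so every concatenation builds a new string. The solutions below append to a result string directly to keep the code easy to follow; for long inputs, the usual alternative is to collect the pieces in a list and convert that list back into a string at the end with a **join** call.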

The solution below makes a single pass over the input, so it should run in roughly O(n) time and use O(n) space. Let's take a look, paying careful attention to the explanatory comments:

In [1]:
def compress(s):
    """
    Compress s with run-length encoding, without checking whether
    the result is actually shorter than the input.
    """
    
    # Begin Run as empty string
    r = ""
    l = len(s)
    
    # Check for length 0
    if l == 0:
        return ""
    
    # Check for length 1
    if l == 1:
        return s + "1"
    
    # Initialize values: the current run has length 1, and we start at index 1
    cnt = 1
    i = 1
    
    while i < l:
        
        # Check to see if it is the same letter
        if s[i] == s[i - 1]: 
            # Add a count if same as previous
            cnt += 1
        else:
            # Otherwise store the previous data
            r = r + s[i - 1] + str(cnt)
            cnt = 1
            
        # Advance the index so the while loop terminates
        i += 1
    
    # Append the final run, which the loop never stores
    r = r + s[i - 1] + str(cnt)
    
    return r

In [2]:
compress('AAAAABBBBCCCC')

'A5B4C4'
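
The Solution section mentions building the result from a list and a **join**. A minimal sketch of that variant is shown here; the name `compress_join` is hypothetical and used only for this illustration. It records each finished run in a list and joins the pieces once at the end, which avoids copying the partial result on every append.

```python
def compress_join(s):
    """Run-length encode s, collecting pieces in a list and joining once.

    compress_join is a hypothetical name used for this sketch only.
    """
    if not s:
        return ""

    parts = []   # alternating characters and counts, e.g. ["A", "5", "B", "4"]
    count = 1    # length of the current run

    # Walk over adjacent pairs of characters
    for prev, curr in zip(s, s[1:]):
        if curr == prev:
            count += 1
        else:
            # The run of `prev` just ended, so record it
            parts.append(prev)
            parts.append(str(count))
            count = 1

    # Record the final run, which the loop never closes
    parts.append(s[-1])
    parts.append(str(count))

    return "".join(parts)


print(compress_join("AAAAABBBBCCCC"))  # A5B4C4
```

The behavior is meant to match `compress`; only the way the output is assembled differs.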

In [8]:
def compress2(s):
    # Initialize an empty string to store the compressed string
    compressed = ""

    # Initialize a counter to count the number of consecutive occurrences of each character
    count = 0

    # Iterate through the characters in the string
    for i in range(len(s)):
        # Increment the counter
        count += 1

        # If the current character is different from the next character,
        # or if it is the last character in the string
        if i + 1 >= len(s) or s[i] != s[i + 1]:
            # Append the current character and its count to the compressed string
            compressed += s[i] + str(count)

            # Reset the counter
            count = 0

    # Return the compressed string
    return compressed


In [17]:
def compress3(s):
    if len(s) < 1:
        return ""

    compressed_s = ""
    curr_char = s[0]
    curr_count = 0
    for i, char in enumerate(s):
        if char == curr_char:
            curr_count += 1
        else:
            # Store
            compressed_s += curr_char + str(curr_count)
            # Restart
            curr_char = char
            curr_count = 1
        
        if i == len(s) - 1:
            # Store
            compressed_s += curr_char + str(curr_count)            
    return compressed_s

In [18]:
compress3('AABBCC')

'A2B2C2'
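
For comparison, the same run-by-run grouping can be sketched with the standard library's `itertools.groupby`, which yields one group per run of consecutive equal items. This is an alternative take rather than one of the solutions above, and `compress_groupby` is a hypothetical name.

```python
from itertools import groupby


def compress_groupby(s):
    """Run-length encode s using itertools.groupby (hypothetical helper name)."""
    # groupby yields a (character, run) pair for each run of consecutive
    # equal characters; counting the items in `run` gives the run length.
    return "".join(f"{char}{sum(1 for _ in run)}" for char, run in groupby(s))


print(compress_groupby("AABBCC"))  # A2B2C2
```

Because `groupby` compares characters with `==`, upper and lower case letters form separate runs, and the empty string yields an empty result without a special case.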

## Test Your Solution

In [19]:
"""
RUN THIS CELL TO TEST YOUR SOLUTION
"""
class TestCompress():

    def test(self, sol):
        assert (sol('') == '')
        assert (sol('AABBCC') == 'A2B2C2')
        assert (sol('AAABCCDDDDD') == 'A3B1C2D5')
        print('ALL TEST CASES PASSED')

# Run Tests
t = TestCompress()
t.test(compress)
t.test(compress2)
t.test(compress3)

ALL TEST CASES PASSED
ALL TEST CASES PASSED
ALL TEST CASES PASSED
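
The test cell above does not exercise the case-sensitivity and short-run examples from the problem statement. A small sketch of extra checks follows; `extra_checks` is a hypothetical helper, not part of the original test class, and the single-character and 'ABA' cases are added assumptions about the intended behavior (a lone letter gets a count of 1, and separated runs of the same letter are not merged).

```python
def extra_checks(sol):
    """Check a compress-style function against additional cases.

    extra_checks is a hypothetical helper for illustration only.
    """
    cases = {
        # Examples taken from the problem statement
        "AAAABBBBCCCCCDDEEEE": "A4B4C5D2E4",
        "AAB": "A2B1",
        "AAAaaa": "A3a3",
        # Assumed edge cases, not stated in the problem
        "A": "A1",
        "ABA": "A1B1A1",
    }
    for text, expected in cases.items():
        actual = sol(text)
        assert actual == expected, f"{text!r}: expected {expected!r}, got {actual!r}"
    print("EXTRA TEST CASES PASSED")
```

It takes the solution function as its argument, in the same way as `t.test(compress)` above, for example `extra_checks(compress)`.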


## Good Job!