# Lesson Goals:

- Understand exception handling syntax and techniques
- Understand functions and the difference between recursive and iterative use
- Understand complex dictionary use
- Understand techniques for formatting output for user presentation

# Exception handling
Exception handling is important in real-world use of any programming language. Not every action succeeds 100% of the time, and depending on your intended use that can be perfectly OK. Let's explore this technique with an example.

In [1]:
import os     # used for various file/directory operations
import pprint # used for cleaner dictionary printing

try:
    from termcolor import colored # termcolor module used for fancy printing to user
except ModuleNotFoundError:
    !pip install termcolor
    from termcolor import colored

Collecting termcolor
  Using cached https://files.pythonhosted.org/packages/8a/48/a76be51647d0eb9f10e2a4511bf3ffb8cc1e6b14e9e4fab46173aa79f981/termcolor-1.1.0.tar.gz
Building wheels for collected packages: termcolor
  Running setup.py bdist_wheel for termcolor: started
  Running setup.py bdist_wheel for termcolor: finished with status 'done'
  Stored in directory: C:\Users\estyw\AppData\Local\pip\Cache\wheels\7c\06\54\bc84598ba1daf8f970247f550b175aaaee85f68b4b0c5ab2c6
Successfully built termcolor
Installing collected packages: termcolor
Successfully installed termcolor-1.1.0


### Warning
The above is an anti-pattern and not for real-world use in your code. It's bad practice to assume on behalf of the users of your code that they want additional packages installed, and it could become a security vulnerability if ever actively targeted. Additionally, you won't always know whether you have permission to install packages, which depends on whether the code is run using the system Python or in a virtual environment.

Additional reading: https://docs.python.org/3/tutorial/errors.html
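
One way to avoid installing packages on the user's behalf is to fall back to a plain-text stand-in when the import fails. The sketch below is an illustration only: the fallback `colored` function is a hypothetical substitute written for this example, not part of termcolor. The second half shows the full `try` / `except` / `else` / `finally` layout on a simple conversion.

```python
try:
    from termcolor import colored
except ModuleNotFoundError:
    # Hypothetical fallback with the same call shape: ignore styling, return plain text.
    def colored(text, color=None, on_color=None, attrs=None):
        return text

try:
    value = int('not a number')
except ValueError as error:
    # Runs only when the conversion raised ValueError
    print(colored('[!] Could not convert:', 'red'), error)
else:
    # Runs only when no exception was raised
    print(colored('[+] Converted:', 'green'), value)
finally:
    # Runs whether or not the conversion succeeded
    print('Done.')
```

With this layout the rest of the notebook can keep calling `colored()` whether or not termcolor is installed; the output simply loses its color.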

In [2]:
'''
What termcolor gives us is easy use of ANSI escape sequences for color coding our output.
'''
print(colored('hello', 'red'), colored('world', 'green'))

hello world
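
To see what termcolor is doing for us, here is a minimal sketch that builds the same kind of ANSI escape sequences by hand. The helper name `paint` and the `CODES` table are made up for this illustration; the numeric codes (31 red, 32 green, 34 blue, 0 reset) are the standard ANSI foreground color codes.

```python
RESET = '\033[0m'                               # escape sequence that resets styling
CODES = {'red': 31, 'green': 32, 'blue': 34}    # standard ANSI foreground codes

def paint(text, color):
    """Wrap text in the escape sequence for the given color, then reset."""
    return f'\033[{CODES[color]}m{text}{RESET}'

print(paint('hello', 'red'), paint('world', 'green'))
```

In a terminal that understands ANSI sequences this prints the two words in color; elsewhere the raw codes may show up as literal characters.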


# Functions

#### Recursive Functions
It's been said that to understand recursion you must first understand recursion. In programming terms, a recursive function is one that calls itself inside its own function definition. Its value in practice depends on the programming language and use case. It can result in a very compact piece of code, or, if the function never reaches a case where it stops calling itself, in runaway recursion that never finishes on its own (Python stops it by raising a `RecursionError` once the recursion limit is exceeded).

The example below uses a recursive function so that each call can be passed a directory path together with a reference to the matching nested dictionary, which may sit several folders deep in the dictionary mapping of file and folder names.

#### Iterative Functions
Iteration is a repeated action through a set of data until completed or a condition is reached. Python is very good at iteration and you are familiar with iterative actions such as `for` and `while` loops on data.
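
To make the contrast concrete, here is a small sketch that solves the same problem both ways. The function names are made up for this illustration and are not used elsewhere in the lesson.

```python
def total_recursive(numbers):
    """Sum a list by calling the function again on an ever smaller slice."""
    if not numbers:          # base case: nothing left, stop recursing
        return 0
    return numbers[0] + total_recursive(numbers[1:])


def total_iterative(numbers):
    """Sum the same list with a plain for loop."""
    total = 0
    for number in numbers:
        total += number
    return total


values = [1, 2, 3, 4]
print(total_recursive(values))  # 10
print(total_iterative(values))  # 10
```

The base case is what keeps the recursive version from calling itself forever; the iterative version needs no such guard because the loop ends when the data runs out.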

In [3]:
'''
Populate a dictionary given a path and an empty dictionary.

This uses os.scandir, which provides more metadata about the items
in the path than os.listdir. It yields os.DirEntry objects that we
can enumerate and act upon.

https://docs.python.org/3/library/os.html#os.scandir
https://docs.python.org/3/library/os.html#os.DirEntry
'''

def recursive_directory_tree(path, structure):

    try:
        with os.scandir(path) as it:
            for entry in it:

                # Add files that are not hidden files (starting with . on Linux/Mac)
                if not entry.name.startswith('.') and entry.is_file():
                    print(colored("[+] Found file, updating value None for key:", 'green'),
                          colored(entry.name, 'blue'))
                    structure.update({entry.name: None})

                # Add directories that are not hidden and not shortcuts/symlinks
                elif not entry.name.startswith('.') and entry.is_dir() and not entry.is_symlink():
                    print(colored("[+] Found directory, updating value {} for key:", 'green'),
                          colored(entry.name, 'blue'))
                    structure.update({entry.name: {}})

                    print(colored("[*] Performing recursive function call on:", 'green'),
                          colored(entry.path, 'blue'))
                    recursive_directory_tree(entry.path, structure[entry.name])

    # Print colored warning when there is no permissions to directory
    except PermissionError:
        print(colored("[!] Skipping due to PermissionError for path:", 'red'),
            colored(path, 'red', attrs=['bold']))
    
    # Skip "OSError: [Errno 34] Result too large"; re-raise any other OSError
    except OSError as e:
        if e.errno != 34:
            raise

# With the above defined, we can define a path to enumerate and provide
# an empty dictionary.
#
# You may enumerate /Users (on Mac) or "C:\Users" (on Windows); however,
# be prepared for it to take a bit longer to complete.
path = '.'
structure = {}
recursive_directory_tree(path, structure)

[+] Found file, updating value None for key: api_output.csv
[+] Found file, updating value None for key: basic_database_tech.ipynb
[+] Found file, updating value None for key: basic_web_beautifulsoup4.ipynb
[+] Found file, updating value None for key: basic_web_plus_api_call_interactions.ipynb
[+] Found file, updating value None for key: camels.jpg
[+] Found file, updating value None for key: Classes.ipynb
[+] Found file, updating value None for key: data_serialization.ipynb
[+] Found file, updating value None for key: dedup_files.ipynb
[+] Found directory, updating value {} for key: duplicated
[*] Performing recursive function call on: .\duplicated
[+] Found file, updating value None for key: bptgxdeeipxu.txt
[+] Found file, updating value None for key: fjiwtpgtgcdb.txt
...

[*] Performing recursive function call on: .\matplotlibstuff\basemap-1.1.0\geos-3.3.3\include\geos\index\sweepline
[+] Found file, updating value None for key: Makefile.am
[+] Found file, updating value None for key: Makefile.in
[+] Found file, updating value None for key: SweepLineEvent.h
[+] Found file, updating value None for key: SweepLineIndex.h
[+] Found file, updating value None for key: SweepLineInterval.h
[+] Found file, updating value None for key: SweepLineOverlapAction.h
[+] Found file, updating value None for key: indexBintree.h
[+] Found file, updating value None for key: indexChain.h
[+] Found file, updating value None for key: indexQuadtree.h
[+] Found file, updating value None for key: indexStrtree.h
[+] Found file, updating value None for key: indexSweepline.h

[+] Found file, updating value None for key: Makefile.am
[+] Found file, updating value None for key: Makefile.in
[+] Found directory, updating value {} for key: python
[*] Performing recursive function call on: .\matplotlibstuff\basemap-1.1.0\geos-3.3.3\swig\python
[+] Found file, updating value None for key: geos.pth
[+] Found file, updating value None for key: geos.py
[+] Found file, updating value None for key: geos_wrap.cxx
[+] Found file, updating value None for key: Makefile.am
[+] Found file, updating value None for key: Makefile.in
[+] Found file, updating value None for key: python.i
[+] Found directory, updating value {} for key: tests
[*] Performing recursive function call on: .\matplotlibstuff\basemap-1.1.0\geos-3.3.3\swig\python\tests
...

[+] Found file, updating value None for key: pj_msfn.c
[+] Found file, updating value None for key: pj_mutex.c
[+] Found file, updating value None for key: PJ_natearth.c
[+] Found file, updating value None for key: PJ_natearth2.c
[+] Found file, updating value None for key: PJ_nell.c
[+] Found file, updating value None for key: PJ_nell_h.c
[+] Found file, updating value None for key: PJ_nocol.c
[+] Found file, updating value None for key: PJ_nsper.c
[+] Found file, updating value None for key: PJ_nzmg.c
[+] Found file, updating value None for key: PJ_ob_tran.c
[+] Found file, updating value None for key: PJ_ocea.c
[+] Found file, updating value None for key: PJ_oea.c
[+] Found file, updating value None for key: PJ_omerc.c
...

[+] Found file, updating value None for key: nzlfbmbbjbgn.txt
[+] Found file, updating value None for key: ohrgpnbfpmwb.txt
[+] Found file, updating value None for key: onedtkhfbrxi.txt
[+] Found file, updating value None for key: oqqlhlrckcqg.txt
[+] Found file, updating value None for key: osxbesnjeecg.txt
[+] Found file, updating value None for key: ouhoyiysioiu.txt
[+] Found file, updating value None for key: owytaxovvnqj.txt
[+] Found file, updating value None for key: ozclhofadaoc.txt
[+] Found file, updating value None for key: pgwtzgvwuuok.txt
[+] Found file, updating value None for key: poojoxkjizfy.txt
[+] Found file, updating value None for key: pqqwmeldolmp.txt
[+] Found file, updating value None for key: ptpwwqxzcilz.txt
...

#### Exercise 1

Earlier we imported the 'pprint' module. Let's use that now and investigate the output.

The indentation helps the human eye quite a bit here. Note that this is a very complicated dictionary. Nothing stops you from looking up a nested key directly using bracketed dictionary notation, one pair of brackets per level.

In [8]:
pprint.pprint(structure)

{'Learning_Python3.ipynb': None,
 'Pandas_Tutorial': {'Files': {'BL-Flickr-Images-Book.csv': None,
                               'olympics.csv': None,
                               'university_towns.txt': None},
                     'Pandas_Tutorial.ipynb': None},
 'PythonClasses.ipynb': None,
 'basic_database_tech.ipynb': None,
 'basic_web_beautifulsoup4.ipynb': None,
 'basic_web_plus_api_call_interactions.ipynb': None,
 'dedup_files.ipynb': None,
 'duplicated': {'bptgxdeeipxu.txt': None,
                'fjiwtpgtgcdb.txt': None,
                'ibdlnqtlvaim.txt': None,
                'icspsjlksrbn.txt': None,
                'ioipomykrfom.txt': None,
                'jjiihzujdnki.txt': None,
                'jnvkounpkeit.txt': None,
                'lixeoutjwbtq.txt': None,
                'ljmndgqywrsi.txt': None,
                'lksequmqneep.txt': None,
                'nexygkahtozc.txt': None,
                'nhjxkxizisju.txt': None,
                'nlptvthvukcc.txt': None,

In [None]:
# NOTE: This may vary depending on Python version installed and Windows/Linux
# Ensure you can print a valid path below based off looking at the earlier dictionary structure contents
pprint.pprint(structure['venv']['lib']['python3.7'])
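
Chained brackets raise a `KeyError` as soon as one level is missing, which is easy to hit when paths differ between machines. The sketch below shows both styles on a small hand-written dictionary shaped like the output above. The `lookup` helper is a hypothetical convenience written for this illustration, not something defined elsewhere in the lesson.

```python
def lookup(tree, *keys):
    """Follow keys one level at a time; return None if any key is missing."""
    current = tree
    for key in keys:
        if not isinstance(current, dict) or key not in current:
            return None
        current = current[key]
    return current


sample = {'Pandas_Tutorial': {'Files': {'olympics.csv': None},
                              'Pandas_Tutorial.ipynb': None}}

print(sample['Pandas_Tutorial']['Files'])          # direct bracket notation
print(lookup(sample, 'Pandas_Tutorial', 'Files'))  # same nested dictionary
print(lookup(sample, 'venv', 'lib'))               # None instead of a KeyError
```

One caveat of this design: because files are stored with the value `None`, a `None` result does not tell you whether the key was a file or was missing altogether.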

#### Exercise 2

Convert the following pseudo code to a valid recursive function.

Note that we haven't yet explored arbitrary keyword arguments passed to a function. Here we have an optional 'depth' keyword argument that is used to pad the output appropriately for the human at the terminal. You'll need to do some research on the syntax of kwargs to see how to pass this appropriately.
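
As a starting point for that research, here is a minimal sketch of the `**kwargs` syntax. The function `describe` and the `indent` keyword are made up for this illustration.

```python
def describe(name, **kwargs):
    """Extra keyword arguments are collected into the dict 'kwargs'."""
    indent = kwargs.get('indent', 0)   # fall back to 0 when not supplied
    return ' ' * indent + name


print(describe('root'))              # no keyword given, default indent of 0
print(describe('child', indent=2))   # keyword passed by name

options = {'indent': 4}
print(describe('leaf', **options))   # a dict unpacked into keyword arguments
```

Inside the function `kwargs` is an ordinary dictionary, so `kwargs.get()` with a default is a simple way to make a keyword optional.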

In [9]:
'''
Print a tree directory from the structure dictionary

This is loosely based on the Linux 'tree' command.

https://www.cyberciti.biz/faq/linux-show-directory-structure-command-line/
'''

def recursive_directory_print(structure, **kwargs):
    depth       = kwargs.get('depth', 0)   # nesting level, 0 at the top
    file_prefix = '|' + '-' * depth
    dir_prefix  = '+' + '-' * depth

    for item in structure:

        # if type of structure[item] is dictionary
        if isinstance(structure[item], dict):
            # print directory prefix, item
            print(dir_prefix, item)
            # add 1 to depth of printing
            # recursive function call passing structure[item] and depth kwarg
            recursive_directory_print(structure[item], depth=depth + 1)

        # if structure[item] equals None
        elif structure[item] is None:
            # print file prefix, item
            print(file_prefix, item)

recursive_directory_print(structure)

| basic_database_tech.ipynb
| basic_web_beautifulsoup4.ipynb
| basic_web_plus_api_call_interactions.ipynb
| dedup_files.ipynb
+ duplicated
|- bptgxdeeipxu.txt
|- fjiwtpgtgcdb.txt
|- ibdlnqtlvaim.txt
|- icspsjlksrbn.txt
|- ioipomykrfom.txt
|- jjiihzujdnki.txt
|- jnvkounpkeit.txt
|- lixeoutjwbtq.txt
|- ljmndgqywrsi.txt
|- lksequmqneep.txt
|- nexygkahtozc.txt
|- nhjxkxizisju.txt
|- nlptvthvukcc.txt
|- ocdwilpcpyme.txt
|- odzmgnfcelwt.txt
|- ogtwqijwefst.txt
|- ovcsvnhebmxw.txt
|- pdqzkppwvtbg.txt
|- pedjujehkslo.txt
|- qcdjkdlivhdl.txt
|- qqrtcsambixf.txt
|- rgnxruhulrqj.txt
|- rsxuxfpjcpjg.txt
|- rvavktnfzdhr.txt
|- sfhxhyvdufzy.txt
|- snqnqfljwwyi.txt
|- tfceklaffttv.txt
|- tnyyedaupagf.txt
|- wexmbvsmavlo.txt
|- xksdojojgywa.txt
|- xpqwrdqcodwb.txt
|- xqagxkswzyuu.txt
|- xwbozdzaqmbm.txt
|- ydutrxodquup.txt
|- yvmexqkrvceb.txt
|- zblwvaddjjpv.txt
| exceptions_functions_and_dictionaries.ipynb
| Learning_Python3.ipynb
+ Pandas_Tutorial
+- Files
|-- BL-Flickr-Images-Book.csv
|-- olympics.

#### Exercise 3

Copy/paste recursive_directory_tree() from above into the code block below. Add a new kwarg for a specific file type and add an "endswith" check for that file type. Search for 'mp3' or another type of media you have on your system, and ensure that after running the block of code below, recursive_directory_print() from above only prints that type of file.

In [11]:
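# Exercise 3: recursive_directory_tree() with an optional 'file_type' kwarg

def recursive_directory_tree(path, structure, **kwargs):
    file_type = kwargs.get('file_type')   # e.g. 'mp3'; None keeps every file

    try:
        with os.scandir(path) as it:
            for entry in it:
                # Skip hidden files and directories
                if entry.name.startswith('.'):
                    continue

                if entry.is_file():
                    # Keep the file only if no type was requested or the name matches
                    if file_type is None or entry.name.endswith('.' + file_type):
                        structure[entry.name] = None

                elif entry.is_dir() and not entry.is_symlink():
                    structure[entry.name] = {}
                    recursive_directory_tree(entry.path, structure[entry.name],
                                             file_type=file_type)

    except PermissionError:
        print(colored("[!] Skipping due to PermissionError for path:", 'red'),
              colored(path, 'red', attrs=['bold']))

# Rebuild the dictionary with the filter, then print it with the function from Exercise 2
structure = {}
recursive_directory_tree(path, structure, file_type='docx')
recursive_directory_print(structure)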


#### Exercise 4

Review alternate techniques for accomplishing the above. Keep in mind that sometimes you won't have to reinvent the wheel.

In [12]:
# Basic example of printing all files and directories (non tree style)
for root, dirs, files in os.walk(path, topdown=False):
    for name in files:
        print(os.path.join(root, name))
    for name in dirs:
        print(os.path.join(root, name))

.\.ipynb_checkpoints\basic_database_tech-checkpoint.ipynb
.\.ipynb_checkpoints\basic_web_beautifulsoup4-checkpoint.ipynb
.\.ipynb_checkpoints\dedup_files-checkpoint.ipynb
.\.ipynb_checkpoints\exceptions_functions_and_dictionaries-checkpoint.ipynb
.\duplicated\bptgxdeeipxu.txt
.\duplicated\fjiwtpgtgcdb.txt
.\duplicated\ibdlnqtlvaim.txt
.\duplicated\icspsjlksrbn.txt
.\duplicated\ioipomykrfom.txt
.\duplicated\jjiihzujdnki.txt
.\duplicated\jnvkounpkeit.txt
.\duplicated\lixeoutjwbtq.txt
.\duplicated\ljmndgqywrsi.txt
.\duplicated\lksequmqneep.txt
.\duplicated\nexygkahtozc.txt
.\duplicated\nhjxkxizisju.txt
.\duplicated\nlptvthvukcc.txt
.\duplicated\ocdwilpcpyme.txt
.\duplicated\odzmgnfcelwt.txt
.\duplicated\ogtwqijwefst.txt
.\duplicated\ovcsvnhebmxw.txt
.\duplicated\pdqzkppwvtbg.txt
.\duplicated\pedjujehkslo.txt
.\duplicated\qcdjkdlivhdl.txt
.\duplicated\qqrtcsambixf.txt
.\duplicated\rgnxruhulrqj.txt
.\duplicated\rsxuxfpjcpjg.txt
.\duplicated\rvavktnfzdhr.txt
.\duplicated\sfhxhyvdufzy.txt
.\d
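
Another standard-library alternative is `pathlib`, whose `Path.rglob()` walks a directory tree and filters by a glob pattern in one call. A minimal sketch follows; the helper name `list_files` is made up for this illustration. Unlike the recursive function above, it does not skip hidden files.

```python
from pathlib import Path

def list_files(root, pattern='*'):
    """Return every file under root whose name matches the glob pattern."""
    return sorted(p for p in Path(root).rglob(pattern) if p.is_file())


# Print every .txt file below the current directory
for found in list_files('.', '*.txt'):
    print(found)
```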

In [24]:
# A plain 'C:\Users\...' string fails with "SyntaxError: (unicode error)
# 'unicodeescape' codec can't decode bytes in position 2-3: truncated
# \UXXXXXXXX escape" because \U starts a Unicode escape. A raw string (r'...')
# keeps the backslashes literal.
path = r'C:\Users\estyw\Desktop'
!pip install py-tree
import py_tree
py_tree.main(path)
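
Backslashes in Windows paths collide with Python's string escapes, which is what triggers the 'unicodeescape' error for a path beginning with `C:\U`. Here is a short sketch of three spellings that avoid it. The user name `example` is a placeholder, not a real account.

```python
raw_path     = r'C:\Users\example\Desktop'     # raw string: backslashes stay literal
escaped_path = 'C:\\Users\\example\\Desktop'   # each backslash escaped by hand
forward_path = 'C:/Users/example/Desktop'      # forward slashes, which Windows also accepts

print(raw_path == escaped_path)   # True: both spell the same string
print(raw_path)
```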

#### Advanced: Exercise 5

Copy/paste the recursive_directory_tree() function and rename it to just directory_tree(). Convert the function to an iterative version in which you keep track of the current path you are scanning and iterate through all contents of the file path. You will not call the function within itself in this variant.

In [None]:
# NOT IMPLEMENTED YET - exercise 5 goes here
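
If you get stuck, one common pattern for this kind of conversion is to replace the recursive call with an explicit list of work still to do. The sketch below is one possible approach, not the only valid answer. It assumes a slightly different signature from the recursive version (it creates and returns the dictionary itself) and leaves out the colored progress messages.

```python
import os

def directory_tree(path):
    """Build a nested dict for path without recursion, using an explicit stack."""
    structure = {}
    pending = [(path, structure)]      # (directory to scan, dict to fill)

    while pending:
        current_path, current_dict = pending.pop()
        try:
            with os.scandir(current_path) as it:
                for entry in it:
                    # Skip hidden files and directories
                    if entry.name.startswith('.'):
                        continue
                    if entry.is_file():
                        current_dict[entry.name] = None
                    elif entry.is_dir() and not entry.is_symlink():
                        current_dict[entry.name] = {}
                        # Queue the subdirectory instead of calling ourselves
                        pending.append((entry.path, current_dict[entry.name]))
        except PermissionError:
            print("[!] Skipping due to PermissionError for path:", current_path)

    return structure
```

Each queued pair carries the dictionary that belongs to that directory, so the nested result ends up the same shape as the recursive version produces.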