## Naming Convention
The notebooks are named `dd-xyz-title.ipynb` where:
* `dd` is an integer indicating the notebook sequence. This is critical when there are dependencies between notebooks.
* `xyz` is the author's initials, which helps avoid namespace clashes when multiple people commit to the same repo.
* `title` is the name of the notebook, with words separated by hyphens.
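
As a quick check of the naming scheme above, here is a minimal sketch that parses a filename into its three parts. The regular expression, the `NotebookName` type, and the `parse_notebook_name` helper are hypothetical and not part of the `src` module; the sketch also assumes initials contain only letters and titles contain only letters and digits.

```python
import re
from typing import NamedTuple, Optional

# Assumed pattern: digits, then alphabetic initials, then a hyphenated title.
NOTEBOOK_NAME = re.compile(
    r"^(?P<seq>\d+)-(?P<initials>[A-Za-z]+)-"
    r"(?P<title>[A-Za-z0-9]+(?:-[A-Za-z0-9]+)*)\.ipynb$"
)


class NotebookName(NamedTuple):
    seq: int        # notebook sequence number (`dd`)
    initials: str   # author's initials (`xyz`)
    title: str      # hyphen-separated title


def parse_notebook_name(filename: str) -> Optional[NotebookName]:
    """Return the parsed parts, or None if the name does not fit the scheme."""
    match = NOTEBOOK_NAME.match(filename)
    if match is None:
        return None
    return NotebookName(int(match["seq"]), match["initials"], match["title"])
```

Calling `parse_notebook_name("00-xyz-sample-notebook.ipynb")` yields sequence `0`, initials `xyz`, and title `sample-notebook`, while a name such as `notes.ipynb` returns `None`.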


## Useful Header Cells
Make the Jupyter notebook use the full screen width:

In [None]:
from IPython.display import display, HTML
display(HTML("<style>.container { width:100% !important; }</style>"))

When developing code in the `src` module, it's very useful to enable auto-reload:

In [None]:
%load_ext autoreload
%autoreload 2

## Python Libraries
Imports you'll almost always want:

In [None]:
# Python standard library imports, alphabetized
import pathlib

# 3rd party Python modules, alphabetized
import pandas as pd

# Source module imports
from src import paths
from src.data import DataSource, Dataset, Catalog

## Logging
Enable logging and crank up the log level to DEBUG. This is particularly useful when you are developing code in your project module and calling it from a notebook.

In [None]:
import logging
from src.log import logger

logger.setLevel(logging.DEBUG)
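
To see what changing the level does, here is a minimal sketch using only the standard `logging` module. The logger name `demo_project` and the `ListHandler` class are hypothetical stand-ins; the project's own `src.log.logger` may be configured differently.

```python
import logging

# Hypothetical logger; in the project this role is played by src.log.logger.
demo_logger = logging.getLogger("demo_project")
demo_logger.propagate = False  # keep the demo output out of the root logger


class ListHandler(logging.Handler):
    """Collect formatted messages in a list so they can be inspected."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


handler = ListHandler()
demo_logger.addHandler(handler)

demo_logger.setLevel(logging.INFO)
demo_logger.debug("hidden at INFO")      # dropped: DEBUG is below INFO

demo_logger.setLevel(logging.DEBUG)
demo_logger.debug("visible at DEBUG")    # recorded: level now admits DEBUG

print(handler.messages)
```

Only the second message reaches the handler, which is why setting the level to DEBUG surfaces the detailed messages emitted by code under development.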

## Working with a Dataset from the catalog
List the available datasets:


In [None]:
c = Catalog.load('datasets'); c

Note: The first call to a `load` function on a new dataset may be slow, because it does all the work of generating and verifying the dataset's contents. Subsequent runs use a cached copy of the dataset and are quick.

In [None]:
%%time
ds = Dataset.load('20_newsgroups')  # replace 20_newsgroups with the name of a dataset you have a recipe for

In [None]:
len(ds.data)

In [None]:
ds.data[:5]

In [None]:
print(ds.README)

In [None]:
print(ds.LICENSE)

If the cells above return data, you're up and running with a working installation.

## Some data science libraries built into the base conda environment

In [None]:
# basic data science and visualization libraries
import sklearn
import matplotlib
import scipy
import pandas
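
If you want to confirm which of these libraries are present, and in which versions, the sketch below may help. The `installed_versions` helper is hypothetical and not part of the `src` module; it assumes each package exposes a `__version__` attribute and reports `"unknown"` when it does not.

```python
import importlib


def installed_versions(module_names):
    """Map each module name to its version string, or None if it cannot be imported."""
    versions = {}
    for name in module_names:
        try:
            module = importlib.import_module(name)
        except Exception:  # a broken install may raise more than ImportError
            versions[name] = None
            continue
        versions[name] = getattr(module, "__version__", "unknown")
    return versions


report = installed_versions(["sklearn", "matplotlib", "scipy", "pandas"])
for name, version in report.items():
    print(f"{name}: {version if version is not None else 'not installed'}")
```

A `None` entry suggests the environment is missing that package or the notebook is running on a different kernel than the project's conda environment.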