# National Park Boundaries

The __[National Park Boundary Set](https://catalog.data.gov/dataset/national-park-boundariesf0a4c)__ seems like it might be fun to explore.

In [1]:
# Import the pandas module to do some exploration of data.
import pandas as pd

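# Some cell contents get truncated when displayed; lift the column width limit so they show in full.
# Older pandas releases accepted -1 here; None is the supported way to disable the limit.
pd.set_option('display.max_colwidth', None)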

# Save the csv file to a sensible location. Use pandas to read in the file.
nps_boundaries = pd.read_csv('data/nps_boundary.csv')

In [2]:
# What kinds of columns do we have?
nps_boundaries.dtypes

GIS_LOC_ID    float64
UNIT_CODE     object 
GROUP_CODE    float64
UNIT_NAME     object 
UNIT_TYPE     object 
META_MIDF     float64
LANDS_CODE    float64
DATE_EDIT     object 
GIS_NOTES     object 
observed      float64
dtype: object
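
`DATE_EDIT` is read in as `object`, meaning pandas treats it as plain strings. The sketch below shows one way it could be converted to a real datetime column. It runs on a small hand-built frame (`sample` is a hypothetical stand-in, not the real file) and assumes the values follow a `YYYY/MM/DD` pattern such as `2006/01/04`.

```python
import pandas as pd

# Hypothetical stand-in for a few rows of the real file (not loaded from disk).
sample = pd.DataFrame({
    "UNIT_CODE": ["NACE", "APCO", "ORPI"],
    "DATE_EDIT": [None, "2006/01/04", "2007/07/10"],
})

# Assumption: dates are written as YYYY/MM/DD. Missing entries become NaT.
sample["DATE_EDIT"] = pd.to_datetime(sample["DATE_EDIT"], format="%Y/%m/%d")

print(sample.dtypes)
```

With a datetime dtype, the column can be sorted or filtered by year instead of being compared as text.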

# Content Analysis

Let's take a look at the first few rows of data to see what we're working with.

In [3]:
nps_boundaries.head()

Unnamed: 0,GIS_LOC_ID,UNIT_CODE,GROUP_CODE,UNIT_NAME,UNIT_TYPE,META_MIDF,LANDS_CODE,DATE_EDIT,GIS_NOTES,observed
0,,NACE,,Marion Park,Park,,,,,
1,,APCO,,Appomattox Court House,National Historical Park,,,2006/01/04,Lands - http://landsnet.nps.gov/tractsnet/documents/APCO/Metadata/apco_metadata.xml,
2,,ORPI,,Organ Pipe Cactus,National Monument,,,2007/07/10,Lands - http://landsnet.nps.gov/tractsnet/documents/ORPI/Metadata/orpi_metadata.xml,
3,,PINN,,Pinnacles,National Monument,,,,Shifted 0.06 miles,
4,,TUIN,,Tuskegee Institute,National Historical Site,,,,Good,


There is a lot of missing data (NaN values), which makes this dataset look almost useless, and there is not much context to go on either. Let's look at the last few rows to see whether the pattern holds throughout.

In [6]:
nps_boundaries.tail()

Unnamed: 0,GIS_LOC_ID,UNIT_CODE,GROUP_CODE,UNIT_NAME,UNIT_TYPE,META_MIDF,LANDS_CODE,DATE_EDIT,GIS_NOTES,observed
505,,KAHO,,Kaloko-Honokohau,National Historical Park,,,2008/06/25,Lands - http://landsnet.nps.gov/tractsnet/documents/KAHO/Metadata/kaho_metadata.xml,
506,,CANA,,Canaveral,National Seashore,,,2008/07/30,Lands - http://landsnet.nps.gov/tractsnet/documents/CANA/Metadata/cana_metadata.xml,
507,,MEVE,,Mesa Verde,National Park,,,2008/08/22,Lands - http://landsnet.nps.gov/tractsnet/documents/MEVE/Metadata/meve_metadata.xml,
508,,VAFO,,Valley Forge,National Historical Park,,,2008/09/10,Lands - http://landsnet.nps.gov/tractsnet/documents/VAFO/Metadata/vafo_metadata.xml,
509,,CABR,,Cabrillo,National Monument,,,2008/08/28,Lands - http://landsnet.nps.gov/tractsnet/documents/CABR/Metadata/cabr_metadata.xml,


Indeed, the results look just as meaningless at the bottom.
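
Eyeballing the head and tail only samples ten rows. A rough way to check the whole frame is to compute the share of missing values per column. The sketch below uses a small hand-built frame (`sample`, a hypothetical stand-in with a subset of the columns) in place of `nps_boundaries`; the same two lines should apply to the real DataFrame.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in built from a few of the rows shown above.
sample = pd.DataFrame({
    "GIS_LOC_ID": [np.nan, np.nan, np.nan],
    "UNIT_CODE": ["NACE", "APCO", "PINN"],
    "UNIT_NAME": ["Marion Park", "Appomattox Court House", "Pinnacles"],
    "DATE_EDIT": [None, "2006/01/04", None],
    "GIS_NOTES": [
        None,
        "Lands - http://landsnet.nps.gov/tractsnet/documents/APCO/Metadata/apco_metadata.xml",
        "Shifted 0.06 miles",
    ],
})

# Fraction of missing values in each column, worst first.
missing_share = sample.isna().mean().sort_values(ascending=False)

# Columns with no data at all are candidates for dropping.
empty_columns = missing_share[missing_share == 1.0].index.tolist()

print(missing_share)
print("Entirely empty:", empty_columns)
```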

# Next Steps

The only column that looks like it might give us something useful is GIS_NOTES. Let's look more closely at that to see what we can glean.

In [5]:
nps_boundaries['GIS_NOTES']

0      NaN                                                                                                                        
1      Lands - http://landsnet.nps.gov/tractsnet/documents/APCO/Metadata/apco_metadata.xml                                        
2      Lands - http://landsnet.nps.gov/tractsnet/documents/ORPI/Metadata/orpi_metadata.xml                                        
3      Shifted 0.06 miles                                                                                                         
4      Good                                                                                                                       
5      POC for this update:  richard_menicke@nps.gov - http://landsnet.nps.gov/tractsnet/documents/GLAC/Metadata/glac_metadata.xml
6      Lands - http://landsnet.nps.gov/tractsnet/documents/CEBR/Metadata/cebr_metadata.xml                                        
7      Lands                                                                       
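
Many of the notes embed a metadata URL, while others are short free-text remarks. A minimal sketch of separating the two is shown below. The `notes` Series is a hypothetical stand-in built from the values printed above, and the regular expression is an assumption about what counts as a URL here.

```python
import pandas as pd

# Hypothetical stand-in for nps_boundaries['GIS_NOTES'], using values shown above.
notes = pd.Series([
    None,
    "Lands - http://landsnet.nps.gov/tractsnet/documents/APCO/Metadata/apco_metadata.xml",
    "Shifted 0.06 miles",
    "Good",
], name="GIS_NOTES")

# Capture the first http(s) URL in each note; notes without one become NaN.
metadata_url = notes.str.extract(r"(https?://\S+)", expand=False)
has_url = metadata_url.notna()

# Notes that are present but carry no URL are the free-text remarks.
free_text = notes[~has_url].dropna()

print(metadata_url[has_url].tolist())
print(free_text.tolist())
```

Pulling the URLs into their own column would make it easier to check them in bulk instead of clicking through one at a time.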

Clicking on a few of the links leads 

# Conclusion

This was a fruitless endeavor, with one exception: I learned how Jupyter can be used to tell a good story.