## Loading using Pandas

If you are programming in Python 3, you can use a variety of packages to load the CSV data files we have available.  We will demonstrate one of the most popular here: pandas.

In [12]:
import pandas as pd

In [13]:
network = pd.read_csv('/data/PDSI/Network.csv')

In [14]:
network

Unnamed: 0,BusinessYear,StateCode,IssuerId,SourceName,VersionNum,ImportDate,IssuerId2,StateCode2,NetworkName,NetworkId,NetworkURL,RowNumber,MarketCoverage,DentalOnlyPlan
0,2014,AK,21989,HIOS,6,2014-03-19 07:06:49,21989,AK,ODS Premier,AKN001,https://www.modahealth.com/ProviderSearch/face...,13,,
1,2014,AK,38344,HIOS,6,2013-08-28 08:15:53,38344,AK,HeritagePlus,AKN001,https://www.premera.com/wa/visitor/,13,,
2,2014,AK,38536,HIOS,2,2013-08-01 12:48:00,38536,AK,Lincoln Dental Connect,AKN001,http://lfg.go2dental.com/member/dental_search/...,13,,
3,2014,AK,42507,HIOS,3,2013-09-02 11:39:25,42507,AK,DentalGuard Preferred,AKN001,https://www.guardiananytime.com/fpapp/FPWeb/de...,13,,
4,2014,AK,73836,HIOS,6,2014-04-18 11:49:29,73836,AK,Moda Plus AK Regional,AKN001,https://www.modahealth.com/ProviderSearch/face...,13,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
3817,2016,WV,75441,SERFF,2,2015-08-20 12:28:36,75441,WV,"DenteMax, LLC",WVN001,www.dentemax.com,13,SHOP (Small Group),Yes
3818,2016,WV,76526,SERFF,4,2015-08-20 12:28:36,76526,WV,Delta Dental PPO,WVN001,https://www.deltadentalins.com/find-a-dentist,13,SHOP (Small Group),Yes
3819,2016,WV,76526,SERFF,4,2015-08-20 12:28:36,76526,WV,Delta Dental PPO,WVN001,https://www.deltadentalins.com/find-a-dentist,13,Individual,Yes
3820,2016,WV,94562,SERFF,2,2015-08-20 12:28:36,94562,WV,PDP PLUS,WVN001,https://metlocator.metlife.com/metlocator/exec...,13,SHOP (Small Group),Yes


The "read_csv" function accepts many more parameters, but the defaults do a good job for these files.  You can read more here:
https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html
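
As an illustration of a few of those parameters, here is a minimal sketch that reads only some columns, keeps "IssuerId" as text, and stops after two rows. The rows are inlined from the output above (an assumption made so the example runs without access to /data/PDSI); with the real file you would pass the path instead of the `sample` buffer.

```python
import io

import pandas as pd

# A few rows shaped like Network.csv, inlined so the sketch runs anywhere.
sample = io.StringIO(
    "BusinessYear,StateCode,IssuerId,NetworkName\n"
    "2014,AK,21989,ODS Premier\n"
    "2014,AK,38344,HeritagePlus\n"
    "2016,WV,76526,Delta Dental PPO\n"
)

subset = pd.read_csv(
    sample,
    usecols=["BusinessYear", "StateCode", "IssuerId"],  # skip columns we don't need
    dtype={"IssuerId": str},  # treat the identifier as text, not a number
    nrows=2,  # read only the first two data rows
)
print(subset)
```

Limiting columns and rows this way can be handy for a quick look at the larger files before loading them in full.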

## Moving around the filesystem

To show other ways of finding the data files on the DSP, the examples below use Python's "os" module.

In [1]:
import os

In [2]:
os.listdir('/data/PDSI')

['health-insurance-marketplace.zip',
 'Crosswalk2015.csv',
 'raw',
 'ServiceArea.csv',
 'Rate.csv',
 'database.sqlite',
 'BusinessRules.csv',
 'PlanAttributes.csv',
 'BenefitsCostSharing.csv',
 'hashes.txt',
 'Crosswalk2016.csv',
 'Network.csv']
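
Note that os.listdir returns bare names, not full paths, and does not say which entries are directories. A minimal sketch of one way to handle both follows; the helper name `split_entries` is hypothetical, and a throwaway temporary directory stands in for /data/PDSI so the example runs anywhere.

```python
import os
import tempfile


def split_entries(folder):
    """Return (files, dirs) as full paths for the entries directly in folder."""
    files, dirs = [], []
    for name in sorted(os.listdir(folder)):
        full = os.path.join(folder, name)  # listdir gives bare names only
        if os.path.isdir(full):
            dirs.append(full)
        else:
            files.append(full)
    return files, dirs


# Temporary stand-in for /data/PDSI, using a few names from the listing above.
with tempfile.TemporaryDirectory() as folder:
    os.mkdir(os.path.join(folder, 'raw'))
    for name in ('Network.csv', 'Rate.csv'):
        open(os.path.join(folder, name), 'w').close()
    files, dirs = split_entries(folder)
    file_names = [os.path.basename(p) for p in files]
    dir_names = [os.path.basename(p) for p in dirs]

print(file_names, dir_names)
```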

In [15]:
for root, dirs, files in os.walk('/data/PDSI'):
    print('Root:' + root)
    print('Directories:')
    print(dirs)
    print('Files:')
    print(files)
    print('--------------------')

Root:/data/PDSI
Directories:
['raw']
Files:
['health-insurance-marketplace.zip', 'Crosswalk2015.csv', 'ServiceArea.csv', 'Rate.csv', 'database.sqlite', 'BusinessRules.csv', 'PlanAttributes.csv', 'BenefitsCostSharing.csv', 'hashes.txt', 'Crosswalk2016.csv', 'Network.csv']
--------------------
Root:/data/PDSI/raw
Directories:
['2016', '2014', '2015']
Files:
['Business_Rules_PUF.csv', '._Service_Area_PUF.csv', '._Plan_Attributes_PUF_2015-12-08.csv', '00Tree.html', 'Benefits_Cost_Sharing_PUF_2015-12-08.csv', 'Plan_Attributes_PUF_2015-12-08.csv', '._Plan_Crosswalk_PUF_2014-12-22.csv', '.sentinel', 'Network_PUF.csv', '._Plan_Attributes_PUF.csv', 'Rate_PUF.csv', '._Rate_PUF_2015-12-08.csv', 'Plan_ID_Crosswalk_PUF_2015-12-07.CSV', 'Business_Rules_PUF_2015-12-08.csv', '._Plan_Attributes_PUF_2014_2015-03-09.csv', 'Plan_Crosswalk_PUF_2014-12-22.csv', 'Plan_Attributes_PUF.csv', '._Machine_Readable_PUF_2015-12-21.xlsx', '._Network_PUF.csv', '._Network_PUF_2015-12-08.csv', 'Service_Area_PUF.csv', '.
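
The listing above includes names beginning with "." (such as ".sentinel" and the "._" AppleDouble files that macOS creates) and one file with an upper-case ".CSV" extension. When collecting data files from a walk like this, it may help to skip the former and match the extension case-insensitively. A minimal sketch, assuming a hypothetical helper `find_csv_files` and a temporary directory standing in for /data/PDSI:

```python
import os
import tempfile


def find_csv_files(top):
    """Return sorted full paths of CSV files under top, skipping dot-files."""
    matches = []
    for root, dirs, files in os.walk(top):
        for name in files:
            if name.startswith('.'):
                continue  # hidden files, including macOS "._" companions
            if os.path.splitext(name)[1].lower() == '.csv':
                matches.append(os.path.join(root, name))
    return sorted(matches)


# Temporary stand-in for /data/PDSI, using a few names from the listing above.
with tempfile.TemporaryDirectory() as top:
    os.mkdir(os.path.join(top, 'raw'))
    for parts in (
        ('Network.csv',),
        ('hashes.txt',),
        ('raw', 'Network_PUF.csv'),
        ('raw', '._Network_PUF.csv'),
        ('raw', 'Plan_ID_Crosswalk_PUF_2015-12-07.CSV'),
    ):
        open(os.path.join(top, *parts), 'w').close()
    found = [os.path.relpath(path, top) for path in find_csv_files(top)]

print(found)
```

Returning full paths means each result can be passed straight to pd.read_csv without rebuilding the directory prefix.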

In [18]:
for root, dirs, files in os.walk('/data/PDSI'):
    for file in files:
        if os.path.splitext(file)[1] == '.csv':
            print(file)

Crosswalk2015.csv
ServiceArea.csv
Rate.csv
BusinessRules.csv
PlanAttributes.csv
BenefitsCostSharing.csv
Crosswalk2016.csv
Network.csv
Business_Rules_PUF.csv
._Service_Area_PUF.csv
._Plan_Attributes_PUF_2015-12-08.csv
Benefits_Cost_Sharing_PUF_2015-12-08.csv
Plan_Attributes_PUF_2015-12-08.csv
._Plan_Crosswalk_PUF_2014-12-22.csv
Network_PUF.csv
._Plan_Attributes_PUF.csv
Rate_PUF.csv
._Rate_PUF_2015-12-08.csv
Business_Rules_PUF_2015-12-08.csv
._Plan_Attributes_PUF_2014_2015-03-09.csv
Plan_Crosswalk_PUF_2014-12-22.csv
Plan_Attributes_PUF.csv
._Network_PUF.csv
._Network_PUF_2015-12-08.csv
Service_Area_PUF.csv
._Rate_PUF.csv
Benefits_Cost_Sharing_PUF.csv
Plan_Attributes_PUF_2014_2015-03-09.csv
Network_PUF_2015-12-08.csv
._Business_Rules_PUF_Reformat.csv
Rate_PUF_2015-12-08.csv
Business_Rules_PUF_Reformat.csv
._ServiceArea_PUF_2015-12-08.csv
._Business_Rules_PUF_2015-12-08.csv
._Benefits_Cost_Sharing_PUF.csv
._Business_Rules_PUF.csv
ServiceArea_PUF_2015-12-08.csv
._Benefits_Cost_Sharing_PUF_2