# Analysis of BRFSS

### About the Dataset:
- Who?
    - BRFSS (Behavioral Risk Factor Surveillance System) is the nation's premier system of health-related telephone surveys that collect state data about U.S. residents regarding their health-related risk behaviors, chronic health conditions, and use of preventive services.
- What?
    - The objective of the BRFSS is to collect uniform, state-specific data on preventive health practices and risk behaviors that are linked to chronic diseases, injuries, and preventable infectious diseases in the adult population.
    - Factors assessed by the BRFSS include tobacco use, health care coverage, HIV/AIDS knowledge or prevention, physical activity, and fruit and vegetable consumption.
- Where?
    - Data are collected from a random sample of adults (one per household) through a telephone survey.
- When?
    - Established in 1984 with 15 states, BRFSS now collects data in all 50 states as well as the District of Columbia and three U.S. territories. 
    - BRFSS completes more than 400,000 adult interviews each year, making it the largest continuously conducted health survey system in the world.


### Exploration Ideas:



### Import Libraries

In [1]:
# file paths
import os
from pathlib import Path

# data cleaning and exploration
import pandas as pd

# data viz
import matplotlib.pyplot as plt
import seaborn as sns

### Import data

In [2]:
pwd = os.getcwd()

In [4]:
df = pd.read_csv('2015.csv')
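
Because the full file occupies more than 1 GB once loaded, one option is to read only the columns needed for a given question. The sketch below is a minimal illustration, assuming the CSV sits in the working directory; the `load_columns` helper and the column selection are hypothetical and not part of the original notebook.

```python
from pathlib import Path

import pandas as pd


def load_columns(csv_path, columns):
    """Read only the requested columns from a CSV file (hypothetical helper)."""
    csv_path = Path(csv_path)
    if not csv_path.exists():
        raise FileNotFoundError(f"CSV not found: {csv_path}")
    # usecols limits parsing to the named columns, which reduces memory use
    return pd.read_csv(csv_path, usecols=columns)


# Assumed usage with the file read above; the column choice is illustrative only:
# subset = load_columns(Path.cwd() / '2015.csv', ['_STATE', 'GENHLTH', 'SEX'])
```

Building the path with `Path.cwd()` also makes use of the `pathlib` import, and the explicit existence check gives a clearer error than a failed parse.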

### Explore the data

In [4]:
pd.set_option('display.max_columns', None)
df.head()

Unnamed: 0,_STATE,FMONTH,IDATE,IMONTH,IDAY,IYEAR,DISPCODE,SEQNO,_PSU,CTELENUM,PVTRESD1,COLGHOUS,STATERES,CELLFON3,LADULT,NUMADULT,NUMMEN,NUMWOMEN,CTELNUM1,CELLFON2,CADULT,PVTRESD2,CCLGHOUS,CSTATE,LANDLINE,HHADULT,GENHLTH,PHYSHLTH,MENTHLTH,POORHLTH,HLTHPLN1,PERSDOC2,MEDCOST,CHECKUP1,BPHIGH4,BPMEDS,BLOODCHO,CHOLCHK,TOLDHI2,CVDINFR4,CVDCRHD4,CVDSTRK3,ASTHMA3,ASTHNOW,CHCSCNCR,CHCOCNCR,CHCCOPD1,HAVARTH3,ADDEPEV2,CHCKIDNY,DIABETE3,DIABAGE2,SEX,MARITAL,EDUCA,RENTHOM1,NUMHHOL2,NUMPHON2,CPDEMO1,VETERAN3,EMPLOY1,CHILDREN,INCOME2,INTERNET,WEIGHT2,HEIGHT3,PREGNANT,QLACTLM2,USEEQUIP,BLIND,DECIDE,DIFFWALK,DIFFDRES,DIFFALON,SMOKE100,SMOKDAY2,STOPSMK2,LASTSMK2,USENOW3,ALCDAY5,AVEDRNK2,DRNK3GE5,MAXDRNKS,FRUITJU1,FRUIT1,FVBEANS,FVGREEN,FVORANG,VEGETAB1,EXERANY2,EXRACT11,EXEROFT1,EXERHMM1,EXRACT21,EXEROFT2,EXERHMM2,STRENGTH,LMTJOIN3,ARTHDIS2,ARTHSOCL,JOINPAIN,SEATBELT,FLUSHOT6,FLSHTMY2,IMFVPLAC,PNEUVAC3,HIVTST6,HIVTSTD3,WHRTST10,PDIABTST,PREDIAB1,INSULIN,BLDSUGAR,FEETCHK2,DOCTDIAB,CHKHEMO3,FEETCHK,EYEEXAM,DIABEYE,DIABEDU,PAINACT2,QLMENTL2,QLSTRES2,QLHLTH2,CAREGIV1,CRGVREL1,CRGVLNG1,CRGVHRS1,CRGVPRB1,CRGVPERS,CRGVHOUS,CRGVMST2,CRGVEXPT,VIDFCLT2,VIREDIF3,VIPRFVS2,VINOCRE2,VIEYEXM2,VIINSUR2,VICTRCT4,VIGLUMA2,VIMACDG2,CIMEMLOS,CDHOUSE,CDASSIST,CDHELP,CDSOCIAL,CDDISCUS,WTCHSALT,LONGWTCH,DRADVISE,ASTHMAGE,ASATTACK,ASERVIST,ASDRVIST,ASRCHKUP,ASACTLIM,ASYMPTOM,ASNOSLEP,ASTHMED3,ASINHALR,HAREHAB1,STREHAB1,CVDASPRN,ASPUNSAF,RLIVPAIN,RDUCHART,RDUCSTRK,ARTTODAY,ARTHWGT,ARTHEXER,ARTHEDU,TETANUS,HPVADVC2,HPVADSHT,SHINGLE2,HADMAM,HOWLONG,HADPAP2,LASTPAP2,HPVTEST,HPLSTTST,HADHYST2,PROFEXAM,LENGEXAM,BLDSTOOL,LSTBLDS3,HADSIGM3,HADSGCO1,LASTSIG3,PCPSAAD2,PCPSADI1,PCPSARE1,PSATEST1,PSATIME,PCPSARS1,PCPSADE1,PCDMDECN,SCNTMNY1,SCNTMEL1,SCNTPAID,SCNTWRK1,SCNTLPAD,SCNTLWK1,SXORIENT,TRNSGNDR,RCSGENDR,RCSRLTN2,CASTHDX2,CASTHNO2,EMTSUPRT,LSATISFY,ADPLEASR,ADDOWN,ADSLEEP,ADENERGY,ADEAT1,ADFAIL,ADTHINK,ADMOVE,MISTMNT,ADANXEV,QSTVER,QSTLANG,EXACTOT1,EXACTOT2,MSCODE,_STSTR,_STRWT,_RAWRAKE,_WT2RAKE,_CHISPNC,_CRACE1,
_CPRACE,_CLLCPWT,_DUALUSE,_DUALCOR,_LLCPWT,_RFHLTH,_HCVU651,_RFHYPE5,_CHOLCHK,_RFCHOL,_MICHD,_LTASTH1,_CASTHM1,_ASTHMS1,_DRDXAR1,_PRACE1,_MRACE1,_HISPANC,_RACE,_RACEG21,_RACEGR3,_RACE_G1,_AGEG5YR,_AGE65YR,_AGE80,_AGE_G,HTIN4,HTM4,WTKG3,_BMI5,_BMI5CAT,_RFBMI5,_CHLDCNT,_EDUCAG,_INCOMG,_SMOKER3,_RFSMOK3,DRNKANY5,DROCDY3_,_RFBING5,_DRNKWEK,_RFDRHV5,FTJUDA1_,FRUTDA1_,BEANDAY_,GRENDAY_,ORNGDAY_,VEGEDA1_,_MISFRTN,_MISVEGN,_FRTRESP,_VEGRESP,_FRUTSUM,_VEGESUM,_FRTLT1,_VEGLT1,_FRT16,_VEG23,_FRUITEX,_VEGETEX,_TOTINDA,METVL11_,METVL21_,MAXVO2_,FC60_,ACTIN11_,ACTIN21_,PADUR1_,PADUR2_,PAFREQ1_,PAFREQ2_,_MINAC11,_MINAC21,STRFREQ_,PAMISS1_,PAMIN11_,PAMIN21_,PA1MIN_,PAVIG11_,PAVIG21_,PA1VIGM_,_PACAT1,_PAINDX1,_PA150R2,_PA300R2,_PA30021,_PASTRNG,_PAREC1,_PASTAE1,_LMTACT1,_LMTWRK1,_LMTSCL1,_RFSEAT2,_RFSEAT3,_FLSHOT6,_PNEUMO2,_AIDTST3
0,1.0,1.0,b'01292015',b'01',b'29',b'2015',1200.0,2015000000.0,2015000000.0,1.0,1.0,,1.0,2.0,,3.0,1.0,2.0,,,,,,,,,5.0,15.0,18.0,10.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,1.0,1.0,2.0,2.0,1.0,1.0,1.0,2.0,3.0,,2.0,1.0,4.0,1.0,2.0,,1.0,2.0,8.0,88.0,3.0,2.0,280.0,510.0,,1.0,1.0,2.0,2.0,1.0,1.0,1.0,1.0,3.0,,2.0,3.0,888.0,,,,305.0,310.0,320.0,310.0,305.0,101.0,2.0,,,,,,,888.0,1.0,1.0,1.0,6.0,1.0,1.0,112014.0,1.0,1.0,1.0,,,1.0,3.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,,,,,,,,,,,,,,,,,,,10.0,1.0,b'',b'',3.0,11011.0,28.78156,3.0,86.344681,,,,,1.0,0.614125,341.384853,2.0,1.0,2.0,1.0,2.0,2.0,2.0,2.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,9.0,1.0,63.0,5.0,70.0,178.0,12701.0,4018.0,4.0,2.0,1.0,2.0,2.0,3.0,1.0,2.0,5.397605e-79,1.0,5.397605e-79,1.0,17.0,33.0,67.0,33.0,17.0,100.0,5.397605e-79,5.397605e-79,1.0,1.0,50.0,217.0,2.0,1.0,1.0,1.0,5.397605e-79,5.397605e-79,2.0,,,2469.0,423.0,,,,,,,,,5.397605e-79,5.397605e-79,,,,,,,4.0,2.0,3.0,3.0,2.0,2.0,4.0,2.0,1.0,1.0,1.0,1.0,1.0,,,1.0
1,1.0,1.0,b'01202015',b'01',b'20',b'2015',1100.0,2015000000.0,2015000000.0,1.0,1.0,,1.0,2.0,,1.0,5.397605e-79,1.0,,,,,,,,,3.0,88.0,88.0,,2.0,1.0,1.0,4.0,3.0,,1.0,4.0,2.0,2.0,2.0,2.0,2.0,,2.0,2.0,2.0,2.0,2.0,2.0,3.0,,2.0,2.0,6.0,1.0,2.0,,2.0,2.0,3.0,88.0,1.0,1.0,165.0,508.0,,1.0,2.0,1.0,1.0,2.0,2.0,2.0,1.0,1.0,2.0,,3.0,888.0,,,,302.0,305.0,302.0,202.0,202.0,304.0,1.0,64.0,212.0,100.0,69.0,212.0,100.0,888.0,,,,,3.0,2.0,,,2.0,2.0,,,2.0,3.0,,,,,,,,,,,,,,2.0,,,,,,,,1.0,,,,,,,,,,1.0,5.0,5.0,,5.0,2.0,2.0,,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,2.0,,2.0,,,,,,,,,,b'',1.0,2.0,,,2.0,60.0,,,,,,,,,,,,,,,,,,,10.0,1.0,b'',b'',5.0,11011.0,28.78156,1.0,28.78156,,,,,9.0,,108.060903,1.0,2.0,1.0,2.0,1.0,2.0,1.0,1.0,3.0,2.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,7.0,1.0,52.0,4.0,68.0,173.0,7484.0,2509.0,3.0,2.0,1.0,4.0,1.0,1.0,2.0,2.0,5.397605e-79,1.0,5.397605e-79,1.0,7.0,17.0,7.0,29.0,29.0,13.0,5.397605e-79,5.397605e-79,1.0,1.0,24.0,78.0,2.0,2.0,1.0,1.0,5.397605e-79,5.397605e-79,1.0,35.0,5.397605e-79,2876.0,493.0,1.0,5.397605e-79,60.0,60.0,2800.0,2800.0,168.0,5.397605e-79,5.397605e-79,5.397605e-79,168.0,5.397605e-79,168.0,5.397605e-79,5.397605e-79,5.397605e-79,2.0,1.0,1.0,2.0,2.0,2.0,2.0,2.0,3.0,3.0,4.0,2.0,2.0,,,2.0
2,1.0,1.0,b'02012015',b'02',b'01',b'2015',1200.0,2015000000.0,2015000000.0,1.0,1.0,,1.0,2.0,,2.0,1.0,1.0,,,,,,,,,4.0,15.0,88.0,88.0,1.0,2.0,2.0,1.0,3.0,,1.0,1.0,1.0,7.0,2.0,1.0,2.0,,2.0,1.0,2.0,1.0,2.0,2.0,3.0,,2.0,2.0,4.0,1.0,2.0,,1.0,2.0,7.0,88.0,99.0,2.0,158.0,511.0,,2.0,2.0,2.0,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.0,3.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,,,,,,,,,,,,,,,,,,,10.0,1.0,b'',b'',5.0,11011.0,28.78156,2.0,57.56312,,,,,1.0,0.614125,255.264797,2.0,9.0,1.0,1.0,2.0,,1.0,1.0,3.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,11.0,2.0,71.0,6.0,71.0,180.0,7167.0,2204.0,2.0,1.0,1.0,2.0,9.0,9.0,9.0,9.0,900.0,9.0,99900.0,9.0,,,,,,,2.0,4.0,5.397605e-79,5.397605e-79,,,9.0,9.0,1.0,1.0,1.0,1.0,9.0,,,2173.0,373.0,,,,,,,,,,9.0,,,,,,,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,
3,1.0,1.0,b'01142015',b'01',b'14',b'2015',1100.0,2015000000.0,2015000000.0,1.0,1.0,,1.0,2.0,,3.0,1.0,2.0,,,,,,,,,5.0,30.0,30.0,30.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,2.0,,2.0,1.0,2.0,1.0,1.0,2.0,3.0,,2.0,1.0,4.0,1.0,2.0,,1.0,2.0,8.0,1.0,8.0,2.0,180.0,507.0,,1.0,2.0,1.0,1.0,1.0,2.0,1.0,2.0,,,,3.0,888.0,,,,555.0,101.0,555.0,301.0,301.0,201.0,2.0,,,,,,,888.0,1.0,1.0,1.0,8.0,1.0,1.0,777777.0,5.0,1.0,9.0,,,2.0,3.0,,,,,,,,,,,,,,2.0,,,,,,,,2.0,,,,,,,,,,1.0,1.0,1.0,2.0,1.0,1.0,2.0,,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,2.0,,1.0,2.0,1.0,,,,,,,,b'',4.0,7.0,,,,97.0,,,,,,,,,,,,,,,,,,,10.0,1.0,b'',b'',3.0,11011.0,28.78156,3.0,86.344681,,,,,1.0,0.614125,341.384853,2.0,1.0,2.0,1.0,2.0,2.0,1.0,1.0,3.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,9.0,1.0,63.0,5.0,67.0,170.0,8165.0,2819.0,3.0,2.0,2.0,2.0,5.0,4.0,1.0,2.0,5.397605e-79,1.0,5.397605e-79,1.0,5.397605e-79,100.0,5.397605e-79,3.0,3.0,14.0,5.397605e-79,5.397605e-79,1.0,1.0,100.0,20.0,1.0,2.0,1.0,1.0,5.397605e-79,5.397605e-79,2.0,,,2469.0,423.0,,,,,,,,,5.397605e-79,5.397605e-79,,,,,,,4.0,2.0,3.0,3.0,2.0,2.0,4.0,2.0,1.0,1.0,1.0,1.0,1.0,,,9.0
4,1.0,1.0,b'01142015',b'01',b'14',b'2015',1100.0,2015000000.0,2015000000.0,1.0,1.0,,1.0,2.0,,2.0,1.0,1.0,,,,,,,,,5.0,20.0,88.0,30.0,1.0,1.0,2.0,1.0,3.0,,1.0,1.0,2.0,2.0,2.0,2.0,2.0,,2.0,2.0,2.0,1.0,2.0,2.0,3.0,,2.0,1.0,5.0,1.0,2.0,,2.0,2.0,8.0,88.0,77.0,1.0,142.0,504.0,,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,,,,3.0,888.0,,,,777.0,102.0,203.0,204.0,310.0,320.0,2.0,,,,,,,888.0,1.0,1.0,1.0,7.0,1.0,2.0,,,1.0,1.0,777777.0,1.0,1.0,3.0,,,,,,,,,,,,,,2.0,,,,,,,,7.0,,,,,,,,,,2.0,,,,,,1.0,777.0,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,2.0,,1.0,2.0,5.0,,,,,,,,b'',5.0,5.0,,,,45.0,,,,,,,,,,,,,,,,,,,10.0,1.0,b'',b'',3.0,11011.0,28.78156,2.0,57.56312,,,,,9.0,,258.682223,2.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,3.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,9.0,1.0,61.0,5.0,64.0,163.0,6441.0,2437.0,2.0,1.0,1.0,3.0,9.0,4.0,1.0,2.0,5.397605e-79,1.0,5.397605e-79,1.0,,200.0,43.0,57.0,33.0,67.0,1.0,5.397605e-79,5.397605e-79,1.0,,200.0,9.0,1.0,1.0,1.0,1.0,5.397605e-79,2.0,,,2543.0,436.0,,,,,,,,,5.397605e-79,5.397605e-79,,,,,,,4.0,2.0,3.0,3.0,2.0,2.0,4.0,2.0,1.0,1.0,1.0,1.0,1.0,,,1.0


In [5]:
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 441456 entries, 0 to 441455
Columns: 330 entries, _STATE to _AIDTST3
dtypes: float64(323), object(7)
memory usage: 1.1+ GB


In [6]:
df.shape

(441456, 330)

In [12]:
df.describe()

Unnamed: 0,FMONTH,DISPCODE,SEQNO,_PSU,CTELENUM,PVTRESD1,COLGHOUS,STATERES,CELLFON3,LADULT,NUMADULT,NUMMEN,NUMWOMEN,CTELNUM1,CELLFON2,CADULT,PVTRESD2,CCLGHOUS,CSTATE,LANDLINE,HHADULT,GENHLTH,PHYSHLTH,MENTHLTH,POORHLTH,HLTHPLN1,PERSDOC2,MEDCOST,CHECKUP1,BPHIGH4,BPMEDS,BLOODCHO,CHOLCHK,TOLDHI2,CVDINFR4,CVDCRHD4,CVDSTRK3,ASTHMA3,ASTHNOW,CHCSCNCR,CHCOCNCR,CHCCOPD1,HAVARTH3,ADDEPEV2,CHCKIDNY,DIABETE3,DIABAGE2,SEX,MARITAL,EDUCA,RENTHOM1,NUMHHOL2,NUMPHON2,CPDEMO1,VETERAN3,EMPLOY1,CHILDREN,INCOME2,INTERNET,WEIGHT2,HEIGHT3,PREGNANT,QLACTLM2,USEEQUIP,BLIND,DECIDE,DIFFWALK,DIFFDRES,DIFFALON,SMOKE100,SMOKDAY2,STOPSMK2,LASTSMK2,USENOW3,ALCDAY5,AVEDRNK2,DRNK3GE5,MAXDRNKS,FRUITJU1,FRUIT1,FVBEANS,FVGREEN,FVORANG,VEGETAB1,EXERANY2,EXRACT11,EXEROFT1,EXERHMM1,EXRACT21,EXEROFT2,EXERHMM2,STRENGTH,LMTJOIN3,ARTHDIS2,ARTHSOCL,JOINPAIN,SEATBELT,FLUSHOT6,FLSHTMY2,IMFVPLAC,PNEUVAC3,HIVTST6,HIVTSTD3,WHRTST10,PDIABTST,PREDIAB1,INSULIN,BLDSUGAR,FEETCHK2,DOCTDIAB,CHKHEMO3,FEETCHK,EYEEXAM,DIABEYE,DIABEDU,PAINACT2,QLMENTL2,QLSTRES2,QLHLTH2,CAREGIV1,CRGVREL1,CRGVLNG1,CRGVHRS1,CRGVPRB1,CRGVPERS,CRGVHOUS,CRGVMST2,CRGVEXPT,VIDFCLT2,VIREDIF3,VIPRFVS2,VINOCRE2,VIEYEXM2,VIINSUR2,VICTRCT4,VIGLUMA2,VIMACDG2,CIMEMLOS,CDHOUSE,CDASSIST,CDHELP,CDSOCIAL,CDDISCUS,WTCHSALT,LONGWTCH,DRADVISE,ASTHMAGE,ASATTACK,ASERVIST,ASDRVIST,ASRCHKUP,ASACTLIM,ASYMPTOM,ASNOSLEP,ASTHMED3,ASINHALR,HAREHAB1,STREHAB1,CVDASPRN,ASPUNSAF,RLIVPAIN,RDUCHART,RDUCSTRK,ARTTODAY,ARTHWGT,ARTHEXER,ARTHEDU,TETANUS,HPVADVC2,HPVADSHT,SHINGLE2,HADMAM,HOWLONG,HADPAP2,LASTPAP2,HPVTEST,HPLSTTST,HADHYST2,PROFEXAM,LENGEXAM,BLDSTOOL,LSTBLDS3,HADSIGM3,HADSGCO1,LASTSIG3,PCPSAAD2,PCPSADI1,PCPSARE1,PSATEST1,PSATIME,PCPSARS1,PCPSADE1,SCNTMNY1,SCNTMEL1,SCNTPAID,SCNTWRK1,SCNTLPAD,SCNTLWK1,SXORIENT,TRNSGNDR,RCSGENDR,RCSRLTN2,CASTHDX2,CASTHNO2,EMTSUPRT,LSATISFY,ADPLEASR,ADDOWN,ADSLEEP,ADENERGY,ADEAT1,ADFAIL,ADTHINK,ADMOVE,MISTMNT,ADANXEV,QSTVER,QSTLANG,MSCODE,_STSTR,_STRWT,_RAWRAKE,_WT2RAKE,_CHISPNC,_CRACE1,_CPRACE,_CLLCPWT,_DUALUSE,_DUALCOR,_LLCPWT,_RFHLTH,_HCVU65
1,_RFHYPE5,_CHOLCHK,_RFCHOL,_MICHD,_LTASTH1,_CASTHM1,_ASTHMS1,_DRDXAR1,_PRACE1,_MRACE1,_HISPANC,_RACE,_RACEG21,_RACEGR3,_RACE_G1,_AGEG5YR,_AGE65YR,_AGE80,_AGE_G,HTIN4,HTM4,WTKG3,_BMI5,_BMI5CAT,_RFBMI5,_CHLDCNT,_EDUCAG,_INCOMG,_SMOKER3,_RFSMOK3,DRNKANY5,DROCDY3_,_RFBING5,_DRNKWEK,_RFDRHV5,FTJUDA1_,FRUTDA1_,BEANDAY_,GRENDAY_,ORNGDAY_,VEGEDA1_,_MISFRTN,_MISVEGN,_FRTRESP,_VEGRESP,_FRUTSUM,_VEGESUM,_FRTLT1,_VEGLT1,_FRT16,_VEG23,_FRUITEX,_VEGETEX,_TOTINDA,METVL11_,METVL21_,MAXVO2_,FC60_,ACTIN11_,ACTIN21_,PADUR1_,PADUR2_,PAFREQ1_,PAFREQ2_,_MINAC11,_MINAC21,STRFREQ_,PAMISS1_,PAMIN11_,PAMIN21_,PA1MIN_,PAVIG11_,PAVIG21_,PA1VIGM_,_PACAT1,_PAINDX1,_PA150R2,_PA300R2,_PA30021,_PASTRNG,_PAREC1,_PASTAE1,_LMTACT1,_LMTWRK1,_LMTSCL1,_RFSEAT2,_RFSEAT3,_FLSHOT6,_PNEUMO2,_AIDTST3
count,441456.0,441456.0,441456.0,441456.0,254645.0,254645.0,45.0,254643.0,254646.0,45.0,254620.0,254503.0,254502.0,186811.0,186811.0,186810.0,186811.0,1070.0,186812.0,186067.0,181631.0,441454.0,441455.0,441456.0,226964.0,441456.0,441456.0,441455.0,441455.0,441455.0,178188.0,441456.0,382302.0,382302.0,441456.0,441455.0,441456.0,441456.0,59409.0,441455.0,441456.0,441456.0,441455.0,441456.0,441456.0,441449.0,57253.0,441456.0,441456.0,441456.0,441456.0,254645.0,13250.0,254645.0,441450.0,441456.0,441451.0,438155.0,437146.0,436141.0,435545.0,64952.0,432118.0,431026.0,430302.0,429716.0,429122.0,428728.0,428130.0,427201.0,184193.0,61518.0,122190.0,426566.0,425525.0,210838.0,210420.0,210017.0,413458.0,412306.0,411081.0,410145.0,409341.0,408318.0,406012.0,295778.0,294025.0,293665.0,293280.0,196916.0,196734.0,402736.0,136690.0,136503.0,136294.0,133728.0,400989.0,400075.0,190194.0,191199.0,399513.0,398069.0,113692.0,113518.0,82760.0,82760.0,29424.0,29423.0,29421.0,29418.0,29417.0,29139.0,29417.0,29416.0,29416.0,0.0,0.0,0.0,0.0,108995.0,24020.0,23995.0,23963.0,23932.0,23902.0,23885.0,23822.0,84687.0,3199.0,3188.0,3188.0,1068.0,2529.0,3181.0,3178.0,3178.0,3175.0,116729.0,13256.0,13241.0,4326.0,13218.0,13201.0,41299.0,25462.0,41263.0,1023.0,648.0,313.0,312.0,644.0,644.0,641.0,457.0,639.0,639.0,1011.0,694.0,18798.0,13177.0,5619.0,5619.0,5618.0,19687.0,19675.0,19670.0,19660.0,40587.0,10923.0,1448.0,28059.0,22813.0,18542.0,22802.0,21267.0,22789.0,6907.0,22567.0,5645.0,4933.0,54375.0,19777.0,54343.0,39361.0,39358.0,6627.0,6620.0,6615.0,6615.0,3634.0,3631.0,819.0,69502.0,73360.0,34755.0,34734.0,25705.0,38422.0,166997.0,166907.0,59610.0,59416.0,52305.0,6503.0,20064.0,20052.0,20445.0,20429.0,20419.0,20413.0,20402.0,20395.0,20381.0,20365.0,20350.0,20342.0,441456.0,441434.0,250579.0,441456.0,441430.0,441456.0,441456.0,319635.0,65129.0,65129.0,59590.0,441456.0,275651.0,441456.0,441456.0,441456.0,441456.0,441456.0,382302.0,437514.0,441456.0,441456.0,441456.0,438657.0,441456.0,441456.0,441456
.0,441456.0,441456.0,441456.0,434019.0,441456.0,441456.0,441456.0,441456.0,424196.0,426024.0,410535.0,405058.0,405058.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,402921.0,404874.0,401509.0,403507.0,402198.0,399993.0,441456.0,441456.0,441456.0,441456.0,397745.0,390339.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,294564.0,289614.0,441456.0,441456.0,291434.0,286584.0,287716.0,192327.0,291165.0,194696.0,285917.0,283654.0,397443.0,441456.0,282976.0,280742.0,289037.0,287778.0,283415.0,290486.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,441456.0,438657.0,438657.0,438657.0,441456.0,441456.0,157954.0,157954.0,398069.0
mean,6.359676,1115.040457,2015005000.0,2015005000.0,1.0,1.000177,1.0,1.0,1.545133,1.6,1.792675,0.8011222,0.9911671,1.0,1.000005,1.504202,1.005733,1.0,1.066548,1.632826,2.421233,2.57879,60.655113,64.679178,55.768959,1.101201,1.395213,1.916066,1.574185,2.209743,1.173328,1.242296,1.449354,1.630876,1.968411,1.986952,1.97388,1.883046,1.480567,1.920329,1.914746,1.947197,1.697419,1.837522,1.98331,2.757888,54.59548,1.576542,2.263653,4.920094,1.375942,1.972927,1.731547,1.228149,1.881477,3.942769,65.906651,20.253013,1.233819,733.204388,742.183226,2.009592,1.795577,1.899027,1.974155,1.940891,1.8566,1.973701,1.948221,1.613987,2.439034,1.450242,6.748989,2.977549,536.034468,3.493758,67.783191,5.611398,367.928353,213.120153,314.889518,259.775789,302.00825,227.156175,1.311873,52.432544,142.868525,107.50592,62.681628,147.06257,124.239298,603.090347,1.576524,1.862648,2.441678,6.217382,1.279299,1.570316,137103.813638,4.128395,2.069172,1.912065,389070.908155,4.62265,1.621943,2.798985,1.681756,254.959046,271.190136,14.943266,14.592583,24.261951,2.411055,1.886728,1.467025,,,,,1.808734,7.600999,3.361034,2.212703,12.696097,1.531797,1.260917,5.496978,2.220872,1.371991,1.72271,2.40276,6.162921,2.843021,1.595725,2.419761,1.954688,2.001575,1.94184,3.965299,4.051507,2.205502,3.980859,1.603136,1.413811,425.897573,1.72891,63.151515,1.601852,59.929712,54.333333,34.61646,684.737578,4.25897,5.448578,4.361502,4.982786,1.692384,1.691643,1.709012,2.848828,1.818117,1.303079,1.716091,2.242495,1.677865,1.493696,1.897202,3.435189,2.353474,14.020718,1.804341,1.207382,1.854439,1.106175,2.603893,3.142437,2.559867,1.747862,1.145616,1.873302,1.68366,3.263134,1.305357,2.066081,3.259896,1.637543,1.969637,1.693575,1.682993,1.910017,1.70201,2.335775,4.119104,4.327781,1.901712,43.773738,1.876172,47.470616,1.255555,4.052406,1.839406,1.712232,2.188395,1.4664,1.822468,1.6729,63.36762,66.901415,51.243058,39.255817,59.751397,74.080314,76.00785,78.983698,1.879066,1.873513,14.869065,1.037093,2.542045,301167.291524,104.0993
6,1.440656,121.43604,7.612042,7.236776,7.236776,803.322235,4.164032,0.620023,569.359433,1.208657,3.953148,1.42841,1.533609,1.492956,1.911699,1.161631,1.150448,2.823792,1.662524,3.123485,3.207264,1.984499,2.021758,1.356658,1.681386,1.455114,7.803623,1.36989,55.409943,4.444382,66.725177,169.461051,8092.974409,2804.2424,2.938461,2.264253,1.565964,2.956399,4.677848,3.549056,1.465507,1.850526,59.38831,1.578753,6069.624,1.516312,35.70141,100.4784,28.01763,56.65433,29.59936,80.3034,0.1701574,0.3593042,0.9009845,0.8842082,136.242,194.73,2.131746,2.109316,0.9997916,0.9998935,0.09943233,0.1160048,1.931871,41.71433,28.84516,4003.579983,1686.848585,1.281223,0.8733077,63.549316,71.704706,4027.24426,3134.333674,229.8979,130.0678,1200.005,0.8228838,313.8627,181.7386,483.804,82.47403,50.97478,131.4391,3.257464,2.380122,2.63392,2.842512,2.588713,2.449574,3.494124,2.742695,2.716879,2.815149,3.652717,1.824624,1.887028,2.290705,2.412259,1.970156
std,3.487131,35.746794,4113.443,4113.443,0.0,0.013292,0.0,0.0,0.49796,0.495434,0.798369,0.6203935,0.525192,0.0,0.002314,0.499984,0.0755,0.0,0.249239,0.556716,5.114021,1.117585,37.055684,35.843085,38.070079,0.512261,0.833429,0.415414,1.249199,1.039022,0.454448,0.916399,1.036135,0.740235,0.439678,0.534279,0.348689,0.462582,1.088185,0.409891,0.403817,0.470313,0.644214,0.563087,0.368072,0.723319,18.323767,0.494107,1.687844,1.076198,0.826611,0.465485,1.21505,0.666882,0.448051,2.871768,37.708546,31.853507,0.575152,2197.377381,1380.498142,0.578478,0.679098,0.463273,0.441517,0.567066,0.579838,0.395444,0.492597,0.74653,0.88636,0.605194,6.286721,0.49357,355.880425,10.559959,35.870235,13.981529,192.655278,142.538351,150.635581,139.519553,144.274424,133.79342,0.72629,23.595075,84.702429,149.455253,28.87149,88.353883,166.956693,365.254201,0.837149,1.149088,0.944234,11.37403,0.95332,0.781987,164327.138727,5.358713,1.668282,1.171706,355697.300837,11.005455,1.284443,0.679272,0.530738,273.44847,285.193696,28.447626,28.852911,36.519153,1.467019,0.719051,0.631595,,,,,0.597056,8.704655,1.540557,1.724358,17.401742,0.727373,0.674808,1.357497,1.416948,0.873366,1.076444,1.072059,11.136945,1.337548,1.066957,0.858962,0.39571,0.597989,0.605133,1.359989,1.277119,1.294748,1.407246,0.735559,0.669083,122.001949,0.641903,33.11472,0.86605,40.462568,42.10931,41.621644,361.84152,2.757096,2.826504,2.836035,3.147575,0.918097,0.820034,0.507867,0.69306,0.553978,1.005756,1.573218,1.001937,0.691748,0.81824,0.529089,1.597658,1.572057,27.105943,0.820861,0.556861,1.368671,0.594699,1.722266,2.501041,1.839726,0.625868,0.508312,1.39776,0.71711,1.77497,0.63946,0.753669,1.659712,1.280079,1.246654,1.256966,1.293504,1.498102,1.378462,1.15787,1.39192,1.179059,1.076386,16.595123,1.251996,22.828288,1.242508,0.550096,1.668272,1.782689,1.473973,0.990606,1.251896,0.891544,37.495981,35.969873,40.08539,39.790658,38.763235,30.675864,28.778209,25.335958,0.540619,0.591034,5.025347,0.189097,1.724403,160353.00928,159.827141,0.70
2771,166.849419,2.837936,22.193894,22.193894,1337.20394,3.766407,0.201628,992.909207,0.5686,3.799186,0.646749,1.555462,0.878391,0.283733,0.569999,0.73938,0.800331,0.472849,12.578508,12.586625,0.734796,2.273676,1.082987,1.527553,0.954026,3.495609,0.507195,17.041646,1.552282,4.129768,10.565036,2161.075422,665.463433,0.826482,2.06957,1.200828,1.049511,2.415903,1.464253,1.59088,1.640494,192.7845,1.86886,23357.35,1.87458,68.90953,111.9253,46.10646,65.96217,46.05388,79.45848,0.5324361,1.085588,0.2986832,0.3199755,137.9642,155.65,2.322882,2.522517,0.01443462,0.01031769,0.2999376,0.3205633,2.209728,16.32416,26.47941,10638.857874,10864.536907,0.550234,0.8151893,65.185353,77.34859,2886.015411,3187.035503,361.447,358.2795,2261.04,2.518358,524.4292,479.8233,744.4642,235.3093,190.0639,316.5299,2.469186,2.496315,2.495601,2.458804,2.48096,2.220698,2.398322,2.449676,1.324145,1.356101,1.26673,2.360812,2.351387,2.518086,2.778032,1.441119
min,1.0,1100.0,2015000000.0,2015000000.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,5.397605e-79,5.397605e-79,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,50.0,200.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,101.0,1.0,1.0,1.0,101.0,101.0,101.0,101.0,101.0,101.0,1.0,1.0,101.0,1.0,1.0,101.0,1.0,101.0,1.0,1.0,1.0,5.397605e-79,1.0,1.0,12014.0,1.0,1.0,1.0,11985.0,1.0,1.0,1.0,1.0,101.0,101.0,1.0,1.0,1.0,1.0,1.0,1.0,,,,,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,101.0,1.0,11.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,10.0,1.0,1.0,11011.0,0.945246,0.333333,0.362773,1.0,1.0,1.0,7.81393,1.0,0.079438,1.18075,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,18.0,1.0,36.0,91.0,2268.0,1202.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,5.397605e-79,1.0,5.397605e-79,1.0,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,1.0,1.0,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,1.0,5.397605e-79,5.397605e-79,555.0,95.0,5.397605e-79,5.397605e-79,1.0,1.0,233.0,233.0,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,5.397605e-79,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0
25%,3.0,1100.0,2015002000.0,2015002000.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,5.397605e-79,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,2.0,15.0,28.0,10.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,2.0,1.0,2.0,2.0,2.0,1.0,2.0,2.0,3.0,44.0,1.0,1.0,4.0,1.0,2.0,1.0,1.0,2.0,1.0,4.0,5.0,1.0,149.0,504.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,1.0,2.0,1.0,6.0,3.0,202.0,1.0,77.0,1.0,202.0,102.0,202.0,201.0,202.0,102.0,1.0,37.0,103.0,30.0,48.0,102.0,30.0,107.0,1.0,1.0,2.0,3.0,1.0,1.0,102014.0,1.0,1.0,1.0,62008.0,1.0,1.0,3.0,1.0,101.0,101.0,2.0,2.0,2.0,2.0,2.0,1.0,,,,,2.0,2.0,2.0,1.0,5.0,1.0,1.0,6.0,2.0,1.0,1.0,2.0,3.0,2.0,1.0,2.0,2.0,2.0,2.0,3.0,3.0,1.0,3.0,1.0,1.0,402.0,1.0,30.0,1.0,3.0,3.0,1.0,777.0,2.0,2.0,2.0,1.0,1.0,1.0,1.0,3.0,2.0,1.0,1.0,2.0,1.0,1.0,2.0,3.0,2.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,3.0,4.0,1.0,38.0,1.0,40.0,1.0,4.0,1.0,1.0,2.0,1.0,1.0,1.0,14.0,14.0,7.0,4.0,10.0,88.0,88.0,88.0,2.0,2.0,10.0,1.0,1.0,191011.0,17.829576,1.0,25.252506,9.0,1.0,1.0,136.617167,1.0,0.464705,92.210103,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,3.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,5.0,1.0,43.0,3.0,64.0,163.0,6577.0,2373.0,2.0,1.0,1.0,2.0,3.0,3.0,1.0,1.0,5.397605e-79,1.0,5.397605e-79,1.0,5.397605e-79,33.0,7.0,14.0,7.0,33.0,5.397605e-79,5.397605e-79,1.0,1.0,57.0,110.0,1.0,1.0,1.0,1.0,5.397605e-79,5.397605e-79,1.0,35.0,5.397605e-79,2247.0,385.0,1.0,5.397605e-79,30.0,30.0,2000.0,1000.0,70.0,5.397605e-79,5.397605e-79,5.397605e-79,90.0,5.397605e-79,120.0,5.397605e-79,5.397605e-79,5.397605e-79,1.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,2.0,2.0,3.0,1.0,1.0,1.0,1.0,1.0
50%,6.0,1100.0,2015004000.0,2015004000.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,2.0,2.0,2.0,88.0,88.0,88.0,1.0,1.0,2.0,1.0,3.0,1.0,1.0,1.0,2.0,2.0,2.0,2.0,2.0,1.0,2.0,2.0,2.0,2.0,2.0,2.0,3.0,54.0,2.0,1.0,5.0,1.0,2.0,1.0,1.0,2.0,3.0,88.0,7.0,1.0,175.0,507.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,3.0,1.0,7.0,3.0,777.0,2.0,88.0,2.0,308.0,202.0,302.0,205.0,302.0,203.0,1.0,64.0,105.0,45.0,67.0,105.0,45.0,888.0,2.0,2.0,3.0,5.0,1.0,2.0,102014.0,4.0,2.0,2.0,112014.0,3.0,1.0,3.0,2.0,103.0,101.0,4.0,3.0,3.0,2.0,2.0,1.0,,,,,2.0,6.0,4.0,1.0,11.0,1.0,1.0,6.0,2.0,1.0,1.0,2.0,5.0,2.0,1.0,3.0,2.0,2.0,2.0,4.0,5.0,2.0,5.0,2.0,1.0,407.0,2.0,63.0,2.0,88.0,88.0,4.0,888.0,3.0,7.0,3.0,6.0,2.0,2.0,2.0,3.0,2.0,1.0,1.0,2.0,2.0,1.0,2.0,4.0,2.0,3.0,2.0,1.0,1.0,1.0,2.0,2.0,2.0,2.0,1.0,1.0,2.0,3.0,1.0,2.0,3.0,1.0,2.0,1.0,1.0,1.0,1.0,2.0,5.0,5.0,2.0,40.0,2.0,40.0,1.0,4.0,2.0,1.0,2.0,1.0,1.0,2.0,88.0,88.0,88.0,14.0,88.0,88.0,88.0,88.0,2.0,2.0,12.0,1.0,2.0,292039.0,44.823976,1.0,61.625519,9.0,1.0,1.0,329.442517,2.0,0.669767,232.385227,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,3.0,2.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,8.0,1.0,58.0,5.0,66.0,168.0,7756.0,2695.0,3.0,2.0,1.0,3.0,5.0,4.0,1.0,2.0,3.0,1.0,23.0,1.0,7.0,100.0,14.0,43.0,17.0,71.0,5.397605e-79,5.397605e-79,1.0,1.0,100.0,169.0,1.0,1.0,1.0,1.0,5.397605e-79,5.397605e-79,1.0,35.0,35.0,2728.0,468.0,1.0,1.0,45.0,45.0,3500.0,2333.0,140.0,32.0,5.397605e-79,5.397605e-79,180.0,46.0,275.0,5.397605e-79,5.397605e-79,5.397605e-79,3.0,2.0,2.0,2.0,2.0,2.0,3.0,2.0,3.0,3.0,4.0,1.0,1.0,1.0,1.0,2.0
75%,9.0,1100.0,2015007000.0,2015007000.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,1.0,2.0,2.0,3.0,88.0,88.0,88.0,1.0,1.0,2.0,2.0,3.0,1.0,1.0,1.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,3.0,64.0,2.0,3.0,6.0,2.0,2.0,2.0,1.0,2.0,7.0,88.0,8.0,1.0,210.0,510.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,3.0,2.0,7.0,3.0,888.0,2.0,88.0,4.0,555.0,305.0,314.0,310.0,310.0,310.0,2.0,64.0,204.0,100.0,88.0,204.0,130.0,888.0,2.0,2.0,3.0,7.0,1.0,2.0,112014.0,5.0,2.0,2.0,772013.0,4.0,2.0,3.0,2.0,204.0,301.0,5.0,4.0,76.0,3.0,2.0,2.0,,,,,2.0,12.0,5.0,3.0,13.0,2.0,1.0,6.0,2.0,1.0,2.0,3.0,7.0,4.0,2.0,3.0,2.0,2.0,2.0,5.0,5.0,3.0,5.0,2.0,2.0,420.0,2.0,97.0,2.0,88.0,88.0,88.0,888.0,8.0,8.0,8.0,8.0,2.0,2.0,2.0,3.0,2.0,1.0,2.0,3.0,2.0,2.0,2.0,4.0,2.0,3.0,2.0,1.0,2.0,1.0,4.0,7.0,4.0,2.0,1.0,2.0,2.0,5.0,2.0,2.0,5.0,2.0,2.0,2.0,2.0,2.0,2.0,3.0,5.0,5.0,2.0,50.0,2.0,50.0,1.0,4.0,2.0,1.0,2.0,2.0,2.0,2.0,88.0,88.0,88.0,88.0,88.0,88.0,88.0,88.0,2.0,2.0,20.0,1.0,5.0,441031.0,97.314988,2.0,136.543634,9.0,2.0,2.0,916.080212,9.0,0.766052,621.157499,1.0,9.0,2.0,1.0,2.0,2.0,1.0,1.0,3.0,2.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,10.0,2.0,69.0,6.0,70.0,178.0,9072.0,3090.0,4.0,2.0,2.0,4.0,5.0,4.0,1.0,2.0,27.0,1.0,350.0,1.0,50.0,100.0,33.0,83.0,43.0,100.0,5.397605e-79,5.397605e-79,1.0,1.0,200.0,243.0,2.0,2.0,1.0,1.0,5.397605e-79,5.397605e-79,2.0,50.0,50.0,3415.0,585.0,2.0,2.0,60.0,90.0,5833.0,4000.0,252.0,120.0,2000.0,5.397605e-79,360.0,180.0,546.0,84.0,17.0,140.0,4.0,2.0,3.0,3.0,2.0,2.0,4.0,2.0,3.0,3.0,4.0,1.0,1.0,2.0,2.0,2.0
max,12.0,1200.0,2015023000.0,2015023000.0,1.0,2.0,1.0,1.0,2.0,2.0,20.0,18.0,10.0,1.0,2.0,2.0,2.0,1.0,2.0,9.0,99.0,9.0,99.0,99.0,99.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,99.0,2.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,99.0,99.0,9.0,9999.0,9999.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,99.0,9.0,999.0,99.0,99.0,99.0,999.0,999.0,999.0,999.0,999.0,999.0,9.0,99.0,999.0,999.0,99.0,999.0,999.0,999.0,9.0,9.0,9.0,99.0,9.0,9.0,999999.0,99.0,9.0,9.0,999999.0,99.0,9.0,9.0,9.0,999.0,999.0,99.0,99.0,99.0,9.0,9.0,9.0,,,,,9.0,99.0,9.0,9.0,99.0,9.0,9.0,9.0,9.0,8.0,7.0,9.0,99.0,9.0,9.0,7.0,7.0,7.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,999.0,9.0,99.0,7.0,98.0,98.0,99.0,999.0,9.0,8.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,99.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,99.0,9.0,99.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,99.0,99.0,99.0,99.0,99.0,99.0,99.0,99.0,9.0,9.0,23.0,3.0,5.0,722019.0,1603.286561,5.0,3191.946865,9.0,99.0,99.0,25755.013877,9.0,0.920562,36700.684291,9.0,9.0,9.0,9.0,9.0,2.0,9.0,9.0,9.0,2.0,99.0,99.0,9.0,9.0,9.0,9.0,5.0,14.0,3.0,80.0,6.0,95.0,241.0,28985.0,9995.0,4.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,900.0,9.0,99900.0,9.0,9900.0,9900.0,9900.0,9900.0,9900.0,9900.0,2.0,4.0,1.0,1.0,15000.0,19929.0,9.0,9.0,1.0,1.0,2.0,2.0,9.0,128.0,128.0,99900.0,99900.0,2.0,2.0,599.0,599.0,99000.0,99000.0,53460.0,47520.0,99000.0,9.0,53460.0,47520.0,54000.0,14400.0,19200.0,19200.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0,9.0
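
Several of the summary statistics above reflect coded non-answers rather than real measurements: for example, the median of PHYSHLTH is 88 and its mean is about 60.7, even though the question counts days within a 30-day window. A minimal sketch of recoding such values before summarizing follows. It assumes the 2015 codebook conventions for day-count questions (88 = none, 77 = don't know, 99 = refused), which should be verified against the codebook for each variable; the `recode_day_count` helper is hypothetical.

```python
import numpy as np
import pandas as pd


def recode_day_count(series):
    """Recode a BRFSS day-count column (hypothetical helper).

    Assumed coding, to be checked against the codebook:
    88 = none (zero days), 77 = don't know, 99 = refused.
    """
    return series.replace({88: 0, 77: np.nan, 99: np.nan})


# Toy values standing in for df['PHYSHLTH']
toy = pd.Series([15.0, 88.0, 30.0, 77.0, 99.0])
cleaned = recode_day_count(toy)
print(cleaned.mean())  # mean over the three valid answers: 15, 0, 30
```

Other columns use different widths for the same idea (for example 888 or 999 in three-digit fields), so a single mapping should not be applied across the whole frame.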


In [13]:
pd.set_option('display.max_rows', 500)
df.isna().sum()

STATE            0
FMONTH           0
IDATE            0
IMONTH           0
IDAY             0
IYEAR            0
DISPCODE         0
SEQNO            0
_PSU             0
CTELENUM    186811
PVTRESD1    186811
COLGHOUS    441411
STATERES    186813
CELLFON3    186810
LADULT      441411
NUMADULT    186836
NUMMEN      186953
NUMWOMEN    186954
CTELNUM1    254645
CELLFON2    254645
CADULT      254646
PVTRESD2    254645
CCLGHOUS    440386
CSTATE      254644
LANDLINE    255389
HHADULT     259825
GENHLTH          2
PHYSHLTH         1
MENTHLTH         0
POORHLTH    214492
HLTHPLN1         0
PERSDOC2         0
MEDCOST          1
CHECKUP1         1
BPHIGH4          1
BPMEDS      263268
BLOODCHO         0
CHOLCHK      59154
TOLDHI2      59154
CVDINFR4         0
CVDCRHD4         1
CVDSTRK3         0
ASTHMA3          0
ASTHNOW     382047
CHCSCNCR         1
CHCOCNCR         0
CHCCOPD1         0
HAVARTH3         1
ADDEPEV2         0
CHCKIDNY         0
DIABETE3         7
DIABAGE2    384203
SEX         
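
Raw counts are hard to compare across 330 columns, so it can help to express missingness as a fraction of rows and sort it. The following is a small sketch on a toy frame; the `missing_share` helper is hypothetical, and the same call would apply to `df`.

```python
import numpy as np
import pandas as pd


def missing_share(frame):
    """Return the fraction of missing values per column, largest first."""
    # isna().mean() averages the boolean mask, giving the share of NaN per column
    return frame.isna().mean().sort_values(ascending=False)


# Toy frame standing in for df
toy = pd.DataFrame({
    'GENHLTH': [5.0, 3.0, 4.0, 5.0],
    'POORHLTH': [10.0, np.nan, 88.0, np.nan],
    'COLGHOUS': [np.nan, np.nan, np.nan, 1.0],
})
shares = missing_share(toy)
print(shares)
```

Columns that are almost entirely empty, as COLGHOUS and LADULT are in the counts above, stand out immediately in this view.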

### Data cleaning

I start with the '_STATE' field: I rename the column to 'STATE' and convert its numeric codes to descriptive two-letter state abbreviations.

In [8]:

df = df.rename(columns={'_STATE': 'STATE'})

In [9]:
# Map numeric FIPS codes in STATE to two-letter postal abbreviations
state_codes = {
    1: 'AL', 2: 'AK', 4: 'AZ', 5: 'AR', 6: 'CA', 8: 'CO', 9: 'CT',
    10: 'DE', 11: 'DC', 12: 'FL', 13: 'GA', 15: 'HI', 16: 'ID', 17: 'IL',
    18: 'IN', 19: 'IA', 20: 'KS', 21: 'KY', 22: 'LA', 23: 'ME', 24: 'MD',
    25: 'MA', 26: 'MI', 27: 'MN', 28: 'MS', 29: 'MO', 30: 'MT', 31: 'NE',
    32: 'NV', 33: 'NH', 34: 'NJ', 35: 'NM', 36: 'NY', 37: 'NC', 38: 'ND',
    39: 'OH', 40: 'OK', 41: 'OR', 42: 'PA', 44: 'RI', 45: 'SC', 46: 'SD',
    47: 'TN', 48: 'TX', 49: 'UT', 50: 'VT', 51: 'VA', 53: 'WA', 54: 'WV',
    55: 'WI', 56: 'WY', 66: 'GU', 72: 'PR',
}
# Assign the result back instead of calling replace(..., inplace=True) on a
# column selection, which newer pandas versions warn about or ignore
df['STATE'] = df['STATE'].replace(state_codes)




In [37]:
# QA the replace()
df.loc[df['STATE'] == 'WI']

Unnamed: 0,STATE,FMONTH,IDATE,IMONTH,IDAY,IYEAR,DISPCODE,SEQNO,_PSU,CTELENUM,PVTRESD1,COLGHOUS,STATERES,CELLFON3,LADULT,NUMADULT,NUMMEN,NUMWOMEN,CTELNUM1,CELLFON2,CADULT,PVTRESD2,CCLGHOUS,CSTATE,LANDLINE,HHADULT,GENHLTH,PHYSHLTH,MENTHLTH,POORHLTH,HLTHPLN1,PERSDOC2,MEDCOST,CHECKUP1,BPHIGH4,BPMEDS,BLOODCHO,CHOLCHK,TOLDHI2,CVDINFR4,CVDCRHD4,CVDSTRK3,ASTHMA3,ASTHNOW,CHCSCNCR,CHCOCNCR,CHCCOPD1,HAVARTH3,ADDEPEV2,CHCKIDNY,DIABETE3,DIABAGE2,SEX,MARITAL,EDUCA,RENTHOM1,NUMHHOL2,NUMPHON2,CPDEMO1,VETERAN3,EMPLOY1,CHILDREN,INCOME2,INTERNET,WEIGHT2,HEIGHT3,PREGNANT,QLACTLM2,USEEQUIP,BLIND,DECIDE,DIFFWALK,DIFFDRES,DIFFALON,SMOKE100,SMOKDAY2,STOPSMK2,LASTSMK2,USENOW3,ALCDAY5,AVEDRNK2,DRNK3GE5,MAXDRNKS,FRUITJU1,FRUIT1,FVBEANS,FVGREEN,FVORANG,VEGETAB1,EXERANY2,EXRACT11,EXEROFT1,EXERHMM1,EXRACT21,EXEROFT2,EXERHMM2,STRENGTH,LMTJOIN3,ARTHDIS2,ARTHSOCL,JOINPAIN,SEATBELT,FLUSHOT6,FLSHTMY2,IMFVPLAC,PNEUVAC3,HIVTST6,HIVTSTD3,WHRTST10,PDIABTST,PREDIAB1,INSULIN,BLDSUGAR,FEETCHK2,DOCTDIAB,CHKHEMO3,FEETCHK,EYEEXAM,DIABEYE,DIABEDU,PAINACT2,QLMENTL2,QLSTRES2,QLHLTH2,CAREGIV1,CRGVREL1,CRGVLNG1,CRGVHRS1,CRGVPRB1,CRGVPERS,CRGVHOUS,CRGVMST2,CRGVEXPT,VIDFCLT2,VIREDIF3,VIPRFVS2,VINOCRE2,VIEYEXM2,VIINSUR2,VICTRCT4,VIGLUMA2,VIMACDG2,CIMEMLOS,CDHOUSE,CDASSIST,CDHELP,CDSOCIAL,CDDISCUS,WTCHSALT,LONGWTCH,DRADVISE,ASTHMAGE,ASATTACK,ASERVIST,ASDRVIST,ASRCHKUP,ASACTLIM,ASYMPTOM,ASNOSLEP,ASTHMED3,ASINHALR,HAREHAB1,STREHAB1,CVDASPRN,ASPUNSAF,RLIVPAIN,RDUCHART,RDUCSTRK,ARTTODAY,ARTHWGT,ARTHEXER,ARTHEDU,TETANUS,HPVADVC2,HPVADSHT,SHINGLE2,HADMAM,HOWLONG,HADPAP2,LASTPAP2,HPVTEST,HPLSTTST,HADHYST2,PROFEXAM,LENGEXAM,BLDSTOOL,LSTBLDS3,HADSIGM3,HADSGCO1,LASTSIG3,PCPSAAD2,PCPSADI1,PCPSARE1,PSATEST1,PSATIME,PCPSARS1,PCPSADE1,PCDMDECN,SCNTMNY1,SCNTMEL1,SCNTPAID,SCNTWRK1,SCNTLPAD,SCNTLWK1,SXORIENT,TRNSGNDR,RCSGENDR,RCSRLTN2,CASTHDX2,CASTHNO2,EMTSUPRT,LSATISFY,ADPLEASR,ADDOWN,ADSLEEP,ADENERGY,ADEAT1,ADFAIL,ADTHINK,ADMOVE,MISTMNT,ADANXEV,QSTVER,QSTLANG,EXACTOT1,EXACTOT2,MSCODE,_STSTR,_STRWT,_RAWRAKE,_WT2RAKE,_CHISPNC,_CRACE1,_CPRACE,_CLLCPWT,_DUALUSE,_DUALCOR,_LLCPWT,_RFHLTH,_HCVU651,_RFHYPE5,_CHOLCHK,_RFCHOL,_MICHD,_LTASTH1,_CASTHM1,_ASTHMS1,_DRDXAR1,_PRACE1,_MRACE1,_HISPANC,_RACE,_RACEG21,_RACEGR3,_RACE_G1,_AGEG5YR,_AGE65YR,_AGE80,_AGE_G,HTIN4,HTM4,WTKG3,_BMI5,_BMI5CAT,_RFBMI5,_CHLDCNT,_EDUCAG,_INCOMG,_SMOKER3,_RFSMOK3,DRNKANY5,DROCDY3_,_RFBING5,_DRNKWEK,_RFDRHV5,FTJUDA1_,FRUTDA1_,BEANDAY_,GRENDAY_,ORNGDAY_,VEGEDA1_,_MISFRTN,_MISVEGN,_FRTRESP,_VEGRESP,_FRUTSUM,_VEGESUM,_FRTLT1,_VEGLT1,_FRT16,_VEG23,_FRUITEX,_VEGETEX,_TOTINDA,METVL11_,METVL21_,MAXVO2_,FC60_,ACTIN11_,ACTIN21_,PADUR1_,PADUR2_,PAFREQ1_,PAFREQ2_,_MINAC11,_MINAC21,STRFREQ_,PAMISS1_,PAMIN11_,PAMIN21_,PA1MIN_,PAVIG11_,PAVIG21_,PA1VIGM_,_PACAT1,_PAINDX1,_PA150R2,_PA300R2,_PA30021,_PASTRNG,_PAREC1,_PASTAE1,_LMTACT1,_LMTWRK1,_LMTSCL1,_RFSEAT2,_RFSEAT3,_FLSHOT6,_PNEUMO2,_AIDTST3
422702,WI,10.0,b'10202015',b'10',b'20',b'2015',1200.0,2.015000e+09,2.015000e+09,,,,,,,,,,1.0,1.0,2.0,1.0,,2.0,2.0,2.0,1.0,88.0,14.0,88.0,2.0,3.0,2.0,4.0,3.0,,1.0,4.0,2.0,2.0,2.0,2.0,1.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,3.0,,2.0,5.0,5.0,2.0,,,,1.0,6.0,88.0,77.0,1.0,150.0,509.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,1.0,3.0,,1.0,3.0,201.0,2.0,88.0,2.0,101.0,101.0,301.0,101.0,202.0,101.0,1.0,64.0,107.0,100.0,37.0,107.0,30.0,101.0,,,,,1.0,1.0,12015.0,2.0,2.0,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,,,,,,,,,,,,,,,,,,,20.0,1.0,b'',b'',,552029.0,228.442872,1.0,228.442872,9.0,,,,9.0,,291.875056,1.0,2.0,1.0,2.0,1.0,2.0,2.0,1.0,2.0,2.0,2.0,2.0,1.0,8.0,2.0,5.0,3.0,2.0,1.0,28.0,2.0,69.0,175.0,6804.0,2215.0,2.0,1.0,1.0,3.0,9.0,3.0,1.0,1.0,3.000000e+00,1.0,4.700000e+01,1.0,1.000000e+02,1.000000e+02,3.000000e+00,1.000000e+02,2.900000e+01,100.0,5.397605e-79,5.397605e-79,1.0,1.000000e+00,2.000000e+02,232.0,1.0,1.0,1.0,1.0,5.397605e-79,5.397605e-79,1.0,35.0,6.000000e+01,3764.0,645.0,1.0,1.000000e+00,60.0,30.0,7000.0,7000.0,420.0,2.100000e+02,1.000000e+03,5.397605e-79,420.0,2.100000e+02,630.0,5.397605e-79,5.397605e-79,5.397605e-79,1.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,3.0,3.0,4.0,1.0,1.0,,,2.0
422703,WI,9.0,b'09232015',b'09',b'23',b'2015',1200.0,2.015000e+09,2.015000e+09,,,,,,,,,,1.0,1.0,1.0,1.0,,2.0,1.0,3.0,1.0,88.0,88.0,,1.0,1.0,2.0,2.0,3.0,,2.0,,,2.0,2.0,2.0,2.0,,2.0,2.0,2.0,2.0,2.0,2.0,3.0,,1.0,5.0,4.0,3.0,,,,2.0,2.0,1.0,8.0,1.0,140.0,509.0,,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,,,,1.0,888.0,,,,302.0,310.0,310.0,555.0,305.0,320.0,1.0,26.0,225.0,500.0,98.0,210.0,100.0,888.0,,,,,5.0,2.0,,,2.0,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,,,,,,,,,,,,,,,,,,,20.0,1.0,b'',b'Motocross',,552039.0,154.971164,1.0,154.971164,9.0,,,,2.0,0.369293,443.343795,1.0,1.0,1.0,3.0,,2.0,1.0,1.0,3.0,2.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,18.0,1.0,69.0,175.0,6350.0,2067.0,2.0,1.0,2.0,2.0,5.0,4.0,1.0,2.0,5.397605e-79,1.0,5.397605e-79,1.0,7.000000e+00,3.300000e+01,3.300000e+01,5.397605e-79,1.700000e+01,67.0,5.397605e-79,5.397605e-79,1.0,1.000000e+00,4.000000e+01,117.0,2.0,1.0,1.0,1.0,5.397605e-79,5.397605e-79,1.0,50.0,4.500000e+01,5010.0,859.0,1.0,1.000000e+00,300.0,60.0,5833.0,2333.0,1750.0,1.400000e+02,5.397605e-79,5.397605e-79,1750.0,1.400000e+02,1890.0,5.397605e-79,5.397605e-79,5.397605e-79,1.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,3.0,3.0,4.0,2.0,2.0,,,2.0
422704,WI,12.0,b'12152015',b'12',b'15',b'2015',1200.0,2.015000e+09,2.015000e+09,,,,,,,,,,1.0,1.0,1.0,1.0,,2.0,1.0,2.0,4.0,2.0,30.0,29.0,1.0,2.0,2.0,4.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,1.0,2.0,,2.0,2.0,1.0,2.0,1.0,2.0,3.0,,1.0,1.0,5.0,1.0,,,,2.0,7.0,88.0,5.0,1.0,198.0,509.0,,1.0,1.0,2.0,2.0,2.0,2.0,2.0,1.0,3.0,,7.0,3.0,107.0,1.0,88.0,2.0,555.0,312.0,304.0,304.0,310.0,305.0,2.0,,,,,,,888.0,,,,,1.0,2.0,,,1.0,1.0,62012.0,3.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,,,,,,,,,,,,,,,,,,,20.0,1.0,b'',b'',,552089.0,403.641402,1.0,403.641402,9.0,,,,2.0,0.369293,930.522599,2.0,9.0,2.0,1.0,2.0,1.0,1.0,1.0,3.0,2.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,11.0,2.0,73.0,6.0,69.0,175.0,8981.0,2924.0,3.0,2.0,1.0,3.0,3.0,3.0,1.0,1.0,1.000000e+02,1.0,7.000000e+02,1.0,5.397605e-79,4.000000e+01,1.300000e+01,1.300000e+01,3.300000e+01,17.0,5.397605e-79,5.397605e-79,1.0,1.000000e+00,4.000000e+01,76.0,2.0,2.0,1.0,1.0,5.397605e-79,5.397605e-79,2.0,,,1985.0,340.0,,,,,,,,,5.397605e-79,5.397605e-79,,,,,,,4.0,2.0,3.0,3.0,2.0,2.0,4.0,2.0,3.0,3.0,4.0,1.0,1.0,2.0,1.0,1.0
422705,WI,12.0,b'12122015',b'12',b'12',b'2015',1200.0,2.015000e+09,2.015000e+09,,,,,,,,,,1.0,1.0,2.0,1.0,,2.0,2.0,2.0,3.0,88.0,7.0,88.0,1.0,3.0,2.0,3.0,3.0,,1.0,3.0,2.0,2.0,2.0,2.0,2.0,,2.0,2.0,2.0,2.0,2.0,2.0,3.0,,2.0,1.0,5.0,2.0,,,,2.0,5.0,3.0,6.0,1.0,185.0,504.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,,,,3.0,204.0,1.0,88.0,2.0,310.0,102.0,101.0,206.0,201.0,206.0,1.0,15.0,105.0,30.0,73.0,120.0,100.0,103.0,,,,,1.0,2.0,,,2.0,1.0,52002.0,8.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,,,,,,,,,,,,,,,,,,,20.0,1.0,b'',b'',,552049.0,268.207197,1.0,268.207197,9.0,,,,9.0,,783.633818,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,3.0,2.0,1.0,1.0,1.0,8.0,2.0,5.0,3.0,4.0,1.0,35.0,3.0,64.0,163.0,8391.0,3175.0,4.0,2.0,4.0,3.0,4.0,4.0,1.0,1.0,1.300000e+01,1.0,9.300000e+01,1.0,3.300000e+01,2.000000e+02,1.000000e+02,8.600000e+01,1.400000e+01,86.0,5.397605e-79,5.397605e-79,1.0,1.000000e+00,2.330000e+02,286.0,1.0,1.0,1.0,1.0,5.397605e-79,5.397605e-79,1.0,50.0,3.300000e+01,3505.0,601.0,1.0,1.000000e+00,30.0,60.0,5000.0,20000.0,150.0,1.200000e+03,3.000000e+03,5.397605e-79,150.0,1.200000e+03,1350.0,5.397605e-79,5.397605e-79,5.397605e-79,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,3.0,3.0,4.0,1.0,1.0,,,1.0
422706,WI,3.0,b'03232015',b'03',b'23',b'2015',1200.0,2.015000e+09,2.015000e+09,,,,,,,,,,1.0,1.0,1.0,1.0,,2.0,2.0,2.0,2.0,5.0,88.0,88.0,1.0,1.0,1.0,1.0,3.0,,1.0,1.0,2.0,2.0,2.0,2.0,1.0,1.0,2.0,2.0,2.0,2.0,2.0,2.0,3.0,,1.0,1.0,4.0,1.0,,,,1.0,7.0,88.0,6.0,2.0,185.0,600.0,,2.0,2.0,2.0,2.0,2.0,2.0,2.0,1.0,3.0,,7.0,3.0,888.0,,,,101.0,102.0,777.0,101.0,320.0,330.0,1.0,64.0,230.0,30.0,67.0,103.0,20.0,888.0,,,,,1.0,1.0,112014.0,9.0,1.0,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,,,,,,,,,,,,,,,,,,,20.0,1.0,b'',b'',,552039.0,154.971164,1.0,154.971164,9.0,,,,9.0,,652.230800,1.0,9.0,1.0,1.0,1.0,2.0,2.0,2.0,1.0,2.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,12.0,2.0,77.0,6.0,72.0,183.0,8391.0,2509.0,3.0,2.0,1.0,2.0,4.0,3.0,1.0,2.0,5.397605e-79,1.0,5.397605e-79,1.0,1.000000e+02,2.000000e+02,,1.000000e+02,6.700000e+01,100.0,5.397605e-79,1.000000e+00,1.0,5.397605e-79,3.000000e+02,,1.0,9.0,1.0,1.0,5.397605e-79,1.000000e+00,1.0,35.0,5.397605e-79,1765.0,303.0,2.0,5.397605e-79,30.0,20.0,7000.0,3000.0,210.0,5.397605e-79,5.397605e-79,5.397605e-79,420.0,5.397605e-79,420.0,2.100000e+02,5.397605e-79,2.100000e+02,1.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,3.0,3.0,4.0,1.0,1.0,1.0,1.0,2.0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
428885,WI,12.0,b'01042016',b'01',b'04',b'2016',1100.0,2.015006e+09,2.015006e+09,,,,,,,,,,1.0,1.0,2.0,1.0,,1.0,2.0,1.0,1.0,88.0,88.0,,1.0,2.0,1.0,1.0,3.0,,1.0,1.0,1.0,2.0,2.0,2.0,2.0,,2.0,2.0,2.0,2.0,2.0,2.0,3.0,,2.0,1.0,6.0,1.0,,,,2.0,1.0,2.0,8.0,1.0,155.0,508.0,,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,,,,3.0,201.0,1.0,88.0,1.0,101.0,103.0,203.0,777.0,101.0,102.0,1.0,6.0,103.0,45.0,64.0,103.0,100.0,107.0,,,,,1.0,1.0,22015.0,1.0,2.0,2.0,,,,,,,,,,,,,,,,,,2.0,,,,,,,,2.0,,,,,,,,,,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,1.0,4.0,9.0,9.0,9.0,,,,,,,,,,,,,,20.0,1.0,b'',b'',,552109.0,556.633421,1.0,556.633421,9.0,99.0,99.0,2780.63052,9.0,,744.010441,1.0,1.0,1.0,1.0,2.0,2.0,1.0,1.0,3.0,2.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,8.0,1.0,57.0,5.0,68.0,173.0,7031.0,2357.0,2.0,1.0,3.0,4.0,5.0,4.0,1.0,1.0,3.000000e+00,1.0,2.300000e+01,1.0,1.000000e+02,3.000000e+02,4.300000e+01,,1.000000e+02,200.0,5.397605e-79,1.000000e+00,1.0,5.397605e-79,4.000000e+02,,1.0,9.0,1.0,1.0,5.397605e-79,1.000000e+00,1.0,68.0,3.500000e+01,2691.0,461.0,2.0,1.000000e+00,45.0,60.0,3000.0,3000.0,135.0,1.800000e+02,7.000000e+03,5.397605e-79,270.0,1.800000e+02,450.0,1.350000e+02,5.397605e-79,1.350000e+02,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,3.0,3.0,4.0,1.0,1.0,,,2.0
428886,WI,12.0,b'12212015',b'12',b'21',b'2015',1100.0,2.015006e+09,2.015006e+09,,,,,,,,,,1.0,1.0,1.0,1.0,,1.0,2.0,1.0,1.0,88.0,88.0,,1.0,1.0,2.0,3.0,3.0,,1.0,3.0,2.0,2.0,2.0,2.0,2.0,,2.0,2.0,2.0,2.0,2.0,2.0,3.0,,1.0,1.0,6.0,1.0,,,,2.0,7.0,88.0,8.0,1.0,200.0,511.0,,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,,,,3.0,888.0,,,,555.0,555.0,555.0,304.0,305.0,305.0,2.0,,,,,,,888.0,,,,,1.0,2.0,,,2.0,2.0,,,,,,,,,,,,,,,,,,1.0,3.0,2.0,1.0,11.0,2.0,2.0,1.0,,,,,,,,,,,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,1.0,4.0,,,,,,,,,,,,,,,,,20.0,1.0,b'',b'',,552109.0,556.633421,1.0,556.633421,9.0,,,,9.0,,684.781603,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,3.0,2.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,8.0,1.0,56.0,5.0,71.0,180.0,9072.0,2789.0,3.0,2.0,1.0,4.0,5.0,4.0,1.0,2.0,5.397605e-79,1.0,5.397605e-79,1.0,5.397605e-79,5.397605e-79,5.397605e-79,1.300000e+01,1.700000e+01,17.0,5.397605e-79,5.397605e-79,1.0,1.000000e+00,5.397605e-79,47.0,2.0,2.0,1.0,1.0,5.397605e-79,5.397605e-79,2.0,,,2920.0,501.0,,,,,,,,,5.397605e-79,5.397605e-79,,,,,,,4.0,2.0,3.0,3.0,2.0,2.0,4.0,2.0,3.0,3.0,4.0,1.0,1.0,,,2.0
428887,WI,4.0,b'04182015',b'04',b'18',b'2015',1200.0,2.015006e+09,2.015006e+09,,,,,,,,,,1.0,1.0,1.0,1.0,,2.0,2.0,3.0,2.0,2.0,2.0,88.0,1.0,1.0,2.0,1.0,3.0,,1.0,1.0,2.0,2.0,2.0,2.0,2.0,,2.0,2.0,2.0,2.0,2.0,2.0,3.0,,1.0,2.0,6.0,2.0,,,,2.0,6.0,1.0,77.0,1.0,165.0,511.0,,2.0,2.0,2.0,2.0,2.0,2.0,2.0,2.0,,,,3.0,207.0,3.0,1.0,5.0,308.0,312.0,304.0,310.0,310.0,315.0,1.0,37.0,104.0,45.0,7.0,202.0,30.0,103.0,,,,,1.0,1.0,102014.0,6.0,2.0,2.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,,,,,,,,,,,,,,,,,,,20.0,1.0,b'',b'',,552069.0,559.835322,1.0,559.835322,9.0,,,,9.0,,1817.228081,1.0,1.0,1.0,1.0,1.0,2.0,1.0,1.0,3.0,2.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,2.0,1.0,27.0,2.0,71.0,180.0,7484.0,2301.0,2.0,1.0,2.0,4.0,9.0,4.0,1.0,1.0,2.300000e+01,2.0,4.900000e+02,1.0,2.700000e+01,4.000000e+01,1.300000e+01,3.300000e+01,3.300000e+01,50.0,5.397605e-79,5.397605e-79,1.0,1.000000e+00,6.700000e+01,129.0,2.0,1.0,1.0,1.0,5.397605e-79,5.397605e-79,1.0,60.0,6.800000e+01,4515.0,774.0,1.0,1.000000e+00,45.0,30.0,4000.0,467.0,180.0,1.400000e+01,3.000000e+03,5.397605e-79,180.0,1.400000e+01,194.0,5.397605e-79,5.397605e-79,5.397605e-79,2.0,1.0,1.0,2.0,2.0,1.0,1.0,1.0,3.0,3.0,4.0,1.0,1.0,,,2.0
428888,WI,8.0,b'09222015',b'09',b'22',b'2015',1200.0,2.015006e+09,2.015006e+09,,,,,,,,,,1.0,1.0,1.0,1.0,,2.0,2.0,1.0,4.0,88.0,2.0,6.0,1.0,1.0,2.0,1.0,1.0,1.0,1.0,1.0,1.0,2.0,2.0,2.0,2.0,,2.0,2.0,2.0,2.0,2.0,2.0,1.0,22.0,1.0,2.0,4.0,2.0,,,,2.0,4.0,88.0,1.0,2.0,160.0,505.0,,2.0,2.0,2.0,2.0,2.0,2.0,2.0,1.0,2.0,1.0,,3.0,888.0,,,,555.0,101.0,101.0,555.0,555.0,203.0,1.0,64.0,102.0,100.0,88.0,,,888.0,,,,,1.0,1.0,32015.0,3.0,2.0,1.0,22014.0,4.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,b'',,,,,,,,,,,,,,,,,,,,,,,,,20.0,2.0,b'',b'',,552069.0,559.835322,1.0,559.835322,9.0,,,,9.0,,2471.455349,2.0,1.0,2.0,1.0,2.0,2.0,1.0,1.0,3.0,2.0,1.0,1.0,1.0,8.0,2.0,5.0,3.0,5.0,1.0,40.0,3.0,65.0,165.0,7257.0,2663.0,3.0,2.0,1.0,2.0,1.0,2.0,2.0,2.0,5.397605e-79,1.0,5.397605e-79,1.0,5.397605e-79,1.000000e+02,1.000000e+02,5.397605e-79,5.397605e-79,43.0,5.397605e-79,5.397605e-79,1.0,1.000000e+00,1.000000e+02,143.0,1.0,1.0,1.0,1.0,5.397605e-79,5.397605e-79,1.0,35.0,5.397605e-79,3800.0,651.0,1.0,5.397605e-79,60.0,,2000.0,,120.0,5.397605e-79,5.397605e-79,5.397605e-79,120.0,5.397605e-79,120.0,5.397605e-79,5.397605e-79,5.397605e-79,3.0,2.0,2.0,2.0,2.0,2.0,4.0,2.0,3.0,3.0,4.0,1.0,1.0,,,1.0


In [10]:
df['STATE'].value_counts()

KS    23236
NE    17561
MN    16761
WA    16116
TX    14697
CO    13537
CA    12601
MD    12598
NY    12357
OH    11929
CT    11899
SC    11607
NJ    11465
UT    11401
FL     9739
MA     9294
ME     9063
MI     8935
KY     8806
VA     8646
AL     7950
AZ     7946
MO     7307
SD     7221
HI     7163
NH     7022
OK     6943
NM     6734
NC     6698
VT     6489
IA     6227
RI     6206
WI     6188
IN     6067
MT     6051
MS     6035
TN     5979
WV     5957
ID     5802
PA     5740
WY     5492
PR     5405
OR     5359
IL     5289
AR     5256
ND     4972
LA     4716
GA     4678
DE     4070
DC     3994
AK     3657
NV     2926
GU     1669
Name: STATE, dtype: int64
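
Filtering on a single state confirms that one code was replaced, but not that every code was. A lighter check is to look for values that are still numeric after the mapping. The sketch below uses a small made-up Series in place of `df['STATE']`; the leftover codes in it are hypothetical and only there to show what a miss would look like.

```python
import pandas as pd

# Toy stand-in for df['STATE'] after the replace step (values are made up).
state = pd.Series(['WI', 'KS', 2.0, 'PR', 78.0], name='STATE')

# Anything that is not a string was not covered by the code-to-abbreviation mapping.
unmapped = state[state.map(lambda v: not isinstance(v, str))]
unmapped_codes = sorted(unmapped.unique())
print(unmapped_codes)
```

On the real data an empty list would mean every code was mapped, which is consistent with the value counts above showing only two-letter labels.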

I am most interested in the state column and the general, physical, mental, and poor health columns (STATE, GENHLTH, PHYSHLTH, MENTHLTH, POORHLTH).

In [28]:
df[['STATE', 'GENHLTH', 'PHYSHLTH', 'MENTHLTH', 'POORHLTH']].groupby('STATE').describe()

Unnamed: 0_level_0,GENHLTH,GENHLTH,GENHLTH,GENHLTH,GENHLTH,GENHLTH,GENHLTH,GENHLTH,PHYSHLTH,PHYSHLTH,PHYSHLTH,PHYSHLTH,PHYSHLTH,PHYSHLTH,PHYSHLTH,PHYSHLTH,MENTHLTH,MENTHLTH,MENTHLTH,MENTHLTH,MENTHLTH,MENTHLTH,MENTHLTH,MENTHLTH,POORHLTH,POORHLTH,POORHLTH,POORHLTH,POORHLTH,POORHLTH,POORHLTH,POORHLTH
Unnamed: 0_level_1,count,mean,std,min,25%,50%,75%,max,count,mean,std,min,25%,50%,75%,max,count,mean,std,min,25%,50%,75%,max,count,mean,std,min,25%,50%,75%,max
STATE,Unnamed: 1_level_2,Unnamed: 2_level_2,Unnamed: 3_level_2,Unnamed: 4_level_2,Unnamed: 5_level_2,Unnamed: 6_level_2,Unnamed: 7_level_2,Unnamed: 8_level_2,Unnamed: 9_level_2,Unnamed: 10_level_2,Unnamed: 11_level_2,Unnamed: 12_level_2,Unnamed: 13_level_2,Unnamed: 14_level_2,Unnamed: 15_level_2,Unnamed: 16_level_2,Unnamed: 17_level_2,Unnamed: 18_level_2,Unnamed: 19_level_2,Unnamed: 20_level_2,Unnamed: 21_level_2,Unnamed: 22_level_2,Unnamed: 23_level_2,Unnamed: 24_level_2,Unnamed: 25_level_2,Unnamed: 26_level_2,Unnamed: 27_level_2,Unnamed: 28_level_2,Unnamed: 29_level_2,Unnamed: 30_level_2,Unnamed: 31_level_2,Unnamed: 32_level_2
AK,3657.0,2.51545,1.093218,1.0,2.0,2.0,3.0,9.0,3657.0,58.374351,38.510515,1.0,10.0,88.0,88.0,99.0,3657.0,62.446267,37.326065,1.0,15.0,88.0,88.0,99.0,2029.0,54.092656,39.195853,1.0,7.0,88.0,88.0,99.0
AL,7950.0,2.810943,1.195288,1.0,2.0,3.0,4.0,9.0,7950.0,59.159245,36.818675,1.0,15.0,88.0,88.0,99.0,7950.0,63.446541,35.766922,1.0,25.0,88.0,88.0,99.0,4323.0,53.784409,37.411976,1.0,14.0,88.0,88.0,99.0
AR,5256.0,2.895548,1.167076,1.0,2.0,3.0,4.0,9.0,5256.0,56.218227,37.413463,1.0,15.0,88.0,88.0,99.0,5256.0,65.582002,34.874587,1.0,30.0,88.0,88.0,99.0,2910.0,53.250172,37.283109,1.0,15.0,88.0,88.0,99.0
AZ,7946.0,2.56003,1.125027,1.0,2.0,2.0,3.0,9.0,7946.0,60.232192,37.090186,1.0,15.0,88.0,88.0,99.0,7946.0,64.897684,35.766218,1.0,30.0,88.0,88.0,99.0,4114.0,55.637822,38.03887,1.0,10.0,88.0,88.0,99.0
CA,12601.0,2.496945,1.106479,1.0,2.0,2.0,3.0,9.0,12601.0,60.307198,37.454065,1.0,15.0,88.0,88.0,99.0,12601.0,60.304023,37.729907,1.0,14.0,88.0,88.0,99.0,6759.0,52.655422,39.027002,1.0,7.0,88.0,88.0,99.0
CO,13536.0,2.445257,1.101949,1.0,2.0,2.0,3.0,9.0,13537.0,61.957745,36.925667,1.0,20.0,88.0,88.0,99.0,13537.0,62.900052,37.028733,1.0,15.0,88.0,88.0,99.0,6999.0,56.584083,38.520532,1.0,10.0,88.0,88.0,99.0
CT,11899.0,2.454408,1.10539,1.0,2.0,2.0,3.0,9.0,11899.0,60.432053,37.358532,1.0,15.0,88.0,88.0,99.0,11899.0,63.106227,36.711202,1.0,15.0,88.0,88.0,99.0,6213.0,55.918719,38.423375,1.0,10.0,88.0,88.0,99.0
DC,3994.0,2.436905,1.098071,1.0,2.0,2.0,3.0,9.0,3994.0,59.714822,37.647016,1.0,14.0,88.0,88.0,99.0,3994.0,66.293691,35.244843,1.0,30.0,88.0,88.0,99.0,2043.0,52.279001,39.01,1.0,7.0,88.0,88.0,99.0
DE,4070.0,2.604423,1.101075,1.0,2.0,3.0,3.0,7.0,4070.0,61.621376,36.580992,1.0,20.0,88.0,88.0,99.0,4070.0,66.296314,34.853746,1.0,30.0,88.0,88.0,99.0,2025.0,57.038025,37.635329,1.0,15.0,88.0,88.0,99.0
FL,9739.0,2.559503,1.153289,1.0,2.0,2.0,3.0,9.0,9739.0,60.001848,37.198081,1.0,15.0,88.0,88.0,99.0,9739.0,65.301674,35.525798,1.0,30.0,88.0,88.0,99.0,5159.0,55.544679,37.901731,1.0,14.0,88.0,88.0,99.0
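
The medians of 88 and maxima of 99 in this table suggest that PHYSHLTH, MENTHLTH, and POORHLTH hold special codes as well as day counts, so the means are not yet interpretable. A minimal sketch of one way to recode them follows. It assumes the usual BRFSS convention that 88 means "none" and 77 and 99 mean "don't know" and "refused"; this notebook does not confirm that, so check the 2015 codebook before relying on it. The toy rows are made up.

```python
import numpy as np
import pandas as pd

# Toy rows standing in for the real survey data (values are made up).
toy = pd.DataFrame({
    'STATE': ['AK', 'AK', 'AL', 'AL'],
    'PHYSHLTH': [88.0, 10.0, 77.0, 30.0],
    'MENTHLTH': [5.0, 88.0, 99.0, 88.0],
})

day_cols = ['PHYSHLTH', 'MENTHLTH']

# Assumed codebook meaning: 88 = zero days, 77 = don't know, 99 = refused.
cleaned = toy.copy()
cleaned[day_cols] = cleaned[day_cols].replace({88: 0, 77: np.nan, 99: np.nan})

# Means now average real day counts and skip the non-responses.
state_means = cleaned.groupby('STATE')[day_cols].mean()
print(state_means)
```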


### General Health widening (multiple column categories)

In [59]:
df['GENHLTH_EXCELLENT'] = df['GENHLTH'] == 1
df['GENHLTH_EXCELLENT'].value_counts() 

False    365424
True      76032
Name: GENHLTH_EXCELLENT, dtype: int64

Gen Health - Excellent

In [60]:
# Convert the boolean flag to 0/1
df['GENHLTH_EXCELLENT'] = df['GENHLTH_EXCELLENT'].astype(int)

In [62]:
df['GENHLTH_EXCELLENT'].value_counts()

0    365424
1     76032
Name: GENHLTH_EXCELLENT, dtype: int64

Gen Health - Very Good

In [63]:
df['GENHLTH_VERYGOOD'] = df['GENHLTH'] == 2
df['GENHLTH_VERYGOOD'].value_counts()

False    296391
True     145065
Name: GENHLTH_VERYGOOD, dtype: int64

In [64]:
# Convert the boolean flag to 0/1
df['GENHLTH_VERYGOOD'] = df['GENHLTH_VERYGOOD'].astype(int)
df['GENHLTH_VERYGOOD'].value_counts()

0    296391
1    145065
Name: GENHLTH_VERYGOOD, dtype: int64

Gen Health - Good, Fair, Poor, Unsure, Refused, Null

In [84]:
df['GENHLTH_GOOD'] = df['GENHLTH'] == 3
df['GENHLTH_FAIR'] = df['GENHLTH'] == 4
df['GENHLTH_POOR'] = df['GENHLTH'] == 5
df['GENHLTH_UNSURE'] = df['GENHLTH'] == 7
df['GENHLTH_REFUSED'] = df['GENHLTH'] == 9
df['GENHLTH_NULL'] = df['GENHLTH'].isnull()

In [85]:
# Convert the remaining boolean flags to 0/1 in one step
flag_cols = ['GENHLTH_GOOD', 'GENHLTH_FAIR', 'GENHLTH_POOR', 'GENHLTH_UNSURE', 'GENHLTH_REFUSED', 'GENHLTH_NULL']
df[flag_cols] = df[flag_cols].astype(int)

In [86]:
df[['GENHLTH_EXCELLENT', 'GENHLTH_VERYGOOD', 'GENHLTH_GOOD', 'GENHLTH_FAIR', 'GENHLTH_POOR', 'GENHLTH_UNSURE', 'GENHLTH_REFUSED', 'GENHLTH_NULL']].sum()

GENHLTH_EXCELLENT     76032
GENHLTH_VERYGOOD     145065
GENHLTH_GOOD         136975
GENHLTH_FAIR          58962
GENHLTH_POOR          23175
GENHLTH_UNSURE          799
GENHLTH_REFUSED         446
GENHLTH_NULL              2
dtype: int64
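
The same widening can be written as a loop over a code-to-label mapping, which avoids repeating each line by hand. This is a sketch on made-up data, not a replacement for the cells above; the `labels` dictionary and the toy frame are illustrative, with codes and names taken from those cells.

```python
import pandas as pd

# Toy GENHLTH codes (made up); None stands in for a missing answer.
toy = pd.DataFrame({'GENHLTH': [1.0, 2.0, 2.0, 3.0, 9.0, None]})

labels = {1: 'EXCELLENT', 2: 'VERYGOOD', 3: 'GOOD', 4: 'FAIR',
          5: 'POOR', 7: 'UNSURE', 9: 'REFUSED'}

# One 0/1 indicator column per code, plus one for missing values.
for code, label in labels.items():
    toy[f'GENHLTH_{label}'] = (toy['GENHLTH'] == code).astype(int)
toy['GENHLTH_NULL'] = toy['GENHLTH'].isnull().astype(int)

# Each row falls into exactly one indicator, so the totals add up to the row count.
indicator_totals = toy.filter(like='GENHLTH_').sum()
print(indicator_totals)
```

Checking that the totals sum to the number of rows is a quick way to confirm no code was left out of the mapping.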

### General Health Descriptive column (long format)

In [91]:
df['GENHLTH_DESCRIPTION'] = df['GENHLTH']
df['GENHLTH_DESCRIPTION'].value_counts()


2.0    145065
3.0    136975
1.0     76032
4.0     58962
5.0     23175
7.0       799
9.0       446
Name: GENHLTH_DESCRIPTION, dtype: int64

In [93]:
# Translate the numeric GENHLTH codes into their descriptions in one pass
df['GENHLTH_DESCRIPTION'] = df['GENHLTH_DESCRIPTION'].replace({
    1: 'Excellent', 2: 'Very Good', 3: 'Good', 4: 'Fair',
    5: 'Poor', 7: 'Unsure', 9: 'Refused',
})

In [94]:
df['GENHLTH_DESCRIPTION'].value_counts()


Very Good    145065
Good         136975
Excellent     76032
Fair          58962
Poor          23175
Unsure          799
Refused         446
Name: GENHLTH_DESCRIPTION, dtype: int64

In [111]:
df_genhlth = pd.DataFrame(df[['GENHLTH_DESCRIPTION']].value_counts() / df[['GENHLTH_DESCRIPTION']].value_counts().sum())

In [113]:
df_genhlth.reset_index(inplace=True)

In [120]:
df_genhlth.rename(columns={0: 'Ratio'}, inplace=True)
df_genhlth

Unnamed: 0,GENHLTH_DESCRIPTION,Ratio
0,Very Good,0.328607
1,Good,0.310281
2,Excellent,0.172231
3,Fair,0.133563
4,Poor,0.052497
5,Unsure,0.00181
6,Refused,0.00101
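
The ratio table can also be built in one chain with `value_counts(normalize=True)`, which returns shares instead of counts. The sketch below runs on a made-up Series; the explicit `rename` and `rename_axis` calls are there because the default names of the result differ between pandas versions.

```python
import pandas as pd

# Toy descriptions (made up) standing in for df['GENHLTH_DESCRIPTION'].
desc = pd.Series(['Good', 'Good', 'Fair', 'Excellent'], name='GENHLTH_DESCRIPTION')

ratios = (
    desc.value_counts(normalize=True)       # shares of non-missing answers
        .rename('Ratio')                    # name the value column explicitly
        .rename_axis('GENHLTH_DESCRIPTION') # name the label column explicitly
        .reset_index()
)
print(ratios)
```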


## Correlation analysis

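A plain correlation on these columns is misleading because the numeric values are survey codes, not measurements. GENHLTH is an ordered rating from 1 (Excellent) to 5 (Poor) plus 7 (Unsure) and 9 (Refused), and the medians of 88 and maxima of 99 in the describe() output above show that the other three columns also carry special codes alongside day counts.

Perhaps I should separate out each description into its own column first.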

In [34]:
# STATE now holds strings, so it is left out of the correlation
df[['GENHLTH', 'PHYSHLTH', 'MENTHLTH', 'POORHLTH']].corr()

Unnamed: 0,GENHLTH,PHYSHLTH,MENTHLTH,POORHLTH
GENHLTH,1.0,-0.309214,-0.157437,-0.189212
PHYSHLTH,-0.309214,1.0,0.242262,0.249592
MENTHLTH,-0.157437,0.242262,1.0,-0.013913
POORHLTH,-0.189212,0.249592,-0.013913,1.0
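
One possible alternative to splitting every description into its own column is to strip the non-answer codes first and then use a rank correlation, which suits an ordinal rating like GENHLTH. The sketch below is run on made-up rows and assumes the usual BRFSS meaning of 88 (none) and 77/99 (don't know/refused) for the day-count columns, which should be checked against the 2015 codebook.

```python
import numpy as np
import pandas as pd

# Toy rows (made up) using the same column names as the notebook.
toy = pd.DataFrame({
    'GENHLTH':  [1.0, 2.0, 3.0, 4.0, 5.0, 9.0],
    'PHYSHLTH': [88.0, 2.0, 5.0, 15.0, 30.0, 77.0],
})

cleaned = pd.DataFrame({
    # Keep only the ordered 1-5 ratings; 7 (Unsure) and 9 (Refused) become NaN.
    'GENHLTH': toy['GENHLTH'].where(toy['GENHLTH'].between(1, 5)),
    # Assumed codebook meaning: 88 = zero days, 77/99 = non-response.
    'PHYSHLTH': toy['PHYSHLTH'].replace({88: 0, 77: np.nan, 99: np.nan}),
})

# Spearman correlates ranks, so it only relies on the ordering of the ratings.
rank_corr = cleaned.corr(method='spearman')
print(rank_corr)
```

In this toy data a worse rating always goes with more unhealthy days, so the rank correlation is perfect; the real data will of course be noisier.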
