# Lab Assignment 10: Exploratory Data Analysis, Part 1
## DS 6001: Practice and Application of Data Science

### Instructions
Please answer the following questions as completely as possible using text, code, and the results of code as needed. Format your answers in a Jupyter notebook. To receive full credit, make sure you address every part of the problem, and make sure your document is formatted in a clean and professional way.

In this lab, you will be working with the 2018 [General Social Survey (GSS)](http://www.gss.norc.org/). The GSS is a sociological survey created in 1972 and collected regularly since then by the National Opinion Research Center at the University of Chicago. It is funded by the National Science Foundation. The GSS collects information and keeps a historical record of the concerns, experiences, attitudes, and practices of residents of the United States, and it is one of the most important data sources for the social sciences.

The data include features that measure concepts that are notoriously difficult to ask about directly, such as religion, racism, and sexism. The data also include many different metrics of how successful a person is in his or her profession, including income, socioeconomic status, and occupational prestige. These occupational prestige scores are coded separately by the GSS. The full description of their methodology for measuring prestige is available here: <http://gss.norc.org/Documents/reports/methodological-reports/MR122%20Occupational%20Prestige.pdf>. Here's a quote to give you an idea about how these scores are calculated:

> Respondents then were given small cards which each had a single occupational titles listed on it. Cards were in English or Spanish. They were given one card at a time in the preordained order. The interviewer then asked the respondent to "please put the card in the box at the top of the ladder if you think that occupation has the highest possible social standing. Put it in the box of the bottom of the ladder if you think it has the lowest possible social standing. If it belongs somewhere in between, just put it in the box that matches the social standing of the occupation."

The prestige scores are calculated from the aggregated rankings according to the method described above.

### Problem 0
Import the following packages:

In [3]:
import numpy as np
import pandas as pd
import sidetable
import weighted # module provided by the wquantiles package: run pip install wquantiles or conda install wquantiles to get access to it
from scipy import stats 
from sklearn import manifold
from sklearn import metrics
import prince
from pandas_profiling import ProfileReport
pd.options.display.max_columns = None

Then load the GSS data with the following code:

In [4]:
%%capture
gss = pd.read_csv("https://github.com/jkropko/DS-6001/raw/master/localdata/gss2018.csv",
                 encoding='cp1252', na_values=['IAP','IAP,DK,NA,uncodeable', 'NOT SURE',
                                               'DK', 'IAP, DK, NA, uncodeable', '.a', "CAN'T CHOOSE"])
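
The `na_values` argument above tells `pd.read_csv` which strings to read as missing values rather than as ordinary text. As a minimal sketch of that behavior, the example below uses a made-up three-row CSV held in memory; the name `toy_csv` and all values in it are hypothetical and are not taken from the GSS file.

```python
import io
import pandas as pd

# Hypothetical three-row CSV standing in for the survey file; not real GSS data.
toy_csv = io.StringIO(
    "id,satjob,educ\n"
    "1,very satisfied,12\n"
    "2,IAP,16\n"
    "3,DK,.a\n"
)

# Strings listed in na_values are read as NaN instead of as text categories.
toy = pd.read_csv(toy_csv, na_values=["IAP", "DK", ".a"])

# Count the missing entries in each column.
missing_per_column = toy.isna().sum()
print(missing_per_column)
```

Because `.a` is read as missing, the toy `educ` column contains only numbers and NaN, so pandas stores it as a float column rather than as text.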

### Problem 1
Drop all columns except for the following:
* `id` - a numeric unique ID for each person who responded to the survey
* `wtss` - survey sample weights
* `sex` - male or female
* `educ` - years of formal education
* `region` - region of the country where the respondent lives
* `age` - age
* `coninc` - the respondent's personal annual income
* `prestg10` - the respondent's occupational prestige score, as measured by the GSS using the methodology described above
* `mapres10` - the respondent's mother's occupational prestige score, as measured by the GSS using the methodology described above
* `papres10` - the respondent's father's occupational prestige score, as measured by the GSS using the methodology described above
* `sei10` - an index measuring the respondent's socioeconomic status
* `satjob` - responses to "On the whole, how satisfied are you with the work you do?"
* `fechld` - agree or disagree with: "A working mother can establish just as warm and secure a relationship with her children as a mother who does not work."
* `fefam` - agree or disagree with: "It is much better for everyone involved if the man is the achiever outside the home and the woman takes care of the home and family."
* `fepol` - agree or disagree with: "Most men are better suited emotionally for politics than are most women."
* `fepresch` - agree or disagree with: "A preschool child is likely to suffer if his or her mother works."
* `meovrwrk` - agree or disagree with: "Family life often suffers because men concentrate too much on their work."

Then rename any columns with names that are non-intuitive to you to more intuitive and descriptive ones. Finally, replace the "89 or older" values of `age` with 89, and convert `age` to a float data type. [1 point]
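
The steps in this problem (keep a subset of columns, rename, recode the top-coded age label, convert to float) can be sketched on a tiny made-up data frame. This is only one possible approach under stated assumptions: the frame `toy`, its values, and the new name `job_prestige` are hypothetical illustrations, not the GSS data or a required naming scheme.

```python
import pandas as pd

# Hypothetical miniature frame; column names mirror the lab, values are invented.
toy = pd.DataFrame({
    "id": [1, 2, 3],
    "age": ["43", "89 or older", "74"],
    "prestg10": [47.0, 22.0, 61.0],
    "zodiac": ["virgo", "aquarius", "aries"],
})

# Columns to keep; everything else (here, zodiac) is dropped.
keep = ["id", "age", "prestg10"]

# Select the columns, then give one of them a more descriptive (illustrative) name.
toy = toy[keep].rename(columns={"prestg10": "job_prestige"})

# Recode the top-coded label to "89", then convert the text column to float.
toy["age"] = toy["age"].replace("89 or older", "89").astype(float)

print(toy.dtypes)
```

Recoding before converting matters here: calling `astype(float)` while the label "89 or older" is still present would raise an error, because that string cannot be parsed as a number.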

In [6]:
# Create a list of the selected feature names (17 in total):
selected_fetures = ['id', 'wtss', 'sex', 'educ', 
                    'region', 'age', 'coninc', 'prestg10', 
                    'mapres10', 'papres10', 'sei10', 'satjob', 
                    'fechld', 'fefam', 'fepol', 'fepresch', 'meovrwrk']
len(selected_fetures)

17

In [5]:
gss.head()

Unnamed: 0,abany,abdefect,abfelegl,abhelp1,abhelp2,abhelp3,abhelp4,abhlth,abinspay,abmedgov1,abmedgov2,abmelegl,abmoral,abnomore,abpoor,abpoorw,abrape,absingle,abstate1,abstate2,acqntsex,actssoc,adminconsent,adults,advfront,affrmact,afraidof,afterlif,age,aged,agekdbrn,ancestrs,arthrtis,astrolgy,astrosci,atheists,attend,attend12,attendma,attendpa,away1,away11,away2,away3,away4,away5,away6,away7,babies,backpain,ballot,balneg,balpos,befair,betrlang,bible,bigbang,bigbang1,bigbang2,bird,birdb4,born,boyorgrl,breakdwn,buddhsts,buyesop,buyvalue,cantrust,cappun,cat,catb4,charactr,chemgen,childs,chldidel,christns,churhpow,class,clergvte,closeto1,closeto2,closeto3,closeto4,closeto5,cntctfam,cntctfrd,cntctkid,cntctpar,cntctsib,codeg,coden,coeduc,coevwork,cofund,cohort,cohrs1,cohrs2,coind10,coisco08,cojew,colath,colcom,coldeg1,colhomo,colmil,colmslm,colrac,colsci,colscinm,comfort,company,compperf,comprend,compuse,compwage,conarmy,conbiz,conbus,conchurh,conclerg,concong,concourt,condemnd,condom,condrift,coneduc,conf2f,confed,confinan,coninc,conjudge,conlabor,conlegis,conmedic,conpress,conrinc,conschls,consci,consent,contv,conwkday,coocc10,coop,coother,copres10,copres105plus,corel,cosei10,cosei10educ,cosei10inc,courts,cowrkhlp,cowrkint,cowrkslf,cowrksta,crack30,dangoth1,dangoth2,dangoth3,dangoth4,dangoth5,dangroth,dangrslf,dangslf1,dangslf2,dangslf3,dangslf4,dangslf5,dateintv,decmoney,dectreat,defpensn,degree,demands,denkid,denom,denom16,depress,deptperf,diabetes,diagnosd,difrel,dinefrds,dipged,discaff,discaffm,discaffw,disrspct,divlaw,divorce,dofirst,dog,dogb4,dwelling,dwellpre,dwelown,dwelown16,earnrs,earthsun,educ,egomeans,electron,emailhr,emailmin,emoprobs,empinput,emptrain,endsmeet,eqwlth,esop,esopnot,eth1,eth2,eth3,ethnic,ethnum,evcrack,evidu,evolved,evolved2,evpaidsx,evstray,evwork,expdesgn,exptext,extr2017,extrapay,extraval,extrayr,fair,fairearn,famdif16,famgen,family16,fammhneg,fampress,famvswk,famwkoff,fatalism,fatigue,fear,fechld,feelevel,feelrel,feeused,fefam,fehire,fe
jobaff,fepol,fepresch,finalter,finrela,firstyou,fish,fishb4,form,formwt,fringeok,frndsex,fucitzn,fund,fund16,gender1,gender10,gender11,gender12,gender2,gender3,gender4,gender5,gender6,gender7,gender8,gender9,geneabrt2,genegen,genegoo2,geneself2,genetics,genetst1,getahead,goat,goatb4,god,godchnge,godmeans,godswill,goodlife,goveqinc,govlazy,govvsrel,granborn,grass,gunlaw,handmove,hapcohab,hapmar,happy,hapunhap,haveinfo,health,health1,healthissp,heaven,hefinfo,height,hell,helpblk,helpfrds,helpful,helpnot,helpoth,helppoor,helpsick,hhrace,hhtype,hhtype1,hindus,hispanic,hivtest,hivtest1,hivtest2,hlpadvce,hlpdown,hlpequip,hlphome,hlpjob,hlploan,hlppaper,hlpresde,hlpsick,hlpsickr,hlpsococ,hlthdays,hlthmntl,hlthphys,hlthstrt,homosex,homosex1,hompop,horse,horseb4,hotcore,hrs1,hrs2,hrsrelax,hsbio,hschem,hsmath,hsphys,huadd,huaddwhy,hubbywrk,huclean,hunt,hunt1,hurtatwk,hurtoth,hurtself,hvylift,hyperten,id,idu30,if12who,if16who,imbalnce,imprvown,imprvtrt,incgap,incom16,income,income16,incuspop,indperf,indus10,indusgen,intage,intcntct,intecon,inteduc,intenvir,intethn,intfarm,inthisp,intid,intintl,intlblks,intlhsps,intlwhts,intmed,intmil,intrace1,intrace2,intrace3,intsci,intsex,intspace,inttech,intyrs,isco08,isco88,issp,jew,jew16,jews,jobfind,jobfind1,joblose,jobsecok,kidpars,kidsinhh,kidssol,knowschd,knowwhat,knwbus,knwclenr,knwcop,knwcuttr,knwexec,knwhrman,knwlawyr,knwmchnc,knwmw1,knwmw2,knwmw3,knwmw4,knwmw5,knwnurse,knwtcher,laidoff,lasers,learnnew,letdie1,letin1a,libath,libcom,libhomo,libmil,libmslm,librac,life,lifein5,lifenow,liveblks,lngthinv,localnum,lonely1,lonely2,lonely3,madeg,madenkid,maeduc,maind10,maisco08,maisco88,major1,major2,majorcol,makefrnd,maleornt,manvsemp,maocc10,mapres10,mapres105plus,mar1,mar11,mar12,mar2,mar3,mar4,mar5,mar6,mar7,mar8,mar9,marasian,marblk,marcohab,marelkid,marhisp,marhomo,marital,martype,marwht,masei10,masei10educ,masei10inc,matesex,mawrkgrw,mawrkslf,mcsds1,mcsds2,mcsds3,mcsds4,mcsds5,mcsds6,mcsds7,meddoc,mentldoc,mentlhos,mentlill,mentloth
,meovrwrk,mhdiagno,mhp1r1,mhp1r2,mhp2r1,mhp2r2,mhp3r1,mhp3r2,mhp4r1,mhp4r2,mhp5r1,mhp5r2,mhtreat1,mhtreat2,mhtreat3,mhtreat4,mhtreat5,mhtreatd,mhunsure,miracles,misswork,mnthsusa,mntlhlth,mobile16,mode,moredays,muslims,mustdoc,musthosp,mustmed,mustwork,mygoals,myprobs1,myprobs2,myprobs3,myprobs4,myprobs5,myskills,mywaygod,nanoben,nanoharm,nanowill,nataccess,natactive,nataid,nataidy,natarms,natarmsy,natchld,natcity,natcityy,natcrime,natcrimy,natdrug,natdrugy,nateduc,nateducy,natenrgy,natenvir,natenviy,natfare,natfarey,natheal,nathealy,natlack,natmass,natmeet,natnotice,natpark,natrace,natracey,natrelax,natroad,natsat,natsci,natsoc,natspac,natspacy,nattime,nattimeok,natviews,neisafe,newfrds,news,newsfrom,nextgen,nihilism,notsmart,ntwkhard,nukegen,numcong,numemps,numlangs,nummen,numorg,numpets,numwomen,obey,occ10,odds1,odds2,old1,old10,old11,old12,old2,old3,old4,old5,old6,old7,old8,old9,opdevel,otcmed,oth16,other,othersex,othlang,othlang1,othlang2,othmhneg,othpet,othpetb4,oversamp,overwork,owngun,ownstock,padeg,padenkid,paeduc,paidsex,painarms,paind10,paisco08,paisco88,paocc10,papres10,papres105plus,parborn,parelkid,parsol,partfull,partlsc,partners,partnrs5,partpart,partteam,partvol,partyid,pasei10,pasei10educ,pasei10inc,pawrkslf,petb4,petb4cmfrt,petb4fam,petb4ply,petcmfrt,petfam,petplay,phase,phone,phyeffrt,physacts,physhlth,physill,pig,pigb4,pikupsex,pilingup,pillok,pistol,polabuse,polattak,poleff11,polescap,polhitok,polmurdr,polviews,poorserv,popespks,popular,pornlaw,posslq,posslqy,postlife,pray,prayer,prayfreq,premarsx,pres12,pres16,prestg10,prestg105plus,preteen,prodctiv,promtefr,promteok,proudemp,prvdhlth,prvdold,quallife,racdif1,racdif2,racdif3,racdif4,race,racecen1,racecen2,racecen3,raclive,racopen,racwork,radioact,random,rank,ratepain,ratetone,realinc,realrinc,reborn,reg16,region,relactiv,relate1,relate10,relate11,relate12,relate2,relate3,relate4,relate5,relate6,relate7,relate8,relate9,relatsex,relext1,relext3,relgenbar,relgeneq,relhh1,relhh10,relhh11,relhh12,r
elhh2,relhh3,relhh4,relhh5,relhh6,relhh7,relhh8,relhh9,relhhd1,relhhd10,relhhd11,relhhd12,relhhd2,relhhd3,relhhd4,relhhd5,relhhd6,relhhd7,relhhd8,relhhd9,relig,relig16,religcon,religint,religkid,reliten,relmarry,relobjct,relpast,relpersn,relrlvnt,relscrpt,relsp1,relsp10,relsp11,relsp12,relsp2,relsp3,relsp4,relsp5,relsp6,relsp7,relsp8,relsp9,relsprt,reptile,reptileb4,res16,respect,respnum,respond,rfamlook,rgroomed,rhlthend,richwork,rifle,rincblls,rincom16,rincome,rlooks,rowngun,rplace,rvisitor,rweight,rxmed,safefrst,safehlth,safetywk,sampcode,sample,satfam7,satfin,satjob,satjob1,satlife,satsoc,savesoul,scibnfts,scientbe,scientgo,scienthe,scientod,scifrom,scinews1,scinews2,scinews3,scistudy,scitext,secondwk,seeksci,seetalk1,seetalk2,seetalk3,seetalk4,seetalk5,sei10,sei10educ,sei10inc,selfhelp,seriousp,severe1,severe2,severe3,severe4,severe5,sex,sexbirth,sexeduc,sexfreq,sexnow,sexornt,sexsex,sexsex5,shotgun,sibs,size,slfmangd,slpprblm,smallgap,smammal,smammalb4,socbar,socfrend,socommun,socrel,solarrev,spaneng,spanint,spanking,spanself,spdeg,spden,speduc,spevwork,spfalook,spfund,sphealer,sphrs1,sphrs2,spind10,spisco08,spisco88,spjew,spkath,spkcom,spkhomo,spklang,spkmil,spkmslm,spkrac,splive,spocc10,spother,sppres10,sppres105plus,sprel,sprtprsn,spsei10,spsei10educ,spsei10inc,spvtrfair,spwksup,spwrkslf,spwrksta,srcbelt,stockops,stockval,stress,stress12,stresses,strredpg,suicide1,suicide2,suicide3,suicide4,supcares,supervis,suphelp,tax,teamsafe,teens,teensex,tempgen,theism,thnkself,threaten,tlkclrgy,tlkfam,toofast,toofewwk,trbigbus,trcourts,trdunion,trust,trustman,trustsci,trynewjb,tvhours,unemp,unhappy,union,union1,unrelat,upsdowns,upset,uscitzn,usedup,usetech,usewww,uswary,version,vetyears,vigfrnd,viggrp,viglabel,vigmar,vignei,vigsoc,vigversn,vigwork,viruses,visitors,visnhist,vissci,vistholy,viszoo,vote12,vote16,vpsu,vstrat,watergen,waypaid,wayraise,wealth,webmob,weekswrk,weight,where1,where11,where2,where3,where4,where5,where6,where7,whoelse1,whoelse2,whoelse3,whoelse4,
whoelse5,whoelse6,whynopet,whywkhme,widowed,wkageism,wkdecide,wkfreedm,wkharoth,wkharsex,wkpraise,wkracism,wksexism,wksmooth,wksub,wksub1,wksubs,wksubs1,wksup,wksup1,wksups,wksups1,wkvsfam,wlthblks,wlthhsps,wlthwhts,worda,wordb,wordc,wordd,worde,wordf,wordg,wordh,wordi,wordj,wordsum,workblks,workdiff,workfast,workfor1,workhard,workhsps,workwhts,wrkgovt,wrkhome,wrksched,wrkslf,wrkslffam,wrkstat,wrktime,wrktype,wrkwayup,wtss,wtssall,wtssnr,wwwhr,wwwmin,xmarsex,xmarsex1,xmovie,xnorcsiz,year,yearsjob,yearsusa,yearval,yousup,zodiac
0,no,yes,,yes,yes,yes,yes,yes,people should be able,the government should decide,,it depends,morally opposed,no,no,always wrong,yes,no,neither easy nor hard,make it harder,,very good,r does not consent to possible data linkage,5,strongly agree,strongly oppose pref,never,"yes, definitely",43,,,"no, definitely not",no,no,not at all scientific,somewhat negative,2-3x a month,every week,every week,about once or twice a yr,,,,,,,,,0.0,no,ballot a,slightly in favor,,,language 1,word of god,False,,,,,yes,true,,neither positive nor negative,i would be neither more nor less likely to buy...,,,favor,,,,extremely dangerous,0,2.0,very positive,far too litl pwr,working class,disagree,,,,,,,,,,,,,,,,1975.0,,,,,,allowed,not fired,associate's,allowed,allowed,"yes, allowed",allowed,no,,agree,a company whose stock is owned by outside inve...,no,good,yes,1 lower,,some confidence,,some confidence,,some confidence,very little confidence,not at all true,,True,,,,,,,,,,,,some confidence,,r consents to recording interview,,,,"friendly,interested",,,,,,,,not harsh enough,very true,very true,,,,,,,,,,,,,,,,907,,,yes,junior college,,"baptist, dk which",no denomination,baptist-dk which,no,no,no,,disagree,,high school diploma,somewhat likely,somewhat unlikely,,a few times a month,more difficult,,,,,detached 1-fam house,detached single family house,pays rent,owned or was buying,1.0,earth around sun,14.0,disagree,True,25.0,0.0,rarely,no,yes,,,,i would definitely take the job with the esop ...,germany,scotland,UNCODEABLE & IAP,UNCODEABLE & IAP,cannot choose 1,,,False,,,,,DONT KNOW,,yes,yes,DONT KNOW,2018.0,,somewhat less than you deserve,,1 gen,mother & father,,,rarely,somewhat hard,strongly disagree,mild,yes,strongly agree,$75+,very religious,"yes, money",disagree,,strongly against,agree,strongly disagree,better,below average,,,,standard <x>,1,somewhat true,,,moderate,fundamentalist,male,,,,female,male,male,male,,,,,no,somewhat dangerous,good > harm,no,,not very much,hard work,,,know god 
exists,"believe now, didn't used to",disagree,,,,,strongly agree,all in u.s,,favor,no,,,pretty happy,fairly happy,somewhat true,good,good,good,"yes, definitely",4th person,73.0,"yes, definitely",,,,,,,,"other, mixed","4+adlts,0mar,0kids","unsure, no children",neither positive nor negative,not hispanic,,,,,,somewhat true,,,,,,,,,0.0,very good,good,,always wrong,always wrong,5,,,True,,41.0,3.0,yes,yes,trigonometry linear programming analysis,no,yes,,neither agree nor disagree,NO ANSWER,neither,neither hunts,0.0,,,no,yes,1,,,,,,,,below average,,,average,yes,employment services,extremely dangerous,68,,very interested,moderately interested,very interested,white,moderately interested,,125,moderately interested,4,4,4,very interested,moderately interested,No Answer,No Answer,No Answer,very interested,female,moderately interested,moderately interested,23.0,personnel and careers professionals,4190.0,did issp,,,somewhat negative,very easy,very easy to find similar job,not likely,somewhat true,,yes,,4 weeks or more,strongly agree,,,,,,,,,,,,,,,,no,False,strongly agree,no,remain the same as it is,not remove,not remove,not remove,not remove,not remove,not remove,routine,9,7.0,neither favor nor oppose,136,1-9,,,,high school,"baptist, dk which",12,clothing stores,shop sales assistants,5220.0,communications/speech,,communications/speech,disagree,r sexual orientation uncertain,very good,retail salespersons,31.0,18.0,,,,,,never married,,,,,,neither favor nor oppose,oppose,"not married, no cohabitating partner",protestant,neither favor nor oppose,strongly disagree,never married,,neither favor nor oppose,39.7,55.9,30.9,,yes,someone else,,,,,,,,,,,,,agree,,,,,,,,,,,,,,,,,,,"yes, definitely",0.0,,20.0,different state,over the phone,8.0,somewhat negative,,,,no,,,,,,,strongly agree,agree,slightly in favor,,benefits greater,,,too much,,too much,,about right,too little,,too little,,too little,,too little,,too little,about right,,about right,,too little,,,about right,,,too much,too 
little,,,too little,,about right,too little,too much,,,,,very safe,,less than once wk,the internet,strongly agree,strongly disagree,never,10.0,extremely dangerous,300.0,,,,NO ANSWER,,,,human resources workers,no,yes,,,,,,,43.0,,,,,,somewhat true,,,,,yes,spanish,,,,,1,agree,no,i do not know if i own stock in my company,high school,christian; central christian,12,,no,postal service,mail carriers and sorting clerks,4142.0,postal service mail carriers,45.0,51.0,both in u.s,protestant,,full-time,,,,,"no, i work mostly on my own",,not str republican,58.4,50.1,78.4,someone else,,,,,,,,phase two - sub sampled cases,cellphone,very light,completely,0.0,,,,,,disagree,no,,,,,,,conservative,Don't know,,,,no steady partner,,yes,several times a week,disapprove,several times a week,not wrong at all,romney,trump,47.0,59.0,0.0,disagree,somewhat true,somewhat true,strongly agree,,,good,no,no,no,yes,white,white,,,no,owner decides,all white,False,2.0,4,1,,,,yes,middle atlantic,new england,several times a year,head of household,,,,non-relative,non-relative,non-relative,non-relative,,,,,,definitely not,probably not,disagree,treats women better than men,householder,,,,"partner,girl(boy)friend","roommate, housemate","roommate, housemate","roommate, housemate",,,,,head of household,,,,"partner,fiance-e-,boyfriend,girlfriend,etc","roommate,housemate","roommate,housemate","roommate,housemate",,,,,christian,protestant,agree,strongly agree,protestant,strong,probably not accept,yes,strongly disagree,very religious,strongly agree,yes,spouse,,,,"hh spouse, partner","roommate, housemate","roommate, housemate","roommate, housemate",,,,,i follow a religion and consider myself to be ...,,,town lt 50000,strongly agree,4th person,high,iap,NO ANSWER,,continue working,no,yes,,,NO ANSWER,,non-relative,r. 
is household member,NO ANSWER,,strongly agree,strongly agree,strongly agree,601,2010 fp,fairly satisfied,not at all sat,very satisfied,somewhat satisfied,,good,yes,harmful results greater,agree,strongly agree,agree,agree,the internet,,,science site,little understanding,,no,the internet,,,,,,65.3,82.4,65.0,,,,,,,,male,,favor,,,,,,no,4.0,14,no,sometimes,,,,once a month,sev times a mnth,never,sev times a year,one year,english,,agree,,iap,,iap,,iap,,,,,Not applicable,,,,allowed,allowed,allowed,well,allowed,"yes, allowed",allowed,iap,,,,,,very spiritual,,,,very true,,,,"suburb, 13-100",no,,always,No answer,,no,yes,yes,yes,yes,somewhat true,doesnt supervise,somewhat true,too high,strongly agree,0.0,almst always wrg,extremely dangerous,strongly agree,,never,,,disagree,often,,,DONT KNOW,,disagree,strongly agree,somewhat likely,3.0,,,,,,,,,sometimes,100.0,,yes,1,none,,,,,,,,,True,no visitors,1.0,0.0,never,2.0,voted,voted,1,3301,extremely dangerous,paid by the hour,,DONT KNOW,,52.0,235.0,,,,,,,,,no,no,no,no,no,yes,,worker wants to work at home,,no,sometimes,somewhat true,no,no,maybe,no,no,agree,yes,yes,yes,yes,yes,yes,no,no,rarely,4.0,5.0,4.0,correct,correct,correct,correct,correct,correct,incorrect,correct,correct,correct,9.0,6.0,strongly agree,strongly agree,for-profit company,,2.0,4.0,private,a few times a year,day shift,someone else,,temp not working,not at all true,"regular, permanent employee",agree strongly,2.357493,2.357493,2.753531,20.0,0.0,always wrong,always wrong,,"uninc,med city",2018,1,,,45.0,virgo
1,yes,yes,it depends,no,no,no,no,yes,people should not be able,,a woman and her medical professional should de...,,it depends,yes,no,,yes,yes,easy,stay the same as now,,good,r does not consent to possible data linkage,2,,,never,,74,a good idea,21.0,,,,,,once a year,,,,,,,,,,,,0.0,,ballot c,,,fair all of time,,word of god,,,,,,yes,,very likely,,,,always trusted,oppose,,,somewhat likely,,3,,,,working class,,5.0,not close at all,,,,several times a year,once a week,daily,my parents are no longer alive,several times a year,,,,,,1944.0,,,,,,not allowed,fired,,allowed,not allowed,not allowed,not allowed,,,,,,good,,,a great deal,,a great deal,,a great deal,,,,not used,,a great deal,most of them,a great deal,a great deal,22782.5,a great deal,a great deal,a great deal,a great deal,a great deal,,,,r consents to recording interview,a great deal,5-9 people,,"friendly,interested",,,,,,,,not harsh enough,,,,,,not at all dangerous,very dangerous,,,,yes,yes,not at all dangerous,very dangerous,,,,426,not very able,very able,,high school,"yes, but seldom",,,,,,,no,,never,ged,not very likely,,,never,,,talk to family and friends about it,,,detached 1-fam house,detached single family house,,,1.0,,10.0,,,,,rarely,,,neither easy nor difficult,no govt action,,,italy,UNCODEABLE & IAP,UNCODEABLE & IAP,italy,names 1,no,no,,,no,no,yes,,,,,,,take advantage,,,"2 gens, children",mother & father,somewhat,"no, never",,,,none,no,,25,,"yes, money",,,,,,worse,below average,agree,,,alternate <y>,1,,,,moderate,moderate,female,,,,female,,,,,,,,no,,harm > good,no,somewhat likely,nothing at all,hard work,,,know god exists,,,somewhat likely,agree,strongly disagree,agree,,4,not legal,favor,,,,very happy,,,excellent,,very good,,1st person,,,govt help blks,agree,helpful,agree with both,2nd important,agree with both,agree with both,white,"2adlts,ntmar,rel,0kids","other fam., no children",,not hispanic,no,,,close family member,close family member,,close family member,no person or organization,public 
services,public services,no person or organization,close family member,family members or close friends,close family member,,very good,very good,good,not wrong at all,,2,,,,,,,,,,,yes,,,clean,neither,neither hunts,,not very likely,very likely,,,2,,,,not very likely,very likely,not likely at all,neither,far below average,$25000 or more,$30000 to 34999,average,,grocery stores,,62,none or almost none of it,,,,white,,not hispanic,32,,,,,,,white,,,,female,,,21.0,hand packers,9322.0,did issp,,,,,,,,strongly agree,no,much better,,,no one,no one,no one,no one,no one,no one,someone else i know,no one,male,female,,,,no one,no one,,,,,,not remove,not remove,not remove,not remove,not remove,not remove,exciting,,,,153,,never,never,never,lt high school,,8,"textile product mills, except carpet and rug",sewing machine operators,8263.0,,,,,,,sewing machine operators,32.0,22.0,married,,,divorced,,,,,,,,,,"not married, no cohabitating partner",,,agree,separated,,,13.2,16.5,5.5,,yes,someone else,True,False,True,False,False,False,False,yes,yes,no,not very likely,yes,,yes,spouse/partner (current/ex),No answer,child,No answer,,,,,,,yes,yes,,,,,"disagree, or",,,,,different state,in-person,,,yes,yes,yes,,completely true,8,very much,,,,,,,,,strongly agree,strongly agree,,too much,,too little,about right,,about right,,too little,,too little,,too little,about right,,too little,,too little,,about right,somewhat disagree,about right,strongly agree,strongly agree,about right,,about right,strongly agree,about right,strongly agree,too little,too little,,about right,strongly agree,strongly agree,strongly agree,very safe,,,,,,never,,,,,one language,56.0,,no pets,0.0,4th important,"packers and packagers, hand",,,74.0,,,,53.0,,,,,,,,,yes,,,,no,,,somewhat,,,1,,no,,lt high school,,0,,,construction,bricklayers and related workers,7122.0,"brickmasons, blockmasons, and stonemasons",39.0,36.0,mother only,,much better,,several times a year,no partners,1 partner,never,,never,"ind,near 
dem",24.6,16.0,30.4,someone else,no,,,,,,,phase one - initial cases,phone in home,,moderately,,somewhat likely,,,,never,,no,no,no,disagree,yes,no,yes,,never,certainly true,least important,illegal under 18,,"i have a husband or wife or steady partner, bu...",yes,once a day,,,,obama,trump,22.0,13.0,0.0,,,,,government,government,excellent,,,,,white,white,,,,cant discriminate,,,,top,7,lightest,14755.0,,no,middle atlantic,new england,never,head of household,,,,child,,,,,,,,"no, no relationship",,,,,householder,,,,"child, unsp",,,,,,,,head of household,,,,"child,natural or adopted,stepchild",,,,,,,,catholic,catholic,,,,strong,,,,modrte religious,,,,,,,,,,,,,,,,,,50000 to 250000,,1st person,high,iap,about average,,,no,,,,about average,,head of household,r. is household member,slightly overweight,yes,,,,601,2010 fp,,more or less,,,completely satisfied,very good,yes,,,,,,,,,,,,,,very often,not at all,,,,14.8,16.5,7.8,yes,very serious,very severe,very severe,,,,female,female,,not at all,women,heterosexual or straight,,exclusively male,no,4.0,14,,,disagree,,,,,,,,english,,,,iap,,iap,,iap,,yes,,,Not applicable,,,,allowed,allowed,allowed,,not allowed,"yes, allowed",allowed,iap,,,,,,very spiritual,,,,,,,,"suburb, 13-100",,,NO ISSP,,very likely,,,,,,,NO ISSP,,too high,,0.0,,,,most important,never,yes,yes,,,7.0,7.0,,can trust,,,,,no,never,r belongs,r belongs,0.0,very likely,never,,,,,,3,none,definitely willing,definitely willing,very likely,definitely willing,probably willing,probably willing,6.0,definitely willing,,no visitors,,,,,voted,voted,1,3301,,,not very likely,,,0.0,,,,,,,,,,no,no,no,no,no,yes,too expensive,,no,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,3rd important,,,private,,,someone else,,retired,,,,0.942997,0.942997,1.101412,,,always wrong,,no,"uninc,med city",2018,.i,,,,aquarius
2,,,,yes,no,yes,yes,,people should not be able,a woman and her medical professional should de...,,it depends,it depends,,,wrong only sometimes,,,very easy,make it harder,,excellent,r does not consent to possible data linkage,2,disagree,strongly oppose pref,,"yes, definitely",42,a good idea,35.0,"no, probably not",no,no,not at all scientific,neither positive nor negative,once a year,several times a week,several times a week,every week,,,,,,,,,,no,ballot b,,strongly in favor,,,inspired word,,True,,,,yes,DONT KNOW,,somewhat positive,i would be neither more nor less likely to buy...,0.0,,favor,,,,,2,2.0,somewhat positive,too much power,middle class,strongly agree,,,,,,,,,,,,,,,,1976.0,,,,,,,,bachelor's,,,,,yes,20.0,agree,a company whose stock is owned by the employee...,yes,good,yes,5 higher,only some,a great deal of confidence,only some,very little confidence,hardly any,very little confidence,some confidence,somewhat true,used last time,True,only some,,only some,hardly any,112160.0,a great deal,hardly any,hardly any,a great deal,hardly any,70100.0,some confidence,a great deal,r consents to recording interview,only some,,,"friendly,interested",,,,,,,,not harsh enough,somewhat true,somewhat true,,,,,,,,,,,,,,,,910,,,no,bachelor,,,,,no,no,no,,agree,,high school diploma,,somewhat unlikely,,,more difficult,no,,,,detached 1-fam house,detached single family house,own or is buying,owned or was buying,2.0,earth around sun,16.0,neither agree nor disagree,False,20.0,0.0,sometimes,no,yes,,3,no,i would definitely take the job without the es...,italy,other spanish,UNCODEABLE & IAP,UNCODEABLE & IAP,cannot choose 1,no,no,True,,no,no,,500 get the drug 500 dont,correct control group,yes,yes,10000,,fair,somewhat more than you deserve,"divorce,separated","2 gens, children",father & stpmother,,,sometimes,not at all hard,disagree,severe,,strongly agree,$75+,not rel or non,"yes, money",disagree,,against,disagree,disagree,better,above average,,,,standard <x>,1,very 
true,,,liberal,moderate,male,,,,female,male,female,,,,,,,,,,,,,,,believe but doubts,"don't believe now, used to",disagree,,agree,,,strongly agree,2,legal,,no,,very happy,very happy,very happy,somewhat true,,very good,very good,"yes, probably",1st person,68.0,"yes, probably",agree with both,,helpful,govt does too much,3rd important,4,agree with both,"other, mixed","2adlts,mar,1+kids",married couple w children,somewhat positive,chilean,no,,,,,very true,,,,,,,,,0.0,very good,good,,,not wrong at all,4,,,True,40.0,,3.0,yes,yes,pre-calculus,no,yes,,strongly disagree,NO ANSWER,,,0.0,,,no,yes,3,,,,,,,,above average,$25000 or more,$150000 to $169999,average,yes,wired telecommunications carriers,,68,,very interested,very interested,very interested,white,moderately interested,,125,moderately interested,intelligent,intelligent,intelligent,very interested,moderately interested,No Answer,No Answer,No Answer,very interested,female,moderately interested,very interested,23.0,computer network professionals,2131.0,did issp,,,somewhat positive,somewhat easy,somewhat easy to find similar job,not likely,somewhat true,,yes,somewhat better,4 weeks or more,agree,,,,,,,,,,,,,,,,no,False,agree,yes,reduced a little,,,,,,,,best possible state,8.0,favor,90,100-499,,,,bachelor,,16,banking and related activities,general office clerks,4110.0,electronics,,electronics,agree,,quite good,"office clerks, general",32.0,26.0,,,,,never married,never married,,,,,,strongly favor,strongly favor,married,catholic,favor,,married,marriage between a man and a woman,favor,35.8,56.1,22.8,yes,yes,someone else,,,,,,,,,,,,,disagree,,,,,,,,,,,,,,,,,,,"yes, probably",0.0,,3.0,different state,in-person,3.0,somewhat positive,,,,no,,,,,,,agree,agree,,,,,,,,too little,,,about right,,,,too little,,too little,,too little,too little,,too little,,too little,,,too little,,,about right,too little,,,too little,,,,too little,,,,,very safe,,never,radio,agree,strongly disagree,,30.0,,1000.0,,,0.0,5000,,8.0,least important,computer 
network architects,no,yes,,,,,,,,,,,,,somewhat true,,,,,no,,,,,,1,agree,,yes,high school,,12,,no,waste management and remediation services,managing directors and chief executives,1210.0,chief executives,72.0,90.0,father only,catholic,about the same,full-time,,1 partner,1 partner,,"yes, i work as part of a team",,"ind,near rep",77.4,85.7,87.0,self-employed,,,,,,,,phase one - initial cases,phone in home,very light,completely,0.0,,,,,,disagree,,no,yes,,yes,no,no,slghtly conservative,,,4th important,illegal under 18,married with partner,,yes,lt once a week,approve,2-3 times a month,not wrong at all,romney,trump,61.0,92.0,,agree,not too true,somewhat true,agree,,,excellent,yes,no,yes,no,white,white,,,no,,,False,1.0,5,2,,72640.0,45400.0,no,middle atlantic,new england,less than once a year,head of household,,,,spouse,child,child,,,,,,"yes, in relationship",probably,probably,agree,treats women better than men,householder,,,,spouse,"child, unsp","child, unsp",,,,,,head of household,,,,spouse,"child,natural or adopted,stepchild","child,natural or adopted,stepchild",,,,,,none,catholic,not agree/dsagre,agree,catholic,no religion,probably accept,no,neither agree nor disagree,slight religious,disagree,no,spouse,,,,"hh spouse, partner","child, not specified","child, not specified",,,,,,i follow a religion and consider myself to be ...,,,big-city suburb,agree,2nd person,high,iap,NO ANSWER,,,,yes,$90000 to $109999,$25000 or more,NO ANSWER,,spouse,r. is household member,NO ANSWER,,agree,agree,agree,601,2010 fp,fairly satisfied,more or less,mod. 
satisfied,somewhat satisfied,,very good,no,benefits greater,agree,agree,agree,disagree,books other printed material,,,,general sense,measurement,no,the internet,,,,,,83.4,89.4,93.1,,,,,,,,male,male,oppose,weekly,man,heterosexual or straight,exclusively female,exclusively female,,2.0,14,no,sometimes,,,,once a year,once a month,sev times a mnth,almost daily,one year,english,,agree,,junior college,,14,,iap,moderate,,40.0,,pharmaceutical and medicine manufacturing,agricultural technicians,3212.0,,,,,,,,,iap,agricultural and food science technicians,,45.0,50.0,catholic,modeate spirtual,44.5,55.6,42.5,somewhat true,no,someone else,working fulltime,"suburb, 13-100",no,10.0,sometimes,no,,yes,no,no,no,no,somewhat true,supervises,not too true,,agree,,always wrong,,neither agree nor disagree,most important,,,,disagree,often,,,disagree,can't be too careful,agree,strong disagree,not at all likely,1.0,no,,neither belongs,neither belongs,0.0,,,,often,100.0,,,2,none,,,,,,,,,True,no visitors,3.0,3.0,never,2.0,voted,voted,1,3301,,paid by the hour,,"$250,000 to $500,000",,52.0,225.0,,,,,,,,,no,no,no,no,no,yes,,worker wants to work at home,no,no,sometimes,somewhat true,no,no,yes,no,no,agree,yes,yes,yes,yes,yes,yes,no,no,sometimes,4.0,4.0,4.0,correct,correct,incorrect,correct,correct,correct,incorrect,incorrect,correct,incorrect,6.0,4.0,agree,disagree,for-profit company,2nd important,4.0,4.0,private,a few times a year,night shift,someone else,,working fulltime,not too true,"regular, permanent employee",neither agree nor disagree,0.942997,0.942997,1.101412,10.0,0.0,,always wrong,no,"uninc,med city",2018,15,,,3.0,aries
3,,,should,yes,yes,yes,yes,,people should be able,,a woman and her medical professional should de...,,it depends,,,,,,easy,stay the same as now,,very good,r consents to possible data linkage,2,,oppose pref,,,63,a good idea,32.0,,yes,,,,nrly every week,,,,,,,,,,,,0.0,yes,ballot b,,,fair mst of time,,inspired word,,,,,did not have,yes,,not very likely,,i would be less likely to buy from an employee...,,always trusted,oppose,,did not have,not at all likely,,2,2.0,,,middle class,,9.0,not close at all,7,6.0,not close at all,several times a week,several times a week,daily,my parents are no longer alive,daily,,,,,,1955.0,,,,,,,,,,,,,,,,a company whose stock is owned by the employee...,,good,yes,1 lower,a great deal,,only some,,only some,,,not too true,used last time,,hardly any,most of them,hardly any,only some,158201.8412,a great deal,only some,hardly any,hardly any,hardly any,84120.0,,a great deal,r consents to recording interview,only some,20-49 people,,"friendly,interested",,,,,,,,not harsh enough,very true,very true,,,,not at all dangerous,6,not at all dangerous,not at all dangerous,Don't know,yes,yes,not at all dangerous,6,not at all dangerous,not at all dangerous,Don't know,425,somewhat able,not very able,yes,bachelor,"no, never",,united methodist,,yes,,no,yes,,several times a year,high school diploma,,,somewhat unlikely,,easier,no,go to a general medical doctor for help,,had,detached 1-fam house,detached single family house,own or is buying,owned or was buying,2.0,,16.0,,,7.0,0.0,rarely,yes,no,very easy,5,,i would probably take the job with the esop (e...,france,england & wales,ireland,ireland,chooses 1 of 2+,no,no,,,no,yes,,,,,no,,,fair,about as much as you deserve,,1 gen,mother & father,not very much,"no, never",rarely,not too hard,,mild,,agree,25,,"yes, money",disagree,agree,,disagree,disagree,better,above average,strongly agree,,did not have,alternate <y>,1,not too true,,,liberal,moderate,male,,,,female,,,,,,,,,,,,somewhat likely,,,,did not have,know god 
exists,,,not at all likely,neither,disagree,disagree,,all in u.s,not legal,,yes,,very happy,very happy,,very true,,excellent,excellent,,1st person,68.0,,agree with both,neither agree nor disagree,helpful,4,3rd important,2,2,white,"2adlts,mar,0kids","married couple, no children",,not hispanic,no,,,close friend,close friend,somewhat true,close family member,other organizations,private companies,private companies,private companies,close family member,other organizations,close friend,0.0,excellent,excellent,good,,,2,,did not have,,40.0,,7.0,,,,,yes,,,clean,,,0.0,not very likely,somewhat likely,yes,yes,4,,,,very likely,not likely at all,very likely,agree,above average,,$170000 or over,average,,hospitals,,62,most of it,,,,white,,not hispanic,32,,5,5,5,,,white,,,,female,,,21.0,medical imaging and therapeutic equipment tech...,3133.0,did issp,,,,not easy,not easy at all to find similar job,not likely,very true,agree,yes,about the same,between 3 and 4 weeks,strongly agree,someone else i know,family or relative,family or relative,family or relative,no one,someone else i know,family or relative,family or relative,male,male,female,female,female,close friend,family or relative,no,,strongly agree,no,increased a little,,,,,,,,8,8.0,neither favor nor oppose,192,"1,000-1,999",rarely,rarely,rarely,high school,,12,Not applicable,,,business administration,computer science,,,,quite good,,,,married,,,married,,,,,,,,neither favor nor oppose,neither favor nor oppose,married,,neither favor nor oppose,,married,marriage between a man and a woman,neither favor nor oppose,,,,,no,,False,True,True,False,False,True,False,yes,yes,yes,very likely,yes,neither agree nor disagree,yes,child,No answer,other family,No answer,other family,No answer,friend,No answer,sibling,No answer,no,unsure,no,no,unsure,yes,"disagree, or",,0.0,,1.0,different state,in-person,2.0,,no,no,no,yes,mostly true,2,not at all,not at all,not at all,not at all,strongly agree,,,,,strongly agree,strongly agree,,too much,,too much,too 
little,,about right,,too little,,too little,,about right,too little,,too little,,too little,,about right,strongly disagree,about right,strongly agree,strongly agree,about right,,too little,strongly agree,too little,strongly agree,too little,too much,,about right,strongly agree,somewhat agree,strongly agree,very safe,rarely,everyday,,,,,10.0,,100.0,,one language,15.0,1000-1999 in range,no pets,0.0,least important,diagnostic related technologists and technicians,,,58.0,,,,63.0,,,,,,,,somewhat true,no,,,,no,,,somewhat,,did not have,1,disagree,,,bachelor,,16,,no,computer and peripheral equipment manufacturing,business services and administration managers ...,1319.0,"managers, all other",39.0,42.0,both in u.s,,somewhat better,full-time,several times a year,no partners,no partners,never,"yes, i work as part of a team",several times a year,"ind,near dem",67.7,76.9,76.8,someone else,yes,often,almost always,almost always,,,,phase one - initial cases,phone in home,somewhat hard,completely,0.0,somewhat likely,,did not have,,rarely,agree,,no,yes,agree,yes,yes,no,moderate,,,4th important,illegal under 18,,i am married and living in the same household ...,yes,several times a day,approve,,sometimes wrong,romney,don't know,59.0,79.0,0.0,agree,not too true,not at all true,strongly agree,private companies/for-profit organizations,non-profit organizations/charities/cooperatives,excellent,yes,no,yes,no,white,white,,,no,,,,,4,2,lightest,119879.4173,54480.0,no,middle atlantic,new england,nearly every week,head of household,,,,spouse,,,,,,,,"yes, in relationship",,,,,householder,,,,spouse,,,,,,,,head of household,,,,spouse,,,,,,,,protestant,catholic,,,,strong,,,,slight religious,,,spouse,,,,"hh spouse, partner",,,,,,,,,,did not have,town lt 50000,strongly agree,2nd person,high,iap,about average,,,,yes,$110000 to $129999,$25000 or more,attractive,,spouse,r. 
is household member,about the right weight,yes,strongly agree,strongly agree,agree,601,2010 fp,,satisfied,very satisfied,very satisfied,mostly satisfied,good,yes,,,,,,,,,,,,no,,7,2,7.0,6.0,not at all,69.3,86.7,68.4,yes,very serious,not at all severe,8,2.0,not at all severe,9.0,female,female,favor,not at all,women,heterosexual or straight,,,,3.0,14,yes,sometimes,disagree,,did not have,once a year,once a month,never,almost daily,,english,,disagree,,junior college,united methodist,14,,iap,liberal,no,40.0,,hospitals,medical imaging and therapeutic equipment tech...,3133.0,,,,,,,,,iap,diagnostic related technologists and technicians,,59.0,79.0,protestant,modeate spirtual,69.3,86.7,68.4,very true,no,someone else,working fulltime,"suburb, 13-100",,,often,yes,somewhat likely,yes,yes,no,no,no,very true,supervises,very true,,strongly agree,0.0,always wrong,,,2nd important,,yes,yes,,sometimes,6.0,8.0,disagree,can trust,agree,,not at all likely,1.0,no,rarely,spouse belongs,spouse or partner belongs,0.0,not very likely,rarely,,sometimes,70.0,,,2,none,probably willing,probably willing,somewhat likely,probably unwilling,probably willing,probably willing,44.0,probably unwilling,,no visitors,,,,,voted,voted,1,3301,,salaried,not at all likely,$1 million to $2 million,,52.0,170.0,,,,,,,,,no,no,yes,no,no,no,too much time or work to care for pet,NO ANSWER,no,no,often,very true,no,no,yes,no,yes,strongly agree,yes,yes,yes,yes,yes,yes,no,no,rarely,5.0,,4.0,correct,correct,incorrect,correct,correct,correct,correct,correct,correct,correct,9.0,4.0,strongly agree,strongly agree,non-profit or not-for-profit organization,most important,4.0,4.0,private,never,day shift,someone else,,working fulltime,very true,"regular, permanent employee",neither agree nor disagree,0.942997,0.942997,1.101412,6.0,0.0,,,no,"uninc,med city",2018,25,,,10.0,aries
4,no,yes,,no,no,no,yes,yes,people should not be able,,a woman and her medical professional should de...,it depends,morally opposed,no,no,,no,no,Don't know,stay the same as now,,excellent,r consents to possible data linkage,2,,,a few times a month,,71,a bad idea,,,,,,,more thn once wk,,,,,,,,,,,,0.0,,ballot c,,,fair mst of time,,inspired word,,,,does not have,,yes,,not at all likely,,,,usually trusted,oppose,does not have,,not very likely,,0,,,,upper class,,3.0,not close at all,very close,,,less often,several times a week,i do not have any adult children,my parents are no longer alive,two to three times a month,,,,,,1947.0,,,,,,not allowed,not fired,,allowed,not allowed,not allowed,not allowed,,,,,,good,,,a great deal,,a great deal,,a great deal,,,,not used,,only some,some of them,only some,a great deal,158201.8412,a great deal,only some,hardly any,a great deal,hardly any,,,a great deal,r consents to recording interview,hardly any,20-49 people,,"friendly,interested",,,,,,,,not harsh enough,,,,,,5,5,not at all dangerous,,,yes,yes,Don't know,Don't know,5,,,718,very able,very able,,graduate,"yes, sometimes",,,,,,,no,,two to three times a month,high school diploma,not very likely,,,less than once a year,,,go to a general medical doctor for help,does not have,,detached 1-fam house,detached single family house,,,1.0,,18.0,,,,,rarely,,,very easy,6,,,scotland,ireland,UNCODEABLE & IAP,scotland,chooses 1 of 2+,no,no,,,no,no,yes,,,,,,,fair,,,1 gen,mother & father,not at all,"no, never",,,,none,no,,$75+,,"yes, money",,,,,,stayed same,far above average,strongly agree,does not have,,alternate <y>,1,,,,moderate,moderate,male,,,,female,,,,,,,,no,,good > harm,no,not at all likely,not very much,hard work,does not have,,know god exists,,,not at all likely,agree,disagree,agree,,2,not legal,favor,,,,pretty happy,,,excellent,,excellent,,1st person,,,no special treatment,agree,helpful,govt do more,2nd important,4,agree with both,white,"2adlts,dkmar,0kids","unsure, no children",,not 
hispanic,no,,,no one,no one,,someone else,non-profit or religious organizations,other organizations,non-profit or religious organizations,Don't know,no one,family members or close friends,close friend,,very good,excellent,good,not wrong at all,,2,does not have,,,,,,,,,,yes,,,NO ANSWER,neither,neither hunts,,not likely at all,not likely at all,,,5,,,,DONT KNOW,somewhat likely,not likely at all,disagree,below average,,$170000 or over,higher than average,,electronic component and product manufacturing...,,68,all or almost all of it,,,,white,,,125,,,,,,,No Answer,No Answer,No Answer,,female,,,23.0,human resource managers,1232.0,did issp,,,,,,,,strongly agree,yes,somewhat worse,,,no one,no one,close friend,no one,close friend,close friend,close friend,no one,female,male,male,,,family or relative,family or relative,,,,,,not remove,not remove,not remove,not remove,not remove,not remove,routine,,,,111,,rarely,never,never,high school,,12,elementary and secondary schools,child care workers,5131.0,business administration,,,,r is not homosexual/gay,,childcare workers,35.0,28.0,married,,,married,,,,,,,,,,"not married, no cohabitating partner",,,strongly disagree,divorced,,,21.8,45.4,6.9,,yes,someone else,True,True,False,False,True,True,False,yes,no,no,not very likely,no,,yes,coworker,No answer,friend,No answer,friend,No answer,,,,,unsure,unsure,no,,,,"disagree, or",,,,,different state,over the phone,,,no,no,no,,mostly true,not at all,not at all,5,,,,,,,,strongly agree,strongly agree,,about right,,too little,about right,,about right,,too little,,too little,,about right,too little,,too little,,about right,,too little,strongly disagree,too little,strongly agree,strongly agree,too little,,too much,strongly agree,too little,strongly agree,too little,about right,,too little,strongly agree,strongly agree,strongly agree,very safe,rarely,,,,,never,,,2400.0,,one language,0.0,,1,3.0,4th important,human resources managers,,,71.0,,,,67.0,,,,,,,,,no,,,,no,,,not at all,does not 
have,,1,,yes,,high school,,12,,,postal service,mail carriers and sorting clerks,4142.0,postal service mail carriers,45.0,51.0,father only,,much better,,never,no partners,no partners,never,,several times a year,strong republican,58.4,50.1,78.4,someone else,yes,,,,never,never,sometimes,phase one - initial cases,phone in home,,completely,,not at all likely,does not have,,,never,,yes,no,yes,disagree,yes,yes,no,extrmly conservative,never,probably true,least important,illegal under 18,,i don’t have a steady partner.,yes,once a day,,,,romney,trump,53.0,73.0,0.0,,,,,government,"family, relatives or friends",very good,,,,,black,black or african american,white,,yes,cant discriminate,,,,5,no pain,,119879.4173,,no,middle atlantic,new england,never,head of household,,,,spouse,,,,,,,,"no, no relationship",,,,,householder,,,,spouse,,,,,,,,head of household,,,,spouse,,,,,,,,catholic,catholic,,,,strong,,,,very religious,,,spouse,,,,"hh spouse, partner",,,,,,,,,has,,big-city suburb,,1st person,high,iap,NO ANSWER,,,no,,,,NO ANSWER,yes,head of household,r. 
is household member,NO ANSWER,no,,,,601,2010 fp,,satisfied,,,mostly satisfied,excellent,yes,,,,,,,,,,,,,,not at all,not at all,5.0,,,68.6,79.2,76.4,no,somewhat serious,very severe,very severe,5.0,,,male,male,,not at all,man,heterosexual or straight,,,yes,3.0,14,,,disagree,does not have,,,,,,,english,,,,iap,,iap,,iap,,no,,,Not applicable,,,,allowed,allowed,allowed,,allowed,"yes, allowed",allowed,iap,,,,,,very spiritual,,,,,,,,"suburb, 13-100",,,NO ISSP,,somewhat likely,,,,,,,NO ISSP,,too high,,0.0,,,,3rd important,less than once a year,no,,,,8.0,8.0,,can trust,,,,,no,never,neither belongs,neither belongs,,somewhat likely,never,,,,,,3,more than 4 yrs,definitely willing,probably willing,,probably willing,probably willing,definitely willing,85.0,probably willing,,no visitors,,,,,voted,voted,1,3301,,,somewhat likely,,,0.0,,,,,,,,,,no,no,no,no,no,yes,,,no,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,most important,,,private,,,someone else,,retired,,,,0.942997,0.942997,1.101412,,,always wrong,,no,"uninc,med city",2018,.i,,,,cancer


### Problem 2
#### Part a
Use the `ProfileReport()` function to generate and embed an HTML formatted exploratory data analysis report in your notebook. Make sure that it includes a "Correlations" report along with "Overview" and "Variables". [1 point]

#### Part b
Looking through the HTML report you displayed in part a, how many people in the data are from New England? [1 point]

#### Part c
Looking through the HTML report you displayed in part a, which feature in the data has the highest number of missing values, and what percent of the values are missing for this feature? [1 point]

#### Part d
Looking through the HTML report you displayed in part a, which two distinct features in the data have the highest correlation? [1 point]

### Problem 3
On a primetime show on a 24-hour cable news network, two unpleasant-looking men in suits sit across a table from each other, scowling. One says "This economy is failing the middle class. The average American today is making less than \$48,000 a year." The other screams "Fake news! The typical American makes more than \$55,000 a year!" Explain, using words and code, how the data can support both of their arguments. Use the sample weights to calculate descriptive statistics that are more representative of the American adult population as a whole. [1 point]
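
As a rough illustration of how sample weights enter descriptive statistics, the sketch below computes a weighted mean and a weighted median on a small synthetic table. The column names `income` and `weight` and the helper `weighted_median` are hypothetical and are not taken from the GSS data; missing values are assumed to have been dropped beforehand.

```python
import numpy as np
import pandas as pd

# Synthetic, right-skewed incomes with made-up weights (NOT GSS data).
df = pd.DataFrame({
    "income": [20000, 30000, 40000, 50000, 200000],
    "weight": [1.0, 2.0, 1.0, 1.0, 1.0],
})

def weighted_median(values, weights):
    """Smallest value at which the cumulative weight reaches half the total."""
    values = np.asarray(values, dtype=float)
    weights = np.asarray(weights, dtype=float)
    order = np.argsort(values)
    values, weights = values[order], weights[order]
    cumulative = np.cumsum(weights)
    idx = np.searchsorted(cumulative, 0.5 * cumulative[-1])
    return values[idx]

# np.average accepts a weights argument directly.
mean_w = np.average(df["income"], weights=df["weight"])
median_w = weighted_median(df["income"], df["weight"])

print(f"weighted mean:   {mean_w:,.2f}")
print(f"weighted median: {median_w:,.2f}")
```

In a skewed distribution the two summaries can differ substantially, so it is worth stating which one a claim about the "average" or "typical" person refers to.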

### Problem 4
For each of the following parts, 
* generate a table that provides evidence about the relationship between the two features in the data that are relevant to each question, 
* interpret the table in words, 
* use a hypothesis test to assess the strength of the evidence in the table, 
* and provide a **specific and accurate** interpretation of the $p$-value associated with this hypothesis test beyond "significant or not". 

#### Part a
Is there a gender wage gap? That is, is there a difference between the average incomes of men and women? [2 points]

#### Part b
Are there different average values of occupational prestige for different levels of job satisfaction? [2 points]
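
The general workflow described above (a table of group means followed by a test) might look like the following sketch. It uses synthetic data with hypothetical column names `group` and `score`. Welch's t-test and one-way ANOVA are shown as common choices for comparing two and several group means respectively; they are not necessarily the tests the course expects.

```python
import pandas as pd
from scipy import stats

# Synthetic data (NOT GSS data): a numeric outcome measured in three groups.
df = pd.DataFrame({
    "group": ["a"] * 5 + ["b"] * 5 + ["c"] * 5,
    "score": [10, 12, 11, 13, 9, 14, 15, 13, 16, 17, 11, 12, 10, 13, 14],
})

# Table of evidence: mean, standard deviation, and count per group.
table = df.groupby("group")["score"].agg(["mean", "std", "count"])
print(table)

# Two groups: Welch's t-test (does not assume equal variances).
a = df.loc[df["group"] == "a", "score"]
b = df.loc[df["group"] == "b", "score"]
t_stat, p_value = stats.ttest_ind(a, b, equal_var=False)

# Three or more groups: one-way ANOVA across all groups.
samples = [scores for _, scores in df.groupby("group")["score"]]
f_stat, f_p = stats.f_oneway(*samples)

# The p-value is the probability of a test statistic at least this extreme
# if the population means were truly equal.
print(f"t = {t_stat:.3f}, p = {p_value:.4f}")
print(f"F = {f_stat:.3f}, p = {f_p:.4f}")
```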

### Problem 5
Report the Pearson correlations among years of education, socioeconomic status, income, occupational prestige, and a person's mother's and father's occupational prestige. Then perform a hypothesis test for the correlation between years of education and socioeconomic status and provide a **specific and accurate** interpretation of the $p$-value associated with this hypothesis test beyond "significant or not". [2 points]
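
A minimal sketch of the two pieces involved, a correlation matrix and a test for a single pair, using synthetic data. The column names `x`, `y`, and `z` are placeholders, not GSS feature names.

```python
import pandas as pd
from scipy import stats

# Synthetic data (NOT GSS data).
df = pd.DataFrame({
    "x": [1, 2, 3, 4, 5, 6],
    "y": [2, 1, 4, 3, 6, 5],
    "z": [6, 5, 4, 3, 2, 1],
})

# DataFrame.corr() computes pairwise Pearson correlations by default.
corr_matrix = df.corr()
print(corr_matrix)

# pearsonr does not handle missing values, so drop incomplete pairs first.
pair = df[["x", "y"]].dropna()
r, p_value = stats.pearsonr(pair["x"], pair["y"])

# The p-value is the probability of a sample correlation at least this far
# from zero if the true population correlation were zero.
print(f"r = {r:.4f}, p = {p_value:.4f}")
```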

### Problem 6
Create a new categorical feature for age groups, with categories for 18-35, 36-49, 50-69, and 70 and older (see the module 8 notebook for an example of how to do this). 

Then create a cross-tabulation in which the rows represent age groups and the columns represent responses to the statement that "It is much better for everyone involved if the man is the achiever outside the home and the woman takes care of the home and family." Rearrange the columns so that they are in the following order: strongly agree, agree, disagree, strongly disagree. Place row percents in the cells of this table.

Finally, use a hypothesis test that can tell us whether there is enough evidence to conclude that these two features have a relationship, and provide a specific and accurate interpretation of the $p$-value. [2 points]
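
The steps above can be sketched on synthetic data as follows. The column names `age` and `response` are hypothetical, the upper bin edge of 200 is an arbitrary cap, and the chi-square test of independence is shown as one common choice for two categorical features.

```python
import pandas as pd
from scipy import stats

# Synthetic data (NOT GSS data).
df = pd.DataFrame({
    "age": [18, 25, 35, 36, 49, 50, 69, 70, 85, 40, 22, 75],
    "response": ["disagree", "strongly disagree", "disagree", "agree",
                 "disagree", "agree", "strongly agree", "agree",
                 "strongly agree", "disagree", "strongly disagree", "agree"],
})

# pd.cut uses right-closed bins by default: (17, 35], (35, 49], (49, 69], (69, 200].
df["age_group"] = pd.cut(
    df["age"],
    bins=[17, 35, 49, 69, 200],
    labels=["18-35", "36-49", "50-69", "70 and older"],
)

# Cross-tabulate counts, then put the columns in the requested order.
order = ["strongly agree", "agree", "disagree", "strongly disagree"]
counts = pd.crosstab(df["age_group"], df["response"]).reindex(columns=order, fill_value=0)

# Row percents: divide each row by its total.
row_pct = counts.div(counts.sum(axis=1), axis=0) * 100
print(row_pct.round(1))

# Chi-square test of independence on the raw counts (not the percents).
# This toy table is far too small for the test to be trustworthy.
chi2, p_value, dof, expected = stats.chi2_contingency(counts)
print(f"chi2 = {chi2:.3f}, dof = {dof}, p = {p_value:.4f}")
```

Passing `normalize="index"` to `pd.crosstab` is an alternative way to obtain row proportions directly.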

### Problem 7
For this problem, you will conduct and interpret a correspondence analysis on the categorical features that ask respondents to state the extent to which they agree or disagree with the statements:
* "A working mother can establish just as warm and secure a relationship with her children as a mother who does not work."
* "It is much better for everyone involved if the man is the achiever outside the home and the woman takes care of the home and family."
* "Most men are better suited emotionally for politics than are most women."
* "A preschool child is likely to suffer if his or her mother works."
* "Family life often suffers because men concentrate too much on their work."

#### Part a
Conduct a correspondence analysis using the observed features listed above that measures two latent features. Plot the two latent features for each category in each of the features used in the analysis. [2 points]

#### Part b
Display the latent features for every category in the observed features, sorted by the first latent feature. Describe in words what concept this feature is attempting to measure, and give the feature a name. [2 points]

#### Part c
We can use the results of the MCA model to conduct some cool EDA. For one example, follow these steps:

1. Use the `.row_coordinates()` method to calculate values of the latent feature for every row in the data you passed to the MCA in part a. Extract the first column and store it in its own dataframe.

2. To join it with the full, cleaned GSS data based on row numbers (instead of on a primary key), use the `.join()` method. For example, if we named the cleaned GSS data `gss_clean` and if we named the dataframe in step 1 `latentfeature`, we can type
```
gss_clean = gss_clean.join(latentfeature, how="outer")
```
3. Create a cross-tabulation with age categories (that you constructed in problem 6) in the rows and sex in the columns. Instead of a frequency, place the mean value of the latent feature in the cells. 

What does this table tell you about the relationship between sex, age, and the latent feature? [2 points]
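
Steps 2 and 3 can be illustrated on a tiny synthetic example. The names `gss_clean` and `latentfeature` follow the text above, while the column names `sex`, `age_group`, and `latent` and all values are made up. One row is deliberately absent from `latentfeature` to show what an index-based outer join does when the MCA input had fewer rows than the full data.

```python
import pandas as pd

# Synthetic stand-in for the cleaned data (NOT GSS data).
gss_clean = pd.DataFrame({
    "sex": ["female", "male", "female", "male", "female", "male"],
    "age_group": ["18-35", "18-35", "36-49", "36-49", "18-35", "36-49"],
})

# Synthetic latent scores; index label 4 is missing on purpose.
latentfeature = pd.DataFrame(
    {"latent": [0.5, -0.5, 1.0, -1.0, -2.0]},
    index=[0, 1, 2, 3, 5],
)

# .join() matches rows on index labels; unmatched rows get NaN.
joined = gss_clean.join(latentfeature, how="outer")

# Mean of the latent feature in each age-by-sex cell (NaN values are skipped).
table = pd.crosstab(
    joined["age_group"],
    joined["sex"],
    values=joined["latent"],
    aggfunc="mean",
)
print(table)
```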