# About the condition: Knee Arthroplasty

Also known as knee replacement surgery.

The procedure involves cutting away damaged bone and cartilage from the thighbone, shinbone and kneecap and replacing them with an artificial joint (prosthesis) made of metal alloys, high-grade plastics and polymers.


# About this notebook

**The goal:**
- Fit some machine learning models.
- Use a train/test split to see which factors are significant in accounting for the cost of knee arthroplasty surgery.

**What is the dependent variable?**
- Average cost of knee arthroplasty

**What are the independent variables?**
- Hospital features
- etc
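
As a rough illustration of the train/test idea in the goal above, here is a minimal sketch on synthetic data. Everything in it is an assumption made for illustration: the column names `number_of_beds` and `avg_charge`, the generated values, and the linear relationship between them. It uses a plain least-squares fit from NumPy rather than any particular model this notebook may end up using.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in data: generated numbers, NOT real hospital data.
rng = np.random.default_rng(0)
n = 200
toy = pd.DataFrame({"number_of_beds": rng.integers(50, 900, size=n).astype(float)})
# Hypothetical relationship, used only so the fit has something to recover.
toy["avg_charge"] = 20000 + 50 * toy["number_of_beds"] + rng.normal(0, 1000, size=n)

# 80/20 train/test split by random row sampling.
train = toy.sample(frac=0.8, random_state=0)
test_set = toy.drop(train.index)  # named test_set to avoid shadowing `test`


def design(df):
    """Design matrix: an intercept column plus the single feature."""
    return np.column_stack([np.ones(len(df)), df["number_of_beds"].to_numpy()])


# Ordinary least squares on the training rows only.
coef, *_ = np.linalg.lstsq(design(train), train["avg_charge"].to_numpy(), rcond=None)

# Out-of-sample R^2 on the held-out rows.
y_test = test_set["avg_charge"].to_numpy()
resid = y_test - design(test_set) @ coef
r2 = 1 - (resid ** 2).sum() / ((y_test - y_test.mean()) ** 2).sum()
print(coef, r2)
```

A fit like this only gives coefficients and held-out error. Judging whether a factor is statistically significant would need standard errors and p-values, which a regression package such as statsmodels reports.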


In [7]:
import pandas as pd



In [9]:
# Reading in the cost data
test = pd.read_csv('~/002_ML/nycHospitalPricing/dataFiles/compiledMasterchargesNycHospitals.csv')
test


Unnamed: 0.1,Unnamed: 0,hospitalName,drgType,facId,providerId,hospDrg,description,charges
0,0,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,775.0,ALCOHOL ABUSE & DEPENDENCE,22066.805466
1,1,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,772.0,ALCOHOL/DRUG DEP W/REHAB DETOX,29814.074395
2,2,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,198.0,ANGINA PECTORIS & CORON ATHERO,21846.361053
3,3,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,141.0,ASTHMA,9181.951122
4,4,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,753.0,BIPOLAR DISORDERS,44274.231067
5,5,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,383.0,CELLULITIS/OTH BACT SKN INFCNS,22316.887635
6,6,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,540.0,CESAREAN DELIVERY,25299.090168
7,7,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,203.0,CHEST PAIN,18574.202489
8,8,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,140.0,CHRONIC OBSTRUCTIVE PULM DIS,13160.971033
9,9,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,774.0,COCAINE ABUSE & DEPENDENCE,17824.844856


In [17]:
test[test['hospDrg']==313.0]

Unnamed: 0.1,Unnamed: 0,hospitalName,drgType,facId,providerId,hospDrg,description,charges
189,189,BRONX-LEBANON HOSPITAL CENTER,APR-DRG,1164,330009,313.0,KNEE/LWR LIMB PROC EXCPT FOOT,83364.02625
420,420,CONEY ISLAND HOSPITAL,APR-DRG,1294,330196,313.0,KNEE & LOWER LEG PROCEDURES EXCEPT FOOT,59345.15
730,730,ELMHURST HOSPITAL CENTER,APR-DRG,1626,330128,313.0,KNEE & LOWER LEG PROCEDURES EXCEPT FOOT,59345.15
1040,1040,HARLEM HOSPITAL CENTER,APR-DRG,1445,330240,313.0,KNEE & LOWER LEG PROCEDURES EXCEPT FOOT,59345.15
1350,1350,JACOBI MEDICAL CENTER,APR-DRG,1165,330127,313.0,KNEE & LOWER LEG PROCEDURES EXCEPT FOOT,59345.15
1660,1660,KINGS COUNTY HOSPITAL CENTER,APR-DRG,1301,330202,313.0,KNEE & LOWER LEG PROCEDURES EXCEPT FOOT,59345.15
1970,1970,LINCOLN MEDICAL & MENTAL HEALTH CENTER,APR-DRG,1172,330080,313.0,KNEE & LOWER LEG PROCEDURES EXCEPT FOOT,59345.15
2280,2280,METROPOLITAN HOSPITAL CENTER,APR-DRG,1454,330199,313.0,KNEE & LOWER LEG PROCEDURES EXCEPT FOOT,59345.15
15569,15569,NORTH CENTRAL BRONX HOSPITAL,APR-DRG,1186,330385,313.0,KNEE & LOWER LEG PROCEDURES EXCEPT FOOT,59345.15
15879,15879,WOODHULL MEDICAL AND MENTAL HEALTH CENTER,APR-DRG,1692,330396,313.0,KNEE & LOWER LEG PROCEDURES EXCEPT FOOT,59345.15
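
The filter above can be sketched on a self-contained frame. The rows below are copied from the outputs shown in this notebook (one asthma row is included for contrast); the variable names `charges_sample` and `knee` are introduced here for illustration only.

```python
import pandas as pd

# A handful of rows copied from the cell outputs above.
charges_sample = pd.DataFrame({
    "hospitalName": [
        "BRONX-LEBANON HOSPITAL CENTER",
        "BRONX-LEBANON HOSPITAL CENTER",
        "CONEY ISLAND HOSPITAL",
        "ELMHURST HOSPITAL CENTER",
    ],
    "providerId": [330009, 330009, 330196, 330128],
    "hospDrg": [141.0, 313.0, 313.0, 313.0],
    "charges": [9181.951122, 83364.02625, 59345.15, 59345.15],
})

# Keep only APR-DRG 313 rows and the columns needed for modelling.
knee = charges_sample.loc[
    charges_sample["hospDrg"] == 313.0,
    ["hospitalName", "providerId", "charges"],
]

# Count how many distinct charge values appear across hospitals.
n_distinct = knee["charges"].nunique()
print(knee)
print(n_distinct)
```

In the full output above, nine of the ten hospitals list the identical charge of 59345.15, so the dependent variable may show very little variation across this group. That is worth checking before fitting any model.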


In [22]:
# Reading in the hospital data 
hospFeat = pd.read_json("https://data.cms.gov/resource/8rp3-rzmi.json?state_code=NY")
hospFeat.head()


Unnamed: 0,accounts_payable,accounts_receivable,allowable_dsh_percentage,buildings,cash_on_hand_and_in_banks,ccn_facility_type,city,combined_outpatient_inpatient,contract_labor,cost_of_charity_care,...,total_salaries_adjusted,total_salaries_from_worksheet,total_unreimbursed_and,type_of_control,unsecured_loans,wage_related_costs_core,wage_related_costs_for_interns,wage_related_costs_for_part,wage_related_costs_rhc_fqhc,zip_code
0,,,,,,PH,ORANGEBURG,22147704.0,,,...,93088364.0,93088364.0,-3912103.0,10,,49192404.0,,,,10962-1196
1,,,,,,PH,ROCHESTER,1939591.0,,,...,34442946.0,34442946.0,-6429628.0,10,,18333880.0,,,,14620-3965
2,,,,,,PH,UTICA,6125975.0,,,...,19373691.0,19373691.0,,10,,10361050.0,,,,13502-3803
3,,,,,,PH,DIX HILLS,,,,...,,,,10,,,,,,11746-5861
4,,,,,,STH,ROCHESTER,,,,...,,,,5,,,,,,14620-4629


In [25]:
hospFeat.dtypes

accounts_payable                    float64
accounts_receivable                 float64
allowable_dsh_percentage            float64
buildings                           float64
cash_on_hand_and_in_banks           float64
ccn_facility_type                    object
city                                 object
combined_outpatient_inpatient       float64
contract_labor                      float64
cost_of_charity_care                float64
cost_of_uncompensated_care          float64
cost_to_charge_ratio                float64
county                               object
deferred_income                     float64
depreciation_cost                   float64
disproporationate_share             float64
drg_amounts_after_october           float64
drg_amounts_before_october          float64
fiscal_year_begin_date               object
fiscal_year_end_date                 object
fixed_equipment                     float64
fte_employees_on_payroll            float64
general_fund_balance            

HOSPITAL FEATURES
- Provider CCN (CMS Certification Number, same as provider id)
- Type of Control (type of hospital, e.g. nonprofit / government)
- Number of beds (only beds for lodging patients in acute or long-term stays)
- Total Days (V + XVIII + XIX + Unknown) - Total number of inpatient days for all classes of patients for each component
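
Since the Provider CCN matches `providerId` in the charges data, the two tables could be joined on that key. The sketch below is one possible way to do it, not necessarily how this notebook proceeds. The column names `provider_ccn` and `number_of_beds` and the bed counts are placeholders assumed for illustration; the provider ids and charges are taken from the outputs above.

```python
import pandas as pd

# Charges rows taken from the DRG 313 output above.
cost = pd.DataFrame({
    "providerId": [330009, 330196],
    "charges": [83364.02625, 59345.15],
})

# Placeholder feature table: the `provider_ccn` column name and the
# bed counts are assumptions, not values from the CMS data.
feat = pd.DataFrame({
    "provider_ccn": ["330009", "330196", "330128"],
    "number_of_beds": [100.0, 200.0, 300.0],
})

# Normalise both keys to six-character strings so 330009 matches "330009".
cost["ccn"] = cost["providerId"].astype(str).str.zfill(6)
feat["ccn"] = feat["provider_ccn"].astype(str).str.zfill(6)

# Left join keeps every charges row; validate guards against duplicate CCNs
# in the feature table, and indicator flags rows that found no match.
merged = cost.merge(feat, on="ccn", how="left", validate="many_to_one", indicator=True)
print(merged[["ccn", "charges", "number_of_beds", "_merge"]])
```

Checking the `_merge` column afterwards shows which hospitals in the charges data have no matching feature row.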

COST FEATURES
- Cost of Charity Care (cost of free, essential medical services rendered to people who can't pay)
- Total Bad Debt Expense (cost of hospital services expected to not be paid, does not include doctor & other professional fees)
- Total Costs (total hospital cost)
- DRG Amounts Other Than Outlier Payments (DRG payment paid for Prospective Payment System (PPS) discharges)
- Total IME payment (additional amount a teaching hospital 'earns' on top of each Medicare case)
- Disproportionate Share Adjustment (percentage add-on to the DRG payment, additional compensation for treating low-income patients)
- Net Patient Revenue (net income earned for each patient seen) 
- Net Revenue from Medicaid (inclusive of DSH & IME revenue)
- Medicaid Charges (total revenue from Medicaid)
- Net Revenue from Stand-Alone SCHIP (SCHIP = The State Children’s Health Insurance Program, for low income kids)
- Stand-Alone SCHIP Charges (Total revenue from The State Children’s Health Insurance Program) 
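
Several fields in the `hospFeat` preview are mostly empty, so it may help to check how complete each candidate feature is before using it. The sketch below uses values copied from the five preview rows shown earlier. The 0.5 cutoff and the names `preview`, `missing_share` and `usable` are arbitrary choices for illustration.

```python
import numpy as np
import pandas as pd

# Values copied from the first five rows of the hospFeat preview.
preview = pd.DataFrame({
    "type_of_control": [10, 10, 10, 10, 5],
    "cost_of_charity_care": [np.nan] * 5,
    "total_salaries_adjusted": [93088364.0, 34442946.0, 19373691.0, np.nan, np.nan],
})

# Share of missing values per column (0.0 = complete, 1.0 = entirely empty).
missing_share = preview.isna().mean()

# Keep columns whose missing share is at or below an arbitrary cutoff.
max_missing = 0.5
usable = missing_share[missing_share <= max_missing].index.tolist()
print(missing_share)
print(usable)
```

Five rows are far too few to judge the full table; the same check would be run on all of `hospFeat` in practice.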
