# Opioid Study Data Collection
#### Defined study plan: https://docs.google.com/document/d/1kkTbheDP5vS8rh_C6W1U7qzthkScqrzKBjd0jSY3j48/edit
The main pieces consist of:

1. Inclusion/Exclusion Criteria
2. Demographic Data
3. Clinical Data

#### About: 
- Below, I use PostgreSQL to query MIMIC-III through psycopg2, a PostgreSQL adapter for Python
- Each collection step is outlined below, and the steps build on each other
- Each step includes some relevant info to give context. In the SQL code, a line that starts with "--" is a readable comment that clarifies what the code does
- Once the SQL code runs, the result is rendered as a pandas DataFrame; pandas is a flexible library for data manipulation. From there, I can export the data to a common output format (.csv, .xlsx, .tsv, etc.)
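
As a minimal sketch of the export step mentioned in the last bullet (the frame below is a toy stand-in, not study data), a DataFrame can be written as CSV or TSV with `DataFrame.to_csv`. Excel output through `to_excel` works similarly but needs an Excel engine such as openpyxl installed.

```python
import io
import pandas as pd

# Toy frame standing in for a query result (values are illustrative only)
df_example = pd.DataFrame({"subject_id": [3, 4], "age": [76.52, 47.84]})

# Write to in-memory buffers; pass a file path instead to write to disk
csv_buffer = io.StringIO()
df_example.to_csv(csv_buffer, index=False)            # comma-separated

tsv_buffer = io.StringIO()
df_example.to_csv(tsv_buffer, sep="\t", index=False)  # tab-separated

csv_text = csv_buffer.getvalue()
tsv_text = tsv_buffer.getvalue()
```

Passing `index=False` leaves out the pandas row index, which shows up as the `Unnamed: 0` column when a file written with the index is read back.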

In [525]:
# Import libraries
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import psycopg2
from IPython.display import display, HTML # used to print out pretty pandas dataframes
display(HTML("<style>.container { width:100% !important; }</style>")) # widest display
import matplotlib.dates as dates
import finddrugs
import matplotlib.lines as mlines
import queries
import finddrugs_refactor as finder
import notebook_helper as helper

pd.options.display.max_colwidth = 1000
pd.options.display.width = 1000
pd.options.display.max_columns = 1000
pd.options.display.max_rows = 3


%matplotlib inline
plt.style.use('ggplot') 

# specify user/password/where the database is
sqluser = 'eightiesfanjan'
sqlpass = 'squiggle'
dbname = 'mimic'
schema_name = 'mimiciii'
host = 'localhost'

query_schema = 'SET search_path to ' + schema_name + ';'

# connect to the database
con = psycopg2.connect(dbname=dbname, user=sqluser, password=sqlpass, host=host)
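
The cells below call `helper.run_query`, whose source is not shown in this notebook. The following is only a sketch of what such a helper might look like, assuming it wraps `pandas.read_sql_query`; the name `run_query_sketch`, the `setup_sql` parameter, and the SQLite stand-in database are all hypothetical and exist only so the example runs without a MIMIC-III install.

```python
import sqlite3
import pandas as pd

def run_query_sketch(query, db_connection, setup_sql=""):
    """Run `query` on `db_connection` and return the result as a DataFrame.

    `setup_sql` is an optional statement prefix. On PostgreSQL it could be
    the notebook's `query_schema` string ('SET search_path to mimiciii;').
    """
    return pd.read_sql_query(setup_sql + query, db_connection)

# Stand-in database (assumption: SQLite replaces PostgreSQL for the demo)
demo_con = sqlite3.connect(":memory:")
demo_con.execute("CREATE TABLE icustays (subject_id INTEGER, hadm_id INTEGER)")
demo_con.execute("INSERT INTO icustays VALUES (3, 145834), (4, 185777)")

df_demo = run_query_sketch("SELECT * FROM icustays ORDER BY subject_id", demo_con)
```

With the real connection, the call would presumably look like `run_query_sketch(some_sql, con, setup_sql=query_schema)`.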

In [614]:
reload(queries)
reload(helper)
reload(finder)

<module 'finddrugs_refactor' from 'finddrugs_refactor.pyc'>

## Phase 1: Inclusion/Exclusion Criteria
- added first phase outcomes

### Step 1) Get earliest ICU admits and admits with 180+ day intervals
- 61,532 original records from the icustays table (this takes into account patients admitted directly to the ICU and those who transferred between floors)
- After filtering for each patient's earliest record or records with 180+ day intervals, there are 51,373 records
- More info on icustays table: https://mimic.physionet.org/mimictables/icustays/
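
The actual SQL lives in `queries.unique_icu_admit` and is not shown here. As a rough pandas sketch of the same idea, assuming the 180-day interval is measured from the previous stay's out time to the current stay's in time (toy data, column names borrowed from the output below):

```python
import pandas as pd

stays = pd.DataFrame({
    "subject_id": [1, 1, 1, 2],
    "icu_intime": pd.to_datetime(
        ["2100-01-01", "2100-02-01", "2100-12-01", "2100-03-01"]),
    "icu_outtime": pd.to_datetime(
        ["2100-01-05", "2100-02-03", "2100-12-04", "2100-03-02"]),
})

stays = stays.sort_values(["subject_id", "icu_intime"]).reset_index(drop=True)
# Out time of the same patient's previous ICU stay (NaT for the first stay)
stays["last_out_time"] = stays.groupby("subject_id")["icu_outtime"].shift(1)
stays["diff_last_outtime"] = stays["icu_intime"] - stays["last_out_time"]

is_first_stay = stays["last_out_time"].isna()
# Comparisons against NaT evaluate to False, so first stays rely on the mask above
long_gap = stays["diff_last_outtime"] >= pd.Timedelta(days=180)
kept = stays[is_first_stay | long_gap]
```

In this toy data, patient 1's second stay (27 days after the first) is dropped, while the first and third stays and patient 2's only stay are kept.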

In [615]:
df_step1 = helper.run_query(query=queries.unique_icu_admit, db_connection=con)
df_step1

Unnamed: 0,subject_id,hadm_id,icu_intime,icu_outtime,last_out_time,diff_last_outtime
0,82574,118464,2100-06-07 20:00:22,2100-06-08 14:59:31,,
...,...,...,...,...,...,...
51372,5216,130232,2114-02-26 05:41:00,NaT,,



### Step 2) Get 18+ year olds with no death within 24 hours of admission
- From 51,373 records in Step 1, we go down to 42,211 (an 18% decrease) after filtering for patients aged 18+ with no death within 24 hours
- More info on patients table: https://mimic.physionet.org/mimictables/patients/
- More info on icustays table: https://mimic.physionet.org/mimictables/icustays/
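
A hedged pandas sketch of this filter, assuming `diff_death_admit_hrs` is empty when no death is recorded and that "within 24 hours" means 24 hours or less (the real logic is in `queries.adults_with_no_death_within_day`). The last line reproduces the percent decrease quoted above.

```python
import pandas as pd

cohort = pd.DataFrame({
    "subject_id": [1, 2, 3, 4],
    "age": [76.52, 17.10, 41.79, 60.00],
    # Hours from admission to death; NaN when no death is recorded
    "diff_death_admit_hrs": [5668.83, float("nan"), 106.88, 12.5],
})

valid_age = cohort["age"] >= 18
# Keep patients with no recorded death, or death more than 24 hours after admission
valid_death = cohort["diff_death_admit_hrs"].isna() | (cohort["diff_death_admit_hrs"] > 24)
adults_alive_24h = cohort[valid_age & valid_death]

# Percent decrease from 51,373 records (Step 1) to 42,211 records (Step 2)
pct_decrease = (51373 - 42211) / 51373.0 * 100
```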


In [512]:
df_step2 = helper.run_query(query=queries.adults_with_no_death_within_day, db_connection=con)
df_step2

Unnamed: 0,row_id,subject_id,hadm_id,intime,outtime,age,diff_death_admit_hrs,diff_last_outtime
0,2,3,145834,2101-10-20 19:10:11,2101-10-26 20:43:09,76.52,5668.830278,
1,3,4,185777,2191-03-16 00:29:31,2191-03-17 16:46:31,47.84,,
2,5,6,107064,2175-05-30 21:30:54,2175-06-03 13:39:54,65.94,,
3,9,9,150750,2149-11-09 13:07:02,2149-11-14 20:52:14,41.79,106.882778,
4,11,11,194540,2178-04-16 06:19:32,2178-04-17 20:21:05,50.15,5081.674444,
...,...,...,...,...,...,...,...,...
42206,61528,99985,176670,2181-01-29 05:33:34,2181-02-09 12:45:20,53.81,,
42207,61529,99991,151118,2184-12-28 17:30:58,2184-12-31 20:56:20,47.73,,
42208,61530,99992,197084,2144-07-25 18:04:42,2144-07-27 17:27:55,65.77,,
42209,61531,99995,137810,2147-02-08 13:53:58,2147-02-10 17:46:30,88.70,5578.100556,



### Step 3) Map patients to ALL relevant ICD9 codes in relevant admission
- Admits mapped to an array of ICD9 codes, ordered by priority level. 
- Result: 42,211 records
- The first ICD9 code in this array is the patient's reason for admission
    - See here for more understanding: https://github.com/MIT-LCP/mimic-code/issues/199
- To keep things readable, I add a table called flags whose binary values (e.g. valid_icu_admit, valid_age, valid_death) are used to filter people out
- More info on patients table: https://mimic.physionet.org/mimictables/patients/
- More info on icustays table: https://mimic.physionet.org/mimictables/icustays/
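
The aggregation itself happens in SQL (`queries.aggregate_icd9_codes`, not shown). As an illustration only, the same "one ordered array of codes per admission" shape can be sketched in pandas, assuming a long table with one row per diagnosis and a `seq_num` priority column; `primary_icd9` is a hypothetical column name.

```python
import pandas as pd

# Toy long-format diagnoses table (one row per admission/code pair)
diagnoses = pd.DataFrame({
    "hadm_id": [150750, 150750, 194540, 150750],
    "seq_num": [2, 1, 1, 3],
    "icd9_code": ["5070", "431", "1913", "4280"],
})

codes_per_admit = (
    diagnoses.sort_values(["hadm_id", "seq_num"])   # priority order within admission
    .groupby("hadm_id")
    .agg(icd9_codes=("icd9_code", list), seq_num=("seq_num", list))
    .reset_index()
)
# The first element of each array is treated as the reason for admission
codes_per_admit["primary_icd9"] = codes_per_admit["icd9_codes"].apply(lambda codes: codes[0])
```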


In [514]:
df_step3 = helper.run_query(query=queries.aggregate_icd9_codes, db_connection=con)
df_step3

Unnamed: 0,row_id,subject_id,hadm_id,intime,outtime,age,diff_death_admit_hrs,diff_last_outtime,icd9_codes,seq_num,short_titles,long_titles,valid_icu_admit,valid_age,valid_death
0,2,3,145834,2101-10-20 19:10:11,2101-10-26 20:43:09,76.52,5668.830278,,"[0389, 78559, 5849, 4275, 41071, 4280, 6826, 4254, 2639]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Septicemia NOS, Shock w/o trauma NEC, Acute kidney failure NOS, Cardiac arrest, Subendo infarct, initial, CHF NOS, Cellulitis of leg, Prim cardiomyopathy NEC, Protein-cal malnutr NOS]","[Unspecified septicemia, Other shock without mention of trauma, Acute kidney failure, unspecified, Cardiac arrest, Subendocardial infarction, initial episode of care, Congestive heart failure, unspecified, Cellulitis and abscess of leg, except foot, Other primary cardiomyopathies, Unspecified protein-calorie malnutrition]",1,1,1
1,3,4,185777,2191-03-16 00:29:31,2191-03-17 16:46:31,47.84,,,"[042, 1363, 7994, 2763, 7907, 5715, 04111, V090, E9317]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Human immuno virus dis, Pneumocystosis, Cachexia, Alkalosis, Bacteremia, Cirrhosis of liver NOS, Mth sus Stph aur els/NOS, Inf mcrg rstn pncllins, Adv eff antiviral drugs]","[Human immunodeficiency virus [HIV] disease, Pneumocystosis, Cachexia, Alkalosis, Bacteremia, Cirrhosis of liver without mention of alcohol, Methicillin susceptible Staphylococcus aureus in conditions classified elsewhere and of unspecified site, Infection with microorganisms resistant to penicillins, Antiviral drugs causing adverse effects in therapeutic use]",1,1,1
2,5,6,107064,2175-05-30 21:30:54,2175-06-03 13:39:54,65.94,,,"[40391, 9972, 2767, 2859, 2753, V1582]","[1, 3, 5, 6, 7, 8]","[Hyp kid NOS w cr kid V, Surg comp-peri vasc syst, Hyperpotassemia, Anemia NOS, Dis phosphorus metabol, History of tobacco use]","[Hypertensive chronic kidney disease, unspecified, with chronic kidney disease stage V or end stage renal disease, Peripheral vascular complications, not elsewhere classified, Hyperpotassemia, Anemia, unspecified, Disorders of phosphorus metabolism, Personal history of tobacco use]",1,1,1
3,9,9,150750,2149-11-09 13:07:02,2149-11-14 20:52:14,41.79,106.882778,,"[431, 5070, 4280, 5849, 4019]","[1, 2, 3, 4, 6]","[Intracerebral hemorrhage, Food/vomit pneumonitis, CHF NOS, Acute kidney failure NOS, Hypertension NOS]","[Intracerebral hemorrhage, Pneumonitis due to inhalation of food or vomitus, Congestive heart failure, unspecified, Acute kidney failure, unspecified, Unspecified essential hypertension]",1,1,1
4,11,11,194540,2178-04-16 06:19:32,2178-04-17 20:21:05,50.15,5081.674444,,[1913],[1],[Mal neo parietal lobe],[Malignant neoplasm of parietal lobe],1,1,1
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
42206,61528,99985,176670,2181-01-29 05:33:34,2181-02-09 12:45:20,53.81,,,"[0389, 51881, 48241, 4870, 78552, V4281, 99592, 2449, 2724, 2859, 53081, V1072, 23871]","[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13]","[Septicemia NOS, Acute respiratry failure, Meth sus pneum d/t Staph, Influenza with pneumonia, Septic shock, Trnspl status-bne marrow, Severe sepsis, Hypothyroidism NOS, Hyperlipidemia NEC/NOS, Anemia NOS, Esophageal reflux, Hx-hodgkin's disease, Essntial thrombocythemia]","[Unspecified septicemia, Acute respiratory failure, Methicillin susceptible pneumonia due to Staphylococcus aureus, Influenza with pneumonia, Septic shock, Bone marrow replaced by transplant, Severe sepsis, Unspecified acquired hypothyroidism, Other and unspecified hyperlipidemia, Anemia, unspecified, Esophageal reflux, Personal history of hodgkin's disease, Essential thrombocythemia]",1,1,1
42207,61529,99991,151118,2184-12-28 17:30:58,2184-12-31 20:56:20,47.73,,,"[56211, 0389, 5570, 5849, 99592, 56081, 78959, 5538, 7885, 40291, 4280, 71947, 5644, 25000, V0254, E8788, 27651]","[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17]","[Dvrtcli colon w/o hmrhg, Septicemia NOS, Ac vasc insuff intestine, Acute kidney failure NOS, Severe sepsis, Intestinal adhes w obstr, Ascites NEC, Hernia NEC, Oliguria & anuria, Hyp ht dis NOS w ht fail, CHF NOS, Joint pain-ankle, Postop GI funct dis NEC, DMII wo cmp nt st uncntr, Meth resis Staph carrier, Abn react-surg proc NEC, Dehydration]","[Diverticulitis of colon (without mention of hemorrhage), Unspecified septicemia, Acute vascular insufficiency of intestine, Acute kidney failure, unspecified, Severe sepsis, Intestinal or peritoneal adhesions with obstruction (postoperative) (postinfection), Other ascites, Hernia of other specified sites without mention of obstruction or gangrene, Oliguria and anuria, Unspecified hypertensive heart disease with heart failure, Congestive heart failure, unspecified, Pain in joint, ankle and foot, Other postoperative functional disorders, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Carrier or suspected carrier of Methicillin resistant Staphylococcus aureus, Other specified surgical operations and procedures causing abnormal patient reaction, or later complication, without mention of misadventure at time of operation, Dehydration]",1,1,1
42208,61530,99992,197084,2144-07-25 18:04:42,2144-07-27 17:27:55,65.77,,,"[9999, 56881, 5772, 2851, 5849, 5799, 72992, 53081, 4019, 2721, 5699, 3004]","[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12]","[Complic med care NEC/NOS, Hemoperitoneum, Pancreat cyst/pseudocyst, Ac posthemorrhag anemia, Acute kidney failure NOS, Intest malabsorption NOS, Nontrauma hema soft tiss, Esophageal reflux, Hypertension NOS, Pure hyperglyceridemia, Intestinal disorder NOS, Dysthymic disorder]","[Other and unspecified complications of medical care, not elsewhere classified, Hemoperitoneum (nontraumatic), Cyst and pseudocyst of pancreas, Acute posthemorrhagic anemia, Acute kidney failure, unspecified, Unspecified intestinal malabsorption, Nontraumatic hematoma of soft tissue, Esophageal reflux, Unspecified essential hypertension, Pure hyperglyceridemia, Unspecified disorder of intestine, Dysthymic disorder]",1,1,1
42209,61531,99995,137810,2147-02-08 13:53:58,2147-02-10 17:46:30,88.70,5578.100556,,"[4414, 42833, 99812, 2851, 4241, 25000, 99811, 9961, E8798, 2724, V4581, 4280, V103, V1582, V5861, 4400, 41401]","[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17]","[Abdom aortic aneurysm, Ac on chr diast hrt fail, Hematoma complic proc, Ac posthemorrhag anemia, Aortic valve disorder, DMII wo cmp nt st uncntr, Hemorrhage complic proc, Malfunc vasc device/graf, Abn react-procedure NEC, Hyperlipidemia NEC/NOS, Aortocoronary bypass, CHF NOS, Hx of breast malignancy, History of tobacco use, Long-term use anticoagul, Aortic atherosclerosis, Crnry athrscl natve vssl]","[Abdominal aneurysm without mention of rupture, Acute on chronic diastolic heart failure, Hematoma complicating a procedure, Acute posthemorrhagic anemia, Aortic valve disorders, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Hemorrhage complicating a procedure, Mechanical complication of other vascular device, implant, and graft, Other specified procedures as the cause of abnormal reaction of patient, or of later complication, without mention of misadventure at time of procedure, Other and unspecified hyperlipidemia, Aortocoronary bypass status, Congestive heart failure, unspecified, Personal history of malignant neoplasm of breast, Personal history of tobacco use, Long-term (current) use of anticoagulants, Atherosclerosis of aorta, Coronary atherosclerosis of native coronary artery]",1,1,1



### Step 4) Get those with no opioid abuse, no anoxic brain injury, no cancer
- As of 12/11/2018, we are only filtering out opioid abuse, anoxic brain injury, and cancer
- Result: 36,440 records
- Opioid/heroin abuse (or poisoning) ICD9 codes
    - https://www.ncbi.nlm.nih.gov/books/NBK367628/table/sb202.t4/?report=objectonly
- More info on diagnoses table: https://mimic.physionet.org/mimictables/d_icd_diagnoses/
- More info on patients table: https://mimic.physionet.org/mimictables/patients/
- More info on icustays table: https://mimic.physionet.org/mimictables/icustays/
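
To illustrate how the three exclusion flags could be derived from an admission's code array, here is a sketch in plain Python. The prefix lists are assumptions chosen for illustration (ICD-9 304.0/304.7 opioid dependence, 305.5 opioid abuse, 965.0 opiate poisoning, 348.1 anoxic brain damage, categories 140-208 for malignant neoplasms); the study's actual lists are in `queries.filter_exclusion_criteria` and the reference table linked above, and may differ.

```python
# Illustrative ICD-9 prefixes only, NOT the study's definitive code lists
OPIATE_PREFIXES = ("3040", "3047", "3055", "9650")  # dependence, abuse, poisoning
ANOXIC_PREFIXES = ("3481",)                          # anoxic brain damage

def is_malignant_neoplasm(code):
    """Assume 'cancer' means ICD-9 categories 140-208 (malignant neoplasms)."""
    category = code[:3]
    # V and E codes (e.g. 'V103') are skipped because their category is not numeric
    return category.isdigit() and 140 <= int(category) <= 208

def exclusion_flags(icd9_codes):
    """Return 0/1 flags named after the notebook's output columns."""
    return {
        "opiate_abuse": int(any(c.startswith(OPIATE_PREFIXES) for c in icd9_codes)),
        "has_anoxic_brain": int(any(c.startswith(ANOXIC_PREFIXES) for c in icd9_codes)),
        "has_cancer": int(any(is_malignant_neoplasm(c) for c in icd9_codes)),
    }

def passes_exclusions(icd9_codes):
    """True when none of the three exclusion flags is set."""
    return not any(exclusion_flags(icd9_codes).values())
```

For example, `passes_exclusions(["0389", "1913"])` is false under these assumptions because 1913 falls in the malignant neoplasm range.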


In [517]:
df_step4 = helper.run_query(query=queries.filter_exclusion_criteria, db_connection=con)
df_step4

Unnamed: 0,row_id,subject_id,hadm_id,intime,outtime,age,diff_death_admit_hrs,diff_last_outtime,icd9_codes,seq_num,short_titles,long_titles,valid_icu_admit,valid_age,valid_death,opiate_abuse,has_anoxic_brain,has_cancer
0,2,3,145834,2101-10-20 19:10:11,2101-10-26 20:43:09,76.52,5668.830278,,"[0389, 78559, 5849, 4275, 41071, 4280, 6826, 4254, 2639]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Septicemia NOS, Shock w/o trauma NEC, Acute kidney failure NOS, Cardiac arrest, Subendo infarct, initial, CHF NOS, Cellulitis of leg, Prim cardiomyopathy NEC, Protein-cal malnutr NOS]","[Unspecified septicemia, Other shock without mention of trauma, Acute kidney failure, unspecified, Cardiac arrest, Subendocardial infarction, initial episode of care, Congestive heart failure, unspecified, Cellulitis and abscess of leg, except foot, Other primary cardiomyopathies, Unspecified protein-calorie malnutrition]",1,1,1,0,0,0
1,3,4,185777,2191-03-16 00:29:31,2191-03-17 16:46:31,47.84,,,"[042, 1363, 7994, 2763, 7907, 5715, 04111, V090, E9317]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Human immuno virus dis, Pneumocystosis, Cachexia, Alkalosis, Bacteremia, Cirrhosis of liver NOS, Mth sus Stph aur els/NOS, Inf mcrg rstn pncllins, Adv eff antiviral drugs]","[Human immunodeficiency virus [HIV] disease, Pneumocystosis, Cachexia, Alkalosis, Bacteremia, Cirrhosis of liver without mention of alcohol, Methicillin susceptible Staphylococcus aureus in conditions classified elsewhere and of unspecified site, Infection with microorganisms resistant to penicillins, Antiviral drugs causing adverse effects in therapeutic use]",1,1,1,0,0,0
2,5,6,107064,2175-05-30 21:30:54,2175-06-03 13:39:54,65.94,,,"[40391, 9972, 2767, 2859, 2753, V1582]","[1, 3, 5, 6, 7, 8]","[Hyp kid NOS w cr kid V, Surg comp-peri vasc syst, Hyperpotassemia, Anemia NOS, Dis phosphorus metabol, History of tobacco use]","[Hypertensive chronic kidney disease, unspecified, with chronic kidney disease stage V or end stage renal disease, Peripheral vascular complications, not elsewhere classified, Hyperpotassemia, Anemia, unspecified, Disorders of phosphorus metabolism, Personal history of tobacco use]",1,1,1,0,0,0
3,9,9,150750,2149-11-09 13:07:02,2149-11-14 20:52:14,41.79,106.882778,,"[431, 5070, 4280, 5849, 4019]","[1, 2, 3, 4, 6]","[Intracerebral hemorrhage, Food/vomit pneumonitis, CHF NOS, Acute kidney failure NOS, Hypertension NOS]","[Intracerebral hemorrhage, Pneumonitis due to inhalation of food or vomitus, Congestive heart failure, unspecified, Acute kidney failure, unspecified, Unspecified essential hypertension]",1,1,1,0,0,0
4,13,13,143045,2167-01-08 18:44:25,2167-01-12 10:43:31,39.86,,,"[41401, 4111, 25000, 4019, 2720]","[1, 2, 3, 4, 5]","[Crnry athrscl natve vssl, Intermed coronary synd, DMII wo cmp nt st uncntr, Hypertension NOS, Pure hypercholesterolem]","[Coronary atherosclerosis of native coronary artery, Intermediate coronary syndrome, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Unspecified essential hypertension, Pure hypercholesterolemia]",1,1,1,0,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
36435,61528,99985,176670,2181-01-29 05:33:34,2181-02-09 12:45:20,53.81,,,"[0389, 51881, 48241, 4870, 78552, V4281, 99592, 2449, 2724, 2859, 53081, V1072, 23871]","[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13]","[Septicemia NOS, Acute respiratry failure, Meth sus pneum d/t Staph, Influenza with pneumonia, Septic shock, Trnspl status-bne marrow, Severe sepsis, Hypothyroidism NOS, Hyperlipidemia NEC/NOS, Anemia NOS, Esophageal reflux, Hx-hodgkin's disease, Essntial thrombocythemia]","[Unspecified septicemia, Acute respiratory failure, Methicillin susceptible pneumonia due to Staphylococcus aureus, Influenza with pneumonia, Septic shock, Bone marrow replaced by transplant, Severe sepsis, Unspecified acquired hypothyroidism, Other and unspecified hyperlipidemia, Anemia, unspecified, Esophageal reflux, Personal history of hodgkin's disease, Essential thrombocythemia]",1,1,1,0,0,0
36436,61529,99991,151118,2184-12-28 17:30:58,2184-12-31 20:56:20,47.73,,,"[56211, 0389, 5570, 5849, 99592, 56081, 78959, 5538, 7885, 40291, 4280, 71947, 5644, 25000, V0254, E8788, 27651]","[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17]","[Dvrtcli colon w/o hmrhg, Septicemia NOS, Ac vasc insuff intestine, Acute kidney failure NOS, Severe sepsis, Intestinal adhes w obstr, Ascites NEC, Hernia NEC, Oliguria & anuria, Hyp ht dis NOS w ht fail, CHF NOS, Joint pain-ankle, Postop GI funct dis NEC, DMII wo cmp nt st uncntr, Meth resis Staph carrier, Abn react-surg proc NEC, Dehydration]","[Diverticulitis of colon (without mention of hemorrhage), Unspecified septicemia, Acute vascular insufficiency of intestine, Acute kidney failure, unspecified, Severe sepsis, Intestinal or peritoneal adhesions with obstruction (postoperative) (postinfection), Other ascites, Hernia of other specified sites without mention of obstruction or gangrene, Oliguria and anuria, Unspecified hypertensive heart disease with heart failure, Congestive heart failure, unspecified, Pain in joint, ankle and foot, Other postoperative functional disorders, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Carrier or suspected carrier of Methicillin resistant Staphylococcus aureus, Other specified surgical operations and procedures causing abnormal patient reaction, or later complication, without mention of misadventure at time of operation, Dehydration]",1,1,1,0,0,0
36437,61530,99992,197084,2144-07-25 18:04:42,2144-07-27 17:27:55,65.77,,,"[9999, 56881, 5772, 2851, 5849, 5799, 72992, 53081, 4019, 2721, 5699, 3004]","[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12]","[Complic med care NEC/NOS, Hemoperitoneum, Pancreat cyst/pseudocyst, Ac posthemorrhag anemia, Acute kidney failure NOS, Intest malabsorption NOS, Nontrauma hema soft tiss, Esophageal reflux, Hypertension NOS, Pure hyperglyceridemia, Intestinal disorder NOS, Dysthymic disorder]","[Other and unspecified complications of medical care, not elsewhere classified, Hemoperitoneum (nontraumatic), Cyst and pseudocyst of pancreas, Acute posthemorrhagic anemia, Acute kidney failure, unspecified, Unspecified intestinal malabsorption, Nontraumatic hematoma of soft tissue, Esophageal reflux, Unspecified essential hypertension, Pure hyperglyceridemia, Unspecified disorder of intestine, Dysthymic disorder]",1,1,1,0,0,0
36438,61531,99995,137810,2147-02-08 13:53:58,2147-02-10 17:46:30,88.70,5578.100556,,"[4414, 42833, 99812, 2851, 4241, 25000, 99811, 9961, E8798, 2724, V4581, 4280, V103, V1582, V5861, 4400, 41401]","[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17]","[Abdom aortic aneurysm, Ac on chr diast hrt fail, Hematoma complic proc, Ac posthemorrhag anemia, Aortic valve disorder, DMII wo cmp nt st uncntr, Hemorrhage complic proc, Malfunc vasc device/graf, Abn react-procedure NEC, Hyperlipidemia NEC/NOS, Aortocoronary bypass, CHF NOS, Hx of breast malignancy, History of tobacco use, Long-term use anticoagul, Aortic atherosclerosis, Crnry athrscl natve vssl]","[Abdominal aneurysm without mention of rupture, Acute on chronic diastolic heart failure, Hematoma complicating a procedure, Acute posthemorrhagic anemia, Aortic valve disorders, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Hemorrhage complicating a procedure, Mechanical complication of other vascular device, implant, and graft, Other specified procedures as the cause of abnormal reaction of patient, or of later complication, without mention of misadventure at time of procedure, Other and unspecified hyperlipidemia, Aortocoronary bypass status, Congestive heart failure, unspecified, Personal history of malignant neoplasm of breast, Personal history of tobacco use, Long-term (current) use of anticoagulants, Atherosclerosis of aorta, Coronary atherosclerosis of native coronary artery]",1,1,1,0,0,0



### Step 5) Get discharge summaries for the 36k people
- As of 11/12/2018, we will NOT use this table to identify chronic opiate users; we will use the outpatient prescription table instead
- 40,217 records. Note: discharge information can be of type Report or Addendum, and both can exist for a patient. We filter for Report only
- 36,888 records after filtering. There are still about 444 duplicates that need investigating
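
One way to approach the Report filter and the duplicate check is sketched below on toy data. It assumes the `description` column distinguishes Report from Addendum (as in the output below) and that a "duplicate" means more than one Report for the same `hadm_id`.

```python
import pandas as pd

notes = pd.DataFrame({
    "hadm_id": [145834, 145834, 113369, 113369, 150750],
    "category": ["Discharge summary"] * 5,
    "description": ["Report", "Addendum", "Report", "Report", "Report"],
})

# Keep only the main reports, dropping addenda
reports = notes[notes["description"] == "Report"]

# keep=False marks every row of a repeated hadm_id, not just the later copies
duplicate_reports = reports[reports.duplicated(subset="hadm_id", keep=False)]
n_admissions_with_duplicates = duplicate_reports["hadm_id"].nunique()
```

Inspecting `duplicate_reports` side by side is a reasonable first step before deciding which copy to keep.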


In [616]:
df_step5 = helper.run_query(query=queries.discharge_events, db_connection=con)
df_step5



Unnamed: 0,row_id,subject_id,hadm_id,intime,outtime,age,diff_death_admit_hrs,diff_last_outtime,icd9_codes,seq_num,short_titles,long_titles,valid_icu_admit,valid_age,valid_death,opiate_abuse,has_anoxic_brain,has_cancer,category,description,text
0,2,3,145834,2101-10-20 19:10:11,2101-10-26 20:43:09,76.52,5668.830278,,"[0389, 78559, 5849, 4275, 41071, 4280, 6826, 4254, 2639]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Septicemia NOS, Shock w/o trauma NEC, Acute kidney failure NOS, Cardiac arrest, Subendo infarct, initial, CHF NOS, Cellulitis of leg, Prim cardiomyopathy NEC, Protein-cal malnutr NOS]","[Unspecified septicemia, Other shock without mention of trauma, Acute kidney failure, unspecified, Cardiac arrest, Subendocardial infarction, initial episode of care, Congestive heart failure, unspecified, Cellulitis and abscess of leg, except foot, Other primary cardiomyopathies, Unspecified protein-calorie malnutrition]",1,1,1,0,0,0,Discharge summary,Report,"Admission Date: [**2101-10-20**] Discharge Date: [**2101-10-31**]\n\nDate of Birth: [**2025-4-11**] Sex: M\n\nService: Medicine\n\nCHIEF COMPLAINT: Admitted from rehabilitation for\nhypotension (systolic blood pressure to the 70s) and\ndecreased urine output.\n\nHISTORY OF PRESENT ILLNESS: The patient is a 76-year-old\nmale who had been hospitalized at the [**Hospital1 190**] from [**10-11**] through [**10-19**] of [**2101**]\nafter undergoing a left femoral-AT bypass graft and was\nsubsequently discharged to a rehabilitation facility.\n\nOn [**2101-10-20**], he presented again to the [**Hospital1 346**] after being found to have a systolic\nblood pressure in the 70s and no urine output for 17 hours.\nA Foley catheter placed at the rehabilitation facility\nyielded 100 cc of murky/brown urine. There may also have\nbeen purulent discharge at the penile meatus at this time.\n\nOn presentation to the Emergency Department, the patient was\nwithout subjective complaints...."
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
36887,61532,99999,113369,2117-12-31 11:52:36,2118-01-01 14:51:27,63.98,,,"[75612, 7861, 4019, 25000, 4139]","[1, 2, 3, 4, 5]","[Spondylolisthesis, Stridor, Hypertension NOS, DMII wo cmp nt st uncntr, Angina pectoris NEC/NOS]","[Spondylolisthesis, Stridor, Unspecified essential hypertension, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Other and unspecified angina pectoris]",1,1,1,0,0,0,Discharge summary,Report,"Admission Date: [**2117-12-30**] Discharge Date: [**2118-1-4**]\n\nDate of Birth: [**2054-1-6**] Sex: F\n\nService: NEUROSURGERY\n\nAllergies:\nPenicillins\n\nAttending:[**First Name3 (LF) 2724**]\nChief Complaint:\nNeurogenic claudication\n\nMajor Surgical or Invasive Procedure:\n[**2117-12-30**] PLIF L4-5\n\n\nHistory of Present Illness:\n63-year-old woman who complains of bilateral lower extremity\nsymptoms that are exacerbated by walking. She receives some\namelioration with rest. She denies difficulty with bowel or\nbladder function.\n\n\nPast Medical History:\nHTN\nDiabetes\nAngina\n\nSocial History:\nNC\n\nFamily History:\nNC\n\nPhysical Exam:\nPre-Op on clinic visit:\n\nOn examination, her motor strength was [**3-30**] in hip flexion,\nextension, quadriceps, hamstrings, dorsiflexion, and plantar\nflexion bilaterally. Her sensory examination was intact with\nrespect to the modality of light touch. Her reflexes were\nnormal\nand symmetric in..."


### Step 6) Parse the discharge summaries for opiates

In [424]:
finder.search(df_step5, DRUGLIST_FILE='opiates.txt', SUMMARY_FILE='test_jan.csv')

Using drugs from opiates.txt
Reading documents...
...index: 0. row_id: 2. subject_id: 3. hadm_id: 145834. 

...index: 100. row_id: 210. subject_id: 152. hadm_id: 117181. 

...index: 200. row_id: 362. subject_id: 265. hadm_id: 101608. 

...index: 300. row_id: 519. subject_id: 389. hadm_id: 134048. 

...index: 400. row_id: 689. subject_id: 525. hadm_id: 128280. 

...index: 500. row_id: 843. subject_id: 664. hadm_id: 181314. 

...index: 600. row_id: 991. subject_id: 780. hadm_id: 140935. 

...index: 700. row_id: 1171. subject_id: 905. hadm_id: 150569. 

...index: 800. row_id: 1353. subject_id: 1042. hadm_id: 177447. 

...index: 900. row_id: 1525. subject_id: 1183. hadm_id: 191513. 

...index: 1000. row_id: 1693. subject_id: 1324. hadm_id: 140065. 

...index: 1100. row_id: 1860. subject_id: 1459. hadm_id: 170103. 

...index: 1200. row_id: 2037. subject_id: 1604. hadm_id: 193058. 

...index: 1300. row_id: 2234. subject_id: 1769. hadm_id: 147591. 

[... output truncated ...]

...index: 12100. row_id: 21373. subject_id: 16855. hadm_id: 192793. 

...index: 12200. row_id: 21561. subject_id: 16993. hadm_id: 183854. 

...index: 12300. row_id: 21739. subject_id: 17125. hadm_id: 137179. 

...index: 12400. row_id: 21916. subject_id: 17260. hadm_id: 180007. 

...index: 12500. row_id: 22106. subject_id: 17414. hadm_id: 104313. 

...index: 12600. row_id: 22277. subject_id: 17566. hadm_id: 156549. 

...index: 12700. row_id: 22464. subject_id: 17712. hadm_id: 167975. 

...index: 12800. row_id: 22632. subject_id: 17835. hadm_id: 171801. 

...index: 12900. row_id: 22819. subject_id: 17981. hadm_id: 174389. 

...index: 13000. row_id: 23012. subject_id: 18134. hadm_id: 146200. 

...index: 13100. row_id: 23191. subject_id: 18287. hadm_id: 149224. 

...index: 13200. row_id: 23388. subject_id: 18456. hadm_id: 183887. 

...index: 13300. row_id: 23590. subject_id: 18624. hadm_id: 199390. 

...index: 13400. row_id: 23756. subject_id: 18748. hadm_id: 175847. 

[... output truncated ...]

...index: 23900. row_id: 42114. subject_id: 42351. hadm_id: 174564. 

...index: 24000. row_id: 42243. subject_id: 42769. hadm_id: 102128. 

...index: 24100. row_id: 42410. subject_id: 43220. hadm_id: 117549. 

...index: 24200. row_id: 42576. subject_id: 43749. hadm_id: 129564. 

...index: 24300. row_id: 42728. subject_id: 44269. hadm_id: 171586. 

...index: 24400. row_id: 42870. subject_id: 44694. hadm_id: 144807. 

...index: 24500. row_id: 43013. subject_id: 45111. hadm_id: 143962. 

...index: 24600. row_id: 43164. subject_id: 45524. hadm_id: 189369. 

...index: 24700. row_id: 43312. subject_id: 46007. hadm_id: 178313. 

...index: 24800. row_id: 43466. subject_id: 46399. hadm_id: 127874. 

...index: 24900. row_id: 43616. subject_id: 46845. hadm_id: 173151. 

...index: 25000. row_id: 43771. subject_id: 47311. hadm_id: 154886. 

...index: 25100. row_id: 43924. subject_id: 47798. hadm_id: 140591. 

...index: 25200. row_id: 44068. subject_id: 48184. hadm_id: 197111. 

[... output truncated ...]

...index: 35700. row_id: 59716. subject_id: 94687. hadm_id: 196208. 

...index: 35800. row_id: 59884. subject_id: 95147. hadm_id: 180407. 

...index: 35900. row_id: 60032. subject_id: 95611. hadm_id: 105027. 

...index: 36000. row_id: 60194. subject_id: 96072. hadm_id: 106637. 

...index: 36100. row_id: 60342. subject_id: 96482. hadm_id: 155301. 

...index: 36200. row_id: 60494. subject_id: 96863. hadm_id: 179477. 

...index: 36300. row_id: 60636. subject_id: 97291. hadm_id: 158386. 

...index: 36400. row_id: 60794. subject_id: 97782. hadm_id: 102294. 

...index: 36500. row_id: 60956. subject_id: 98276. hadm_id: 164637. 

...index: 36600. row_id: 61097. subject_id: 98713. hadm_id: 105148. 

...index: 36700. row_id: 61244. subject_id: 99138. hadm_id: 148579. 

...index: 36800. row_id: 61406. subject_id: 99611. hadm_id: 108679. 

Done analyzing 36888 documents in 261.74 seconds (140.94 docs/sec)
Summary file is in /Users/eightiesfanjan/Desktop/research/opioid_mimic_research
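
The internals of `finder.search` are not shown in this notebook. As a simplified sketch of the general idea (flag each note for mentions of drugs from a list), the following uses case-insensitive, whole-word regular expressions. The drug list is a subset of the column names in the summary file, and `flag_opiates` is a hypothetical name.

```python
import re

# Subset of the drug names that appear as columns in the summary file
DRUGS = ["hydromorphone", "hydrocodone", "oxycodone", "morphine", "fentanyl"]
PATTERNS = {
    drug: re.compile(r"\b" + re.escape(drug) + r"\b", re.IGNORECASE)
    for drug in DRUGS
}

def flag_opiates(text):
    """Return a 0/1 flag per drug plus an overall 'opiates' flag."""
    flags = {drug: int(bool(pattern.search(text))) for drug, pattern in PATTERNS.items()}
    flags["opiates"] = int(any(flags.values()))
    return flags

example = flag_opiates("Medications on Admission:\nMorphine 2 mg IV prn\nAspirin 81 mg")
```

The word boundaries matter here: without them, "morphine" would also match inside "hydromorphone" and inflate the morphine flag.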


### Step 7) Generate flags for opiates and join back to original dataframe

- Total records with a medications-on-admission section: 32,384
    - With opiates in meds: 4,184
    - Without opiates in meds: 28,200

- This is done by:
    - removing ambiguous admit data (the admit_found column, which we then filter on)
    - removing ambiguous group data (the group column, which is used to determine the opiate/non-opiate split)
    - joining the raw SQL output to the medication data
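
`helper.remove_ambiguous_data` is not shown, so the following is only one plausible reading of its log output below: a subject/admission pair is treated as ambiguous when it has conflicting values in the given column, and such pairs are dropped entirely. The function name `drop_ambiguous` and the toy data are hypothetical.

```python
import pandas as pd

def drop_ambiguous(df, column):
    """Drop subject/admission pairs whose `column` takes more than one value."""
    keys = ["subject_id", "hadm_id"]
    n_values = df.groupby(keys)[column].transform("nunique")
    # Keep unambiguous pairs, collapsing exact repeats to a single row
    return df[n_values == 1].drop_duplicates(subset=keys)

meds = pd.DataFrame({
    "subject_id": [3, 3, 4, 5, 5],
    "hadm_id": [145834, 145834, 185777, 107064, 107064],
    "admit_found": [1, 0, 1, 1, 1],   # subject 3 has conflicting values
    "group": [3, 3, 2, 2, 2],
})

clean = drop_ambiguous(drop_ambiguous(meds, "admit_found"), "group")

# Sanity check on the counts listed above: opiate + non-opiate = total
assert 4184 + 28200 == 32384
```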

In [635]:
reload(queries)
reload(helper)

<module 'notebook_helper' from 'notebook_helper.py'>

In [527]:
# load the output to a dataframe
medications = pd.read_csv('test_jan.csv')
medications

Unnamed: 0,row_id,subject_id,hadm_id,hist_found,opiate_history,admit_found,dis_found,group,opiates,hydromorphone,hydrocodone,oxycodone,morphine,fentanyl,tramadol,buprenorphine,methadone,oxymorphone,meperidine
0,2,3,145834,1,0,1,1,3,1,0,0,1,0,0,0,0,0,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
36887,61532,99999,113369,1,0,1,1,2,0,0,0,0,0,0,0,0,0,0,0


In [528]:
df_clean_admit_groups = helper.remove_ambiguous_data(medications)

Removing ambiguous subject/admissions using column=admit_found
Checking unique subject/admissions using column=admit_found
Success! Old df cnt=36888, new df cnt=35066
Removing ambiguous subject/admissions using column=group
Checking unique subject/admissions using column=group
Success! Old df cnt=35066, new df cnt=34610


In [617]:
df_joined = helper.get_admit_df(df_step5, df_clean_admit_groups)
df_joined

Unnamed: 0,row_id,subject_id,hadm_id,intime,outtime,age,diff_death_admit_hrs,diff_last_outtime,icd9_codes,seq_num,short_titles,long_titles,valid_icu_admit,valid_age,valid_death,opiate_abuse,has_anoxic_brain,has_cancer,category,description,text,hist_found,opiate_history,admit_found,dis_found,group,opiates,hydromorphone,hydrocodone,oxycodone,morphine,fentanyl,tramadol,buprenorphine,methadone,oxymorphone,meperidine
0,2,3,145834,2101-10-20 19:10:11,2101-10-26 20:43:09,76.52,5668.830278,,"[0389, 78559, 5849, 4275, 41071, 4280, 6826, 4254, 2639]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Septicemia NOS, Shock w/o trauma NEC, Acute kidney failure NOS, Cardiac arrest, Subendo infarct, initial, CHF NOS, Cellulitis of leg, Prim cardiomyopathy NEC, Protein-cal malnutr NOS]","[Unspecified septicemia, Other shock without mention of trauma, Acute kidney failure, unspecified, Cardiac arrest, Subendocardial infarction, initial episode of care, Congestive heart failure, unspecified, Cellulitis and abscess of leg, except foot, Other primary cardiomyopathies, Unspecified protein-calorie malnutrition]",1,1,1,0,0,0,Discharge summary,Report,"Admission Date: [**2101-10-20**] Discharge Date: [**2101-10-31**]\n\nDate of Birth: [**2025-4-11**] Sex: M\n\nService: Medicine\n\nCHIEF COMPLAINT: Admitted from rehabilitation for\nhypotension (systolic blood pressure to the 70s) and\ndecreased urine output.\n\nHISTORY OF PRESENT ILLNESS: The patient is a 76-year-old\nmale who had been hospitalized at the [**Hospital1 190**] from [**10-11**] through [**10-19**] of [**2101**]\nafter undergoing a left femoral-AT bypass graft and was\nsubsequently discharged to a rehabilitation facility.\n\nOn [**2101-10-20**], he presented again to the [**Hospital1 346**] after being found to have a systolic\nblood pressure in the 70s and no urine output for 17 hours.\nA Foley catheter placed at the rehabilitation facility\nyielded 100 cc of murky/brown urine. There may also have\nbeen purulent discharge at the penile meatus at this time.\n\nOn presentation to the Emergency Department, the patient was\nwithout subjective complaints....",1,0,1,1,3,1,0,0,1,0,0,0,0,0,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
35633,61532,99999,113369,2117-12-31 11:52:36,2118-01-01 14:51:27,63.98,,,"[75612, 7861, 4019, 25000, 4139]","[1, 2, 3, 4, 5]","[Spondylolisthesis, Stridor, Hypertension NOS, DMII wo cmp nt st uncntr, Angina pectoris NEC/NOS]","[Spondylolisthesis, Stridor, Unspecified essential hypertension, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Other and unspecified angina pectoris]",1,1,1,0,0,0,Discharge summary,Report,"Admission Date: [**2117-12-30**] Discharge Date: [**2118-1-4**]\n\nDate of Birth: [**2054-1-6**] Sex: F\n\nService: NEUROSURGERY\n\nAllergies:\nPenicillins\n\nAttending:[**First Name3 (LF) 2724**]\nChief Complaint:\nNeurogenic claudication\n\nMajor Surgical or Invasive Procedure:\n[**2117-12-30**] PLIF L4-5\n\n\nHistory of Present Illness:\n63-year-old woman who complains of bilateral lower extremity\nsymptoms that are exacerbated by walking. She receives some\namelioration with rest. She denies difficulty with bowel or\nbladder function.\n\n\nPast Medical History:\nHTN\nDiabetes\nAngina\n\nSocial History:\nNC\n\nFamily History:\nNC\n\nPhysical Exam:\nPre-Op on clinic visit:\n\nOn examination, her motor strength was [**3-30**] in hip flexion,\nextension, quadriceps, hamstrings, dorsiflexion, and plantar\nflexion bilaterally. Her sensory examination was intact with\nrespect to the modality of light touch. Her reflexes were\nnormal\nand symmetric in...",1,0,1,1,2,0,0,0,0,0,0,0,0,0,0,0


In [618]:
df_joined.drop(axis=1, labels=['short_titles', 'category', 'description', 'text'], inplace=True)

In [619]:
df_joined

Unnamed: 0,row_id,subject_id,hadm_id,intime,outtime,age,diff_death_admit_hrs,diff_last_outtime,icd9_codes,seq_num,long_titles,valid_icu_admit,valid_age,valid_death,opiate_abuse,has_anoxic_brain,has_cancer,hist_found,opiate_history,admit_found,dis_found,group,opiates,hydromorphone,hydrocodone,oxycodone,morphine,fentanyl,tramadol,buprenorphine,methadone,oxymorphone,meperidine
0,2,3,145834,2101-10-20 19:10:11,2101-10-26 20:43:09,76.52,5668.830278,,"[0389, 78559, 5849, 4275, 41071, 4280, 6826, 4254, 2639]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Unspecified septicemia, Other shock without mention of trauma, Acute kidney failure, unspecified, Cardiac arrest, Subendocardial infarction, initial episode of care, Congestive heart failure, unspecified, Cellulitis and abscess of leg, except foot, Other primary cardiomyopathies, Unspecified protein-calorie malnutrition]",1,1,1,0,0,0,1,0,1,1,3,1,0,0,1,0,0,0,0,0,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
35633,61532,99999,113369,2117-12-31 11:52:36,2118-01-01 14:51:27,63.98,,,"[75612, 7861, 4019, 25000, 4139]","[1, 2, 3, 4, 5]","[Spondylolisthesis, Stridor, Unspecified essential hypertension, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Other and unspecified angina pectoris]",1,1,1,0,0,0,1,0,1,1,2,0,0,0,0,0,0,0,0,0,0,0


### Step 8) Get outcomes: ICU LOS, hospital LOS, mortality

In [645]:
reload(helper)
reload(queries)

<module 'queries' from 'queries.py'>

In [636]:
# TODO: rename column
# ICU length of stay in hours: outtime minus intime
df_joined = helper.get_time_diff_hours(df=df_joined,
                                       col_out='outtime',
                                       col_in='intime',
                                       name='icu_los_hours')
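
The source of `helper.get_time_diff_hours` is not shown in this notebook, so the following is only a minimal sketch of what such a helper might do, assuming it subtracts two timestamp columns and stores the difference in hours. The function name `time_diff_hours` is hypothetical; the example values are the `intime` and `outtime` of the first row in the output above.

```python
import pandas as pd

def time_diff_hours(df, col_out, col_in, name):
    """Return a copy of df with a new column holding (col_out - col_in) in hours."""
    out = df.copy()
    delta = pd.to_datetime(out[col_out]) - pd.to_datetime(out[col_in])
    out[name] = delta.dt.total_seconds() / 3600.0
    return out

# ICU stay taken from the first row of the dataframe output above
example = pd.DataFrame({
    'intime': ['2101-10-20 19:10:11'],
    'outtime': ['2101-10-26 20:43:09'],
})
example = time_diff_hours(example, col_out='outtime', col_in='intime',
                          name='icu_los_hours')
print(example['icu_los_hours'].round(6).tolist())  # [145.549444]
```

The result agrees with the `icu_los_hours` value reported for that stay later in this step, which suggests the real helper computes something equivalent.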

In [637]:
df_step8 = helper.run_query(query=queries.hospital_outcomes, db_connection=con)
df_step8

Unnamed: 0,subject_id,hadm_id,hospital_intime,hospital_outtime,deathtime,hospital_expire_flag,admission_type,discharge_location,diagnosis
0,22,165315,2196-04-09 12:26:00,2196-04-10 15:54:00,,0,EMERGENCY,DISC-TRAN CANCER/CHLDRN H,BENZODIAZEPINE OVERDOSE
...,...,...,...,...,...,...,...,...,...
58975,98813,190264,2131-10-25 03:09:00,2131-10-26 17:44:00,,0,EMERGENCY,HOME,INTRACRANIAL HEMORRHAGE


In [638]:
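# Inner join on patient and admission IDs; rows without a match on both keys are dropped
keys = ['subject_id', 'hadm_id']
df_hospital = df_joined.merge(df_step8, on=keys, how='inner')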
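
Because an inner join silently drops unmatched rows, it can help to check how many ICU rows have no hospital record before merging. In the outputs of this step the last index label goes from 35633 to 32383, which may mean rows were dropped, or may only reflect that `merge` resets the index. The sketch below uses toy stand-ins for `df_joined` and `df_step8` (the third row's IDs are made up for illustration) and pandas' `indicator=True` option to list the unmatched rows.

```python
import pandas as pd

# Toy stand-ins for df_joined and df_step8; values are illustrative only
icu = pd.DataFrame({'subject_id': [3, 22, 99],
                    'hadm_id': [145834, 165315, 111111]})
hosp = pd.DataFrame({'subject_id': [3, 22],
                     'hadm_id': [145834, 165315],
                     'admission_type': ['EMERGENCY', 'EMERGENCY']})

keys = ['subject_id', 'hadm_id']

# A left join with indicator=True keeps every ICU row and adds a `_merge`
# column saying whether the row was found in both frames or only on the left
checked = icu.merge(hosp, on=keys, how='left', indicator=True)
unmatched = checked[checked['_merge'] == 'left_only']

print(len(icu), len(checked), len(unmatched))  # 3 3 1
```

Running the same check on the real frames would show whether any admissions are lost and which ones.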

In [639]:
# Hospital length of stay in hours: hospital_outtime minus hospital_intime
df_hospital = helper.get_time_diff_hours(df=df_hospital,
                                         col_out='hospital_outtime',
                                         col_in='hospital_intime',
                                         name='hospital_los_hours')

In [641]:
df_hospital

Unnamed: 0,row_id,subject_id,hadm_id,intime,outtime,age,diff_death_admit_hrs,diff_last_outtime,icd9_codes,seq_num,long_titles,valid_icu_admit,valid_age,valid_death,opiate_abuse,has_anoxic_brain,has_cancer,hist_found,opiate_history,admit_found,dis_found,group,opiates,hydromorphone,hydrocodone,oxycodone,morphine,fentanyl,tramadol,buprenorphine,methadone,oxymorphone,meperidine,icu_los_hours,hospital_intime,hospital_outtime,deathtime,hospital_expire_flag,admission_type,discharge_location,diagnosis,hospital_los_hours
0,2,3,145834,2101-10-20 19:10:11,2101-10-26 20:43:09,76.52,5668.830278,,"[0389, 78559, 5849, 4275, 41071, 4280, 6826, 4254, 2639]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Unspecified septicemia, Other shock without mention of trauma, Acute kidney failure, unspecified, Cardiac arrest, Subendocardial infarction, initial episode of care, Congestive heart failure, unspecified, Cellulitis and abscess of leg, except foot, Other primary cardiomyopathies, Unspecified protein-calorie malnutrition]",1,1,1,0,0,0,1,0,1,1,3,1,0,0,1,0,0,0,0,0,0,0,145.549444,2101-10-20 19:08:00,2101-10-31 13:58:00,,0,EMERGENCY,SNF,HYPOTENSION,258.833333
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
32383,61532,99999,113369,2117-12-31 11:52:36,2118-01-01 14:51:27,63.98,,,"[75612, 7861, 4019, 25000, 4139]","[1, 2, 3, 4, 5]","[Spondylolisthesis, Stridor, Unspecified essential hypertension, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Other and unspecified angina pectoris]",1,1,1,0,0,0,1,0,1,1,2,0,0,0,0,0,0,0,0,0,0,0,26.980833,2117-12-30 07:15:00,2118-01-04 16:30:00,,0,ELECTIVE,SNF,SPONDYLOLISTHESIS/SDA,129.250000


In [668]:
reload(queries)
reload(helper)

<module 'notebook_helper' from 'notebook_helper.py'>

In [669]:
df_death = helper.run_query(query=queries.death_outcome, db_connection=con, check_distinct_events=False)

In [670]:
data_w_first_outcomes = helper.get_mortality_outcome(df_hospital, df_death)
data_w_first_outcomes

Unnamed: 0,row_id,subject_id,hadm_id,intime,outtime,age,diff_death_admit_hrs,diff_last_outtime,icd9_codes,seq_num,long_titles,valid_icu_admit,valid_age,valid_death,opiate_abuse,has_anoxic_brain,has_cancer,hist_found,opiate_history,admit_found,dis_found,group,opiates,hydromorphone,hydrocodone,oxycodone,morphine,fentanyl,tramadol,buprenorphine,methadone,oxymorphone,meperidine,icu_los_hours,hospital_intime,hospital_outtime,deathtime,hospital_expire_flag,admission_type,discharge_location,diagnosis,hospital_los_hours,gender,dod,dod_hosp,dod_ssn,death_days_since_hospital,30day_mortality,1year_mortality
0,2,3,145834,2101-10-20 19:10:11,2101-10-26 20:43:09,76.52,5668.830278,,"[0389, 78559, 5849, 4275, 41071, 4280, 6826, 4254, 2639]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Unspecified septicemia, Other shock without mention of trauma, Acute kidney failure, unspecified, Cardiac arrest, Subendocardial infarction, initial episode of care, Congestive heart failure, unspecified, Cellulitis and abscess of leg, except foot, Other primary cardiomyopathies, Unspecified protein-calorie malnutrition]",1,1,1,0,0,0,1,0,1,1,3,1,0,0,1,0,0,0,0,0,0,0,145.549444,2101-10-20 19:08:00,2101-10-31 13:58:00,,0,EMERGENCY,SNF,HYPOTENSION,258.833333,M,2102-06-14,NaT,2102-06-14 00:00:00,5410.033333,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
32383,61532,99999,113369,2117-12-31 11:52:36,2118-01-01 14:51:27,63.98,,,"[75612, 7861, 4019, 25000, 4139]","[1, 2, 3, 4, 5]","[Spondylolisthesis, Stridor, Unspecified essential hypertension, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Other and unspecified angina pectoris]",1,1,1,0,0,0,1,0,1,1,2,0,0,0,0,0,0,0,0,0,0,0,26.980833,2117-12-30 07:15:00,2118-01-04 16:30:00,,0,ELECTIVE,SNF,SPONDYLOLISTHESIS/SDA,129.250000,F,NaT,NaT,,,0,0
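
A note on units: for the first row above, `death_days_since_hospital` is 5410.03, which equals the number of hours between `hospital_outtime` (2101-10-31 13:58) and `dod` (2102-06-14), or roughly 225 days. The column therefore appears to hold hours despite its name, and the thresholds inside `helper.get_mortality_outcome` may be worth checking, since that row shows `1year_mortality = 0`. The helper's source is not shown here and it may use a different definition, so the following is only a sketch of one common way to derive the flags, assuming mortality is measured in days from hospital discharge. The function and column names are hypothetical.

```python
import pandas as pd

def add_mortality_flags(df, ref_col='hospital_outtime', dod_col='dod'):
    """Add days from ref_col to death plus 30-day and 1-year flags (0/1)."""
    out = df.copy()
    delta = pd.to_datetime(out[dod_col]) - pd.to_datetime(out[ref_col])
    out['death_days'] = delta.dt.total_seconds() / 86400.0
    # Comparisons against NaN are False, so patients with no recorded
    # date of death get a flag of 0
    out['mortality_30d'] = (out['death_days'] <= 30).astype(int)
    out['mortality_1y'] = (out['death_days'] <= 365).astype(int)
    return out

# Discharge time and date of death from the two rows shown above
rows = pd.DataFrame({
    'hospital_outtime': ['2101-10-31 13:58:00', '2118-01-04 16:30:00'],
    'dod': ['2102-06-14', None],
})
flags = add_mortality_flags(rows)
print(flags[['death_days', 'mortality_30d', 'mortality_1y']])
```

Under this definition the first patient (death about 225 days after discharge) would be flagged for 1-year mortality but not for 30-day mortality.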


## Phase 2: Demographic Data
Steps: 
    1. Acquire ICD9 codes for all comorbidities
    2. Construct flags based on all specified comorbidities

From this point on, the results of raw SQL queries are joined to our medication list.
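
As a sketch of how the flags in step 2 could be built, assuming `icd9_codes` holds a list of undotted ICD9 code strings per admission (as in the dataframe output earlier), one option is to match each code against a set of prefixes. The mapping below is hypothetical and only illustrative; the study's actual comorbidity list is not given here. The three prefixes correspond to titles that appear in the data above (25000 diabetes mellitus, 4019 essential hypertension, 4280 congestive heart failure).

```python
import pandas as pd

# Hypothetical mapping from flag name to ICD9 code prefixes (illustrative only)
COMORBIDITY_PREFIXES = {
    'has_diabetes': ('250',),
    'has_hypertension': ('401',),
    'has_chf': ('428',),
}

def add_comorbidity_flags(df, code_col='icd9_codes',
                          prefixes=COMORBIDITY_PREFIXES):
    """Add one 0/1 column per comorbidity, set when any code matches a prefix."""
    out = df.copy()
    for flag, stems in prefixes.items():
        # str.startswith accepts a tuple, so several prefixes can share a flag
        out[flag] = out[code_col].apply(
            lambda codes: int(any(str(c).startswith(stems) for c in codes)))
    return out

# Code lists copied from the two admissions shown in the output above
demo = pd.DataFrame({
    'hadm_id': [145834, 113369],
    'icd9_codes': [
        ['0389', '78559', '5849', '4275', '41071', '4280', '6826', '4254', '2639'],
        ['75612', '7861', '4019', '25000', '4139'],
    ],
})
demo = add_comorbidity_flags(demo)
print(demo[['hadm_id', 'has_diabetes', 'has_hypertension', 'has_chf']])
```

Prefix matching keeps the mapping short, but it relies on the codes being stored without a decimal point and with leading zeros intact.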

### Step 1) Acquire ICD9 codes for all comorbidities

In [532]:
df_comord = df_join.copy()
df_comord

Unnamed: 0,row_id,subject_id,hadm_id,intime,outtime,age,diff_death_admit_hrs,diff_last_outtime,icd9_codes,seq_num,short_titles,long_titles,valid_icu_admit,valid_age,valid_death,opiate_abuse,has_anoxic_brain,has_cancer,category,description,text,index,hist_found,opiate_history,admit_found,dis_found,group,opiates,hydromorphone,hydrocodone,oxycodone,morphine,fentanyl,tramadol,buprenorphine,methadone,oxymorphone,meperidine
0,2,3,145834,2101-10-20 19:10:11,2101-10-26 20:43:09,76.52,5668.830278,,"[0389, 78559, 5849, 4275, 41071, 4280, 6826, 4254, 2639]","[1, 2, 3, 4, 5, 6, 7, 8, 9]","[Septicemia NOS, Shock w/o trauma NEC, Acute kidney failure NOS, Cardiac arrest, Subendo infarct, initial, CHF NOS, Cellulitis of leg, Prim cardiomyopathy NEC, Protein-cal malnutr NOS]","[Unspecified septicemia, Other shock without mention of trauma, Acute kidney failure, unspecified, Cardiac arrest, Subendocardial infarction, initial episode of care, Congestive heart failure, unspecified, Cellulitis and abscess of leg, except foot, Other primary cardiomyopathies, Unspecified protein-calorie malnutrition]",1,1,1,0,0,0,Discharge summary,Report,"Admission Date: [**2101-10-20**] Discharge Date: [**2101-10-31**]\n\nDate of Birth: [**2025-4-11**] Sex: M\n\nService: Medicine\n\nCHIEF COMPLAINT: Admitted from rehabilitation for\nhypotension (systolic blood pressure to the 70s) and\ndecreased urine output.\n\nHISTORY OF PRESENT ILLNESS: The patient is a 76-year-old\nmale who had been hospitalized at the [**Hospital1 190**] from [**10-11**] through [**10-19**] of [**2101**]\nafter undergoing a left femoral-AT bypass graft and was\nsubsequently discharged to a rehabilitation facility.\n\nOn [**2101-10-20**], he presented again to the [**Hospital1 346**] after being found to have a systolic\nblood pressure in the 70s and no urine output for 17 hours.\nA Foley catheter placed at the rehabilitation facility\nyielded 100 cc of murky/brown urine. There may also have\nbeen purulent discharge at the penile meatus at this time.\n\nOn presentation to the Emergency Department, the patient was\nwithout subjective complaints....",0,1,0,1,1,3,1,0,0,1,0,0,0,0,0,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
35633,61532,99999,113369,2117-12-31 11:52:36,2118-01-01 14:51:27,63.98,,,"[75612, 7861, 4019, 25000, 4139]","[1, 2, 3, 4, 5]","[Spondylolisthesis, Stridor, Hypertension NOS, DMII wo cmp nt st uncntr, Angina pectoris NEC/NOS]","[Spondylolisthesis, Stridor, Unspecified essential hypertension, Diabetes mellitus without mention of complication, type II or unspecified type, not stated as uncontrolled, Other and unspecified angina pectoris]",1,1,1,0,0,0,Discharge summary,Report,"Admission Date: [**2117-12-30**] Discharge Date: [**2118-1-4**]\n\nDate of Birth: [**2054-1-6**] Sex: F\n\nService: NEUROSURGERY\n\nAllergies:\nPenicillins\n\nAttending:[**First Name3 (LF) 2724**]\nChief Complaint:\nNeurogenic claudication\n\nMajor Surgical or Invasive Procedure:\n[**2117-12-30**] PLIF L4-5\n\n\nHistory of Present Illness:\n63-year-old woman who complains of bilateral lower extremity\nsymptoms that are exacerbated by walking. She receives some\namelioration with rest. She denies difficulty with bowel or\nbladder function.\n\n\nPast Medical History:\nHTN\nDiabetes\nAngina\n\nSocial History:\nNC\n\nFamily History:\nNC\n\nPhysical Exam:\nPre-Op on clinic visit:\n\nOn examination, her motor strength was [**3-30**] in hip flexion,\nextension, quadriceps, hamstrings, dorsiflexion, and plantar\nflexion bilaterally. Her sensory examination was intact with\nrespect to the modality of light touch. Her reflexes were\nnormal\nand symmetric in...",36887,1,0,1,1,2,0,0,0,0,0,0,0,0,0,0,0


In [None]:
df_comord.

## Phase 3: Clinical Data
Steps: 
    1. Extract reason for admission from icd9 code array
    2. Locate SOFA score
    3. Locate use of ALL mechanical ventilation
    4. Create flag from mechanical ventilation
    5. Extract duration from ventilation
    6. Locate ICD9 codes for pressors
    7. Create flags based on specified pressors
    8. Locate ICD9 codes for dialysis
    9. Create flags for dialysis
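
For step 1, a minimal sketch, assuming the reason for admission is taken to be the primary diagnosis, i.e. the ICD9 code with the lowest `seq_num`. That definition is my assumption and not something the study plan confirms here; the free-text `diagnosis` column from the hospital outcomes query is another candidate. The helper name `primary_diagnosis` and the new column names are hypothetical.

```python
import pandas as pd

def primary_diagnosis(codes, seq_nums, titles):
    """Return (code, title) for the entry with the lowest seq_num, or (None, None)."""
    if not codes:
        return (None, None)
    i = min(range(len(seq_nums)), key=lambda k: seq_nums[k])
    return (codes[i], titles[i])

# One admission, using a shortened version of the lists shown in the output above
df = pd.DataFrame({
    'icd9_codes': [['75612', '7861', '4019']],
    'seq_num': [[1, 2, 3]],
    'long_titles': [['Spondylolisthesis', 'Stridor',
                     'Unspecified essential hypertension']],
})

pairs = [primary_diagnosis(c, s, t)
         for c, s, t in zip(df['icd9_codes'], df['seq_num'], df['long_titles'])]
df['admit_code'] = [p[0] for p in pairs]
df['admit_title'] = [p[1] for p in pairs]
print(df[['admit_code', 'admit_title']])
```

Looking up the position by `seq_num` rather than taking the first element guards against arrays that were not aggregated in sequence order.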
