# Clean, reformat, and add useful columns to CDR dataset

* Input: `cdr_processed.csv`
* Output: `cdr_cleaned.csv`

In [1]:
####################################################
# Boilerplate import/setup code for general analysis
# everett.wetchler@gmail.com
####################################################

import datetime as dt
import os
import random

import datadotworld as dw
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns
from tqdm import tqdm

# Personal libraries
import evutils.everett_eda as eda

## Jupyter setup
%matplotlib inline

# [OPTIONAL]
# Print any variable that is executed on its own line
# (not just when it's the last line in a cell)
# Uncomment to use.
#from IPython.core.interactiveshell import InteractiveShell
#InteractiveShell.ast_node_interactivity = "all"

## Graphical setup
# Useful colors to reference
SNS_BLUE, SNS_GREEN, SNS_RED, SNS_PURPLE, SNS_YELLOW, SNS_CYAN = sns.color_palette()
SNS_COLORS = sns.color_palette()
# sns.set_palette(sns.color_palette("cubehelix", 8))
pd.set_option('display.max_columns', 500)
plt.rcParams.update({
  'font.size': 14,
  'axes.titlesize': 'x-large',
  'axes.labelsize': 'large',
  'xtick.labelsize': 'medium',
  'ytick.labelsize': 'medium',
  'legend.fancybox': True,
  'legend.fontsize': 'medium',
  'legend.frameon': True,
  'legend.framealpha': 0.7,
  'figure.figsize': [9, 6],
})

# Watermark extension to print version/system information
# Flags:
# -a [author] -d (date) -t (time) -z (timezone) -r (repo)
# -g (git hash) -w (watermark version) -p [packages] (package info)
%load_ext watermark
%watermark -a "Everett Wetchler" -d -t -z -w -p numpy,pandas,matplotlib,datadotworld

####################################################
# END Boilerplate
####################################################

Everett Wetchler 2018-05-05 22:06:30 CDT

numpy 1.14.3
pandas 0.20.1
matplotlib 2.2.0
datadotworld 1.6.0
watermark 1.5.0


In [68]:
df = pd.read_csv('cdr_processed.csv')
df.drop('Unnamed: 0', inplace=True, axis=1)
df = df[~(df['filename'].str.startswith('1111') | df['filename'].str.startswith('unknown'))]
df.head()



Unnamed: 0,Agency Information ** ** CDR Number,Agency Information ** ** PA Number,Agency Information ** ** Report Date,Agency Information ** ** Status,Agency Information ** ** Version Type,Agency Information ** Agency/Facility Information ** Agency Address,Agency Information ** Agency/Facility Information ** Agency City,Agency Information ** Agency/Facility Information ** Agency County,Agency Information ** Agency/Facility Information ** Agency Name,Agency Information ** Agency/Facility Information ** Agency Number,Agency Information ** Agency/Facility Information ** Agency Phone,Agency Information ** Agency/Facility Information ** Agency State,Agency Information ** Agency/Facility Information ** Agency Zip,Agency Information ** Agency/Facility Information ** Department ID,Agency Information ** Agency/Facility Information ** Department Type,Agency Information ** Director Information ** Director First Name,Agency Information ** Director Information ** Director Last Name,Agency Information ** Director Information ** Director Middle Name,Agency Information ** Director Information ** Director Salutation,Agency Information ** Director Information ** Reporter Email,Agency Information ** Director Information ** Reporter Name,CDR Information ** CDR Information from mainframe data ** Agency Address,CDR Information ** CDR Information from mainframe data ** Agency City,CDR Information ** CDR Information from mainframe data ** Agency Name,CDR Information ** CDR Information from mainframe data ** Agency Phone,CDR Information ** CDR Information from mainframe data ** Agency Zip,CDR Information ** CDR Information from mainframe data ** County,CDR Information ** CDR Information from mainframe data ** PA Number,CDR Information ** CDR Information from mainframe data ** Report Date,CDR Information ** CDR Information from mainframe data ** Reporter Name Original CDR,CDR Information ** CDR Information from mainframe data ** Reporter Title,CDR Summary ** Summary ** Summary,Decedent Info 
** Decedent Information ** Age At Time Of Death,Decedent Info ** Decedent Information ** Date of Birth,Decedent Info ** Decedent Information ** Education,Decedent Info ** Decedent Information ** First Name,Decedent Info ** Decedent Information ** Last Name,Decedent Info ** Decedent Information ** Marital Status,Decedent Info ** Decedent Information ** Middle Name,Decedent Info ** Decedent Information ** Occupation,Decedent Info ** Decedent Information ** Race,Decedent Info ** Decedent Information ** Sex,"Decedent Information ** Date/Time of Custody (arrest, incarceration) (mm/dd/yyyy hh:mm AM/PM): ** Date/Time of Custody or Incident",Decedent Information ** Date/Time of Death (mm/dd/yyyy hh:mm AM/PM): ** Death Date and Time,Decedent Information ** Identity of Deceased ** Age At Time Of Death,Decedent Information ** Identity of Deceased ** Date of Birth,Decedent Information ** Identity of Deceased ** Ethnicity,Decedent Information ** Identity of Deceased ** Ethnicity Other,Decedent Information ** Identity of Deceased ** First Name,Decedent Information ** Identity of Deceased ** Last Name,Decedent Information ** Identity of Deceased ** Middle Name,Decedent Information ** Identity of Deceased ** Race,Decedent Information ** Identity of Deceased ** Sex,Decedent Information ** Identity of Deceased ** Suffix,"General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent ** Decedent display/use of weapons","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Decedent Display or Use Weapon Details","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? 
** Discharged firearm; Displayed other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Displayed other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Displayed other weapon, specify:; Used other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Specify Weapon Displayed","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Specify Weapon Used","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Used other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Appear intoxicated (alcohol or drugs)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Attempt gain possession officer's weapon","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Attempt to Injure Others?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Barricade self or initiate standoff?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Escape or attempt to escape/flee custody","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Exhibit any medical problems?","General Information ** At any time during the incident 
and/or entry into the law enforcement facility, did the decedent: ** Exhibit any mental health problems?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Gain possession of officer's weapon","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Grab, hit or fight with the officer(s)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Make suicidal statements?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Other Behavior","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Physically attempt/assault officer(s)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Resist being handcuffed or arrested?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Specify Other Behavior","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Specify weapon used to threaten/assault","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Threaten the officer(s) involved","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Try to escape/flee from custody","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Use weapon threaten/assault officer(s)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Verbally threaten 
other(s) including law","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Ways Decedent Attempted To Injure Others",General Information ** Did any other law enforcement agencies respond to calls for service related to this incident? ** Other Agencies Respond?,General Information ** Injuries of Decedent ** Injured By,"General Information ** Type of restraint ** Other device, specify",General Information ** Type of restraint ** Type of Restraint,General Information ** Was the deceased under restraint in the time leading up to the death or the events causing the ** Under Restraint,"General Information ** Was the deceased under restraint in the time leading up to the death or the events causing the death? ** Other device, specify",General Information ** What were the most serious offense(s) with which the deceased was (or would have been) ** Offense 1,General Information ** What were the most serious offense(s) with which the deceased was (or would have been) charged with at the time of death? ** Offense 2,General Information ** What were the most serious offense(s) with which the deceased was (or would have been) charged with at the time of death? ** Offense 3,General Information ** What were the most serious offense(s) with which the deceased was (or would have been) charged with at the time of death? ** Were the Charges:,General Information ** What were the types of charges or reason for contact? (Hold CTRL to select all that apply) ** Type of Offense,"General Information ** What were the types of charges or reason for contact? (Hold CTRL to select all that apply) ** Type of Offense, Other",General Information ** What were the types of charges or reason for contact? ** Type of Offense,"General Information ** What were the types of charges or reason for contact? 
** Type of Offense, Other",Location / Custody Information ** Specific type of custody/facility: ** Custody Type Facility,Location / Custody Information ** Specific type of custody/facility: ** Facility/Detention Center,Location / Custody Information ** Specific type of custody/facility: ** Specific Type of Custody/Facility,Location / Custody Information ** Specific type of custody/facility: ** TDCJ - Specify Unit,Location / Custody Information ** What location category best describes where the event causing the death occurred? ** Location Category,Location / Custody Information ** What location category best describes where the event causing the death occurred? ** Other Location Category,Location / Custody Information ** What type of custody/facility was the Decedent in at the time of death: ** Type of Custody,Location / Custody Information ** What was the time and date of the deceased's entry into the law enforcement facility where ** Entry Date Time,Location / Custody Information ** What was the time and date of the deceased's entry into the law enforcement facility where the death occurred (mm/dd/yyyy hh:mm AM/PM): ** Entry Date Time N/A,Location / Custody Information ** Where did the death occur? ** Death Location,Location / Custody Information ** Where did the death occur? ** Death Location Elsewhere,Location / Custody Information ** Where did the event causing the death occur? ** City,Location / Custody Information ** Where did the event causing the death occur? ** County,Location / Custody Information ** Where did the event causing the death occur? ** Street Address,Location / Custody Information ** Where did the event causing the death occur? 
** Zip,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death ** Contributory Condition: Acute bronchopneumonia Medical Treatment,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death ** Medical Treatment,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Abdominal discomfort was taking the following medications,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Enalapril-Blood Pressure; Tramterene-Blood Pressure,"Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** End Liver Disease, Hepatitis C, Hypertension diabetes Hyperlipoidemia",Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Medical Treatment Description,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Medications,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Prostate Cancer,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** The deceased had been receiving the following medications,Manner / Cause of Death ** Has a medical examiner or coroner conducted an evaluation to determine a cause of death? 
** Medical Examinor/Coroner Evalution?,"Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (Hold CTRL to select ** Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (Hold CTRL to select ** Type of weapon that caused death?","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (Hold CTRL to select all that apply) ** Other weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Conducted energy device (e.g. Taser); Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Not Applicable; Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Other weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Type of Death Weapon","Manner / Cause of Death ** If death was an accident, homicide or suicide, what was the means of death? ** Means of Death","Manner / Cause of Death ** If death was an accident, homicide or suicide, what was the means of death? ** Means of Death Other","Manner / Cause of Death ** If death was an accident, homicide or suicide, who caused the death? ** Death Causer Other","Manner / Cause of Death ** If death was an accident, homicide or suicide, who caused the death? 
** Who caused the death?",Manner / Cause of Death ** Medical Cause of Death: ** Medical Cause of Death,Manner / Cause of Death ** Was the cause of death the result of a pre-existing medical condition or did the decedent ** Pre existing medical condition?,Manner / Cause of Death ** What was the manner of death? (select only one) ** Death Reason,Manner / Cause of Death ** What was the manner of death? (select only one) ** Manner of Death,Manner / Cause of Death ** What was the manner of death? (select only one) ** Manner of Death Description,Other Information ** Other Information ** Code of Charges,Other Information ** Other Information ** Custody Code,Other Information ** Other Information ** Custody or Incident,Other Information ** Other Information ** Date/Time of Custody or Incident,Other Information ** Other Information ** Death Code,Other Information ** Other Information ** Death Date and Time,Other Information ** Other Information ** Exhibit any medical problems?,Other Information ** Other Information ** Exhibit any mental health problems?,Other Information ** Other Information ** Intoxicated,Other Information ** Other Information ** Make suicidal statements?,Other Information ** Other Information ** Manner of Death Description,Other Information ** Other Information ** Medical Treatment,Other Information ** Other Information ** Medical Treatment Description,Other Information ** Other Information ** Type of Custody,"Summary of Incident ** Summary of How the Death Occurred: (max. 30,000 characters) ** Summary",filename
4,,,,,,,,,,,,,,,,,,,,,,,HOUSTON,HARRIS COUNTY SHERIFF'S DEPT.,,,,PA86103CJ,7/30/1986 12:00 AM,,,,0.0,1/9/1942,,WALPHA,SCOTT,,E,,,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,UNDETERMINED,UNDETERMINED,,5/6/1986 12:00 AM,NATURAL CAUSES,1/9/1942 12:00 AM,,,,,AIDS,Not Applicable,,County Jail,,19420109_a2C31000001uzEOEAY.pdf
5,,,,,,,,,,,,,,,,,,,,,,,HUNTSVILLE,TX DEPT. OF CRIMINAL JUSTICE,,,,PA84001P,4/27/1989 12:00 AM,,,DATE OF CUSTODY IS INCORRECT - DATE IS UNKNOWN,,,,LARRY,COOK,,E,,Anglo,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,CONVICTED,TEXAS DEPARTMENT OF CORRECTIONS,,1/8/1944 1:00 AM,UNDETERMINED,1/8/1944 1:00 AM,,,,,SEIZURE,Not Applicable,,Penitentiary,,19440108_a2C31000001uyFxEAI.pdf
6,,,,,,,,,,,,,,,,,,,,,,,,TX DEPT. OF CRIMINAL JUSTICE,,,,PA95321P,9/26/1995 12:00 AM,,,,-22.0,9/18/1972,ADVANCED DEGREE,RENAE,GUERRA,UNKNOWN,,,Hispanic,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,UNDETERMINED,TEXAS DEPARTMENT OF CORRECTIONS,,,HOMICIDE,6/25/1951 7:30 PM,No,No,UNKNOWN,No,SUSPECTED HOMICIDE,No,,Penitentiary,,19510625_a2C31000001uz4tEAA.pdf
7,,,,,,,,,,,,,,,,,,,,,,,HUNTSVILLE,TX DEPT. OF CRIMINAL JUSTICE,,,,PA85136P,9/10/1985 12:00 AM,,,DATE OF CUSTODY IS INCORRECT - DATE UNKNOWN,0.0,12/4/1954,,ARTURO,AGUILAR,,,,,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,UNDETERMINED,UNDETERMINED,,12/4/1954 12:00 AM,UNDETERMINED,12/4/1954 12:00 AM,,,,,HYPOVOLOMIA SHOCK,Not Applicable,,Penitentiary,,19541204_a2C31000001uy0eEAA.pdf
8,,,,,,,,,,,,,,,,,,,,,,,CEDAR HILL,CEDAR HILL POLICE DEPARTMENT,,,,PA84084MJ,6/15/1984 12:00 AM,,,DATE OF CUSTODY IS UNKNOWN,,,,RODGER,VESTAL,,T,,Anglo,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,UNDETERMINED,JAIL - MULTIPLE OCCUPANCY CELL,,6/5/1984 12:00 AM,SUICIDE,6/8/1958 12:00 AM,,,,,HANGING,Not Applicable,,Municipal Jail,,19580608_a2C31000001uzGcEAI.pdf


In [73]:
DATE = 'Agency Information **  ** Report Date'
df[DATE]

4                       NaN
5                       NaN
6                       NaN
7                       NaN
8                       NaN
9                       NaN
10                      NaN
11                      NaN
12                      NaN
13                      NaN
14                      NaN
15                      NaN
16                      NaN
17                      NaN
18                      NaN
19                      NaN
20                      NaN
21                      NaN
22                      NaN
23                      NaN
24                      NaN
25                      NaN
26                      NaN
27                      NaN
28                      NaN
29                      NaN
30                      NaN
31                      NaN
32                      NaN
33                      NaN
                ...        
11584      4/4/2018 9:59 AM
11585     4/5/2018 10:09 AM
11586     4/25/2018 7:44 AM
11587      4/9/2018 2:33 PM
11588     4/10/2018 

In [74]:
date_cols = [c for c in df.columns if 'Report Date' in c]
date_cols

['Agency Information **  ** Report Date',
 'CDR Information ** CDR Information from mainframe data ** Report Date']

In [75]:
pd.crosstab(df[date_cols[0]].notnull(), df[date_cols[1]].notnull())

Agency Information ** ** Report Date \ CDR Information ** CDR Information from mainframe data ** Report Date,False,True
False,1,5464
True,6145,0
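
The crosstab above suggests the two `Report Date` columns are almost perfectly complementary: 6145 rows have only the agency date, 5464 have only the mainframe date, and one row has neither. Under that reading, one way to get a single date column is to coalesce them. The following is a minimal sketch on toy rows, not part of the original notebook; the helper name `coalesce_report_date` and the format string are assumptions inferred from values such as `4/4/2018 9:59 AM`.

```python
import pandas as pd

AGENCY_DATE = 'Agency Information **  ** Report Date'
MAINFRAME_DATE = 'CDR Information ** CDR Information from mainframe data ** Report Date'

def coalesce_report_date(frame):
    """Hypothetical helper: prefer the agency date, fall back to the mainframe date."""
    merged = frame[AGENCY_DATE].combine_first(frame[MAINFRAME_DATE])
    # Assumed format (month/day/year, 12-hour clock); unparseable values become NaT.
    return pd.to_datetime(merged, format='%m/%d/%Y %I:%M %p', errors='coerce')

# Toy rows only: one per case seen in the crosstab (agency only, mainframe only, neither).
toy = pd.DataFrame({
    AGENCY_DATE: ['4/4/2018 9:59 AM', None, None],
    MAINFRAME_DATE: [None, '7/30/1986 12:00 AM', None],
})
report_date = coalesce_report_date(toy)
print(report_date)
```

`combine_first` keeps the left-hand value wherever it is present, so the order of the two columns only matters if a row ever has both dates, which the crosstab shows does not happen here.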


In [76]:
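# Drop COUNT columns left over from a previous run so this cell can be re-executed safely.
df.drop([c for c in df.columns if c.startswith('COUNT ')], axis=1, inplace=True)

# Column names look like 'Section ** Question ** Field'; the first component is the section.
sections = sorted({c.split(' ** ')[0] for c in df.columns})
print(sections)

# For each section, count how many of its fields are filled in on each row.
count_cols = []
for s in sections:
    sec_cols = [c for c in df.columns if c.split(' ** ')[0] == s]
    newcol = 'COUNT ' + s
    df[newcol] = df[sec_cols].notnull().sum(axis=1)
    count_cols.append(newcol)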

['Agency Information', 'CDR Information', 'CDR Summary', 'Decedent Info', 'Decedent Information', 'General Information', 'Location / Custody Information', 'Manner / Cause of Death', 'Other Information', 'Summary of Incident', 'filename']
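
The per-section counts computed in the cell above can also be obtained in one pass by grouping columns on their section prefix. This is a sketch under assumptions, shown on toy data: `section_counts` is a hypothetical helper, and the file names in the toy frame are made up.

```python
import pandas as pd

def section_counts(frame, sep=' ** '):
    """Hypothetical helper: non-null field count per section, one column per section."""
    filled = frame.notnull()
    # Transpose so columns become rows, then group those rows by section prefix.
    # A plain Python split is used on purpose: ' ** ' is not a valid regular expression.
    counts = filled.T.groupby(lambda col: col.split(sep)[0]).sum().T
    return counts.add_prefix('COUNT ')

# Toy frame with made-up values; column names follow the 'Section ** ... ** Field' pattern.
toy = pd.DataFrame({
    'Agency Information **  ** Report Date': ['4/4/2018 9:59 AM', None],
    'Agency Information ** Agency/Facility Information ** Agency City': ['HOUSTON', None],
    'CDR Information ** CDR Information from mainframe data ** Report Date': [None, '7/30/1986 12:00 AM'],
    'filename': ['a.pdf', 'b.pdf'],
})
counts = section_counts(toy)
print(counts)
```

As in the loop version, a column with no ` ** ` separator (here `filename`) forms its own section.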


In [77]:
df[count_cols].sample(10)

Unnamed: 0,COUNT Agency Information,COUNT CDR Information,COUNT CDR Summary,COUNT Decedent Info,COUNT Decedent Information,COUNT General Information,COUNT Location / Custody Information,COUNT Manner / Cause of Death,COUNT Other Information,COUNT Summary of Incident,COUNT filename
5665,19,0,0,0,8,10,8,9,0,1,1
10168,18,0,0,0,7,10,8,10,0,1,1
9628,18,0,0,0,7,10,8,10,0,1,1
4465,0,7,1,10,0,0,0,0,12,0,1
8962,18,0,0,0,8,10,8,10,0,1,1
9435,18,0,0,0,7,10,8,11,0,1,1
328,0,4,1,6,0,0,0,0,8,0,1
9508,18,0,0,0,7,10,8,11,0,1,1
4208,0,7,1,9,0,0,0,0,12,0,1
399,0,4,1,5,0,0,0,0,8,0,1


In [78]:
table = df[count_cols].corr().sort_values('COUNT Agency Information')
table

Unnamed: 0,COUNT Agency Information,COUNT CDR Information,COUNT CDR Summary,COUNT Decedent Info,COUNT Decedent Information,COUNT General Information,COUNT Location / Custody Information,COUNT Manner / Cause of Death,COUNT Other Information,COUNT Summary of Incident,COUNT filename
COUNT Other Information,-0.965233,0.954375,0.783378,0.990768,-0.967022,-0.913806,-0.948451,-0.9621,1.0,-0.973965,
COUNT Decedent Info,-0.953572,0.95692,0.778146,1.0,-0.95534,-0.902766,-0.936993,-0.950477,0.990768,-0.962199,
COUNT CDR Information,-0.937814,1.0,0.861266,0.95692,-0.939553,-0.887848,-0.921509,-0.93477,0.954375,-0.946299,
COUNT CDR Summary,-0.787326,0.861266,1.0,0.778146,-0.788786,-0.745378,-0.773638,-0.784771,0.783378,-0.794449,
COUNT General Information,0.898772,-0.887848,-0.745378,-0.902766,0.949643,1.0,0.938936,0.90032,-0.913806,0.938232,
COUNT Location / Custody Information,0.943016,-0.921509,-0.773638,-0.936993,0.9689,0.938936,1.0,0.954944,-0.948451,0.973804,
COUNT Decedent Information,0.977309,-0.939553,-0.788786,-0.95534,1.0,0.949643,0.9689,0.972151,-0.967022,0.992871,
COUNT Summary of Incident,0.991034,-0.946299,-0.794449,-0.962199,0.992871,0.938232,0.973804,0.987817,-0.973965,1.0,
COUNT Manner / Cause of Death,0.991337,-0.93477,-0.784771,-0.950477,0.972151,0.90032,0.954944,1.0,-0.9621,0.987817,
COUNT Agency Information,1.0,-0.937814,-0.787326,-0.953572,0.977309,0.898772,0.943016,0.991337,-0.965233,0.991034,
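
The correlations above are all strongly positive or strongly negative, which suggests the sections fall into two groups that are rarely filled in on the same record. A small sketch of separating such groups by the sign of their correlation with a reference column follows; `split_by_sign` is a hypothetical helper and the toy counts are invented for illustration.

```python
import pandas as pd

def split_by_sign(counts, reference):
    """Hypothetical helper: split columns by the sign of their correlation with `reference`."""
    corr = counts.corr()[reference].dropna()  # constant columns correlate as NaN and are dropped
    same = sorted(corr[corr > 0].index)
    opposite = sorted(corr[corr < 0].index)
    return same, opposite

# Invented counts: A and B are filled together, C is filled only when A and B are empty.
toy = pd.DataFrame({
    'COUNT A': [18, 19, 0, 0],
    'COUNT B': [8, 7, 0, 0],
    'COUNT C': [0, 0, 7, 4],
    'COUNT const': [1, 1, 1, 1],
})
same, opposite = split_by_sign(toy, 'COUNT A')
print(same, opposite)
```

A column that never varies, like `COUNT filename` in the table above, has an undefined correlation and ends up in neither group.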


In [79]:
df[count_cols[:3]].corr()

Unnamed: 0,COUNT Agency Information,COUNT CDR Information,COUNT CDR Summary
COUNT Agency Information,1.0,-0.937814,-0.787326
COUNT CDR Information,-0.937814,1.0,0.861266
COUNT CDR Summary,-0.787326,0.861266,1.0


In [80]:
df.groupby(count_cols[:3]).size()

COUNT Agency Information  COUNT CDR Information  COUNT CDR Summary
0                         3                      0                     249
                                                 1                      12
                          4                      0                     644
                                                 1                     583
                          5                      0                      29
                                                 1                      25
                          6                      0                      35
                                                 1                     145
                          7                      0                     319
                                                 1                    2661
                          8                      0                      15
                                                 1                     747
12                        0      

In [81]:
df[df['COUNT Agency Information'] == 0].head()

Unnamed: 0,Agency Information ** ** CDR Number,Agency Information ** ** PA Number,Agency Information ** ** Report Date,Agency Information ** ** Status,Agency Information ** ** Version Type,Agency Information ** Agency/Facility Information ** Agency Address,Agency Information ** Agency/Facility Information ** Agency City,Agency Information ** Agency/Facility Information ** Agency County,Agency Information ** Agency/Facility Information ** Agency Name,Agency Information ** Agency/Facility Information ** Agency Number,Agency Information ** Agency/Facility Information ** Agency Phone,Agency Information ** Agency/Facility Information ** Agency State,Agency Information ** Agency/Facility Information ** Agency Zip,Agency Information ** Agency/Facility Information ** Department ID,Agency Information ** Agency/Facility Information ** Department Type,Agency Information ** Director Information ** Director First Name,Agency Information ** Director Information ** Director Last Name,Agency Information ** Director Information ** Director Middle Name,Agency Information ** Director Information ** Director Salutation,Agency Information ** Director Information ** Reporter Email,Agency Information ** Director Information ** Reporter Name,CDR Information ** CDR Information from mainframe data ** Agency Address,CDR Information ** CDR Information from mainframe data ** Agency City,CDR Information ** CDR Information from mainframe data ** Agency Name,CDR Information ** CDR Information from mainframe data ** Agency Phone,CDR Information ** CDR Information from mainframe data ** Agency Zip,CDR Information ** CDR Information from mainframe data ** County,CDR Information ** CDR Information from mainframe data ** PA Number,CDR Information ** CDR Information from mainframe data ** Report Date,CDR Information ** CDR Information from mainframe data ** Reporter Name Original CDR,CDR Information ** CDR Information from mainframe data ** Reporter Title,CDR Summary ** Summary ** Summary,Decedent Info 
** Decedent Information ** Age At Time Of Death,Decedent Info ** Decedent Information ** Date of Birth,Decedent Info ** Decedent Information ** Education,Decedent Info ** Decedent Information ** First Name,Decedent Info ** Decedent Information ** Last Name,Decedent Info ** Decedent Information ** Marital Status,Decedent Info ** Decedent Information ** Middle Name,Decedent Info ** Decedent Information ** Occupation,Decedent Info ** Decedent Information ** Race,Decedent Info ** Decedent Information ** Sex,"Decedent Information ** Date/Time of Custody (arrest, incarceration) (mm/dd/yyyy hh:mm AM/PM): ** Date/Time of Custody or Incident",Decedent Information ** Date/Time of Death (mm/dd/yyyy hh:mm AM/PM): ** Death Date and Time,Decedent Information ** Identity of Deceased ** Age At Time Of Death,Decedent Information ** Identity of Deceased ** Date of Birth,Decedent Information ** Identity of Deceased ** Ethnicity,Decedent Information ** Identity of Deceased ** Ethnicity Other,Decedent Information ** Identity of Deceased ** First Name,Decedent Information ** Identity of Deceased ** Last Name,Decedent Information ** Identity of Deceased ** Middle Name,Decedent Information ** Identity of Deceased ** Race,Decedent Information ** Identity of Deceased ** Sex,Decedent Information ** Identity of Deceased ** Suffix,"General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent ** Decedent display/use of weapons","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Decedent Display or Use Weapon Details","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? 
** Discharged firearm; Displayed other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Displayed other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Displayed other weapon, specify:; Used other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Specify Weapon Displayed","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Specify Weapon Used","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Used other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Appear intoxicated (alcohol or drugs)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Attempt gain possession officer's weapon","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Attempt to Injure Others?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Barricade self or initiate standoff?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Escape or attempt to escape/flee custody","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Exhibit any medical problems?","General Information ** At any time during the incident 
and/or entry into the law enforcement facility, did the decedent: ** Exhibit any mental health problems?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Gain possession of officer's weapon","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Grab, hit or fight with the officer(s)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Make suicidal statements?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Other Behavior","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Physically attempt/assault officer(s)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Resist being handcuffed or arrested?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Specify Other Behavior","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Specify weapon used to threaten/assault","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Threaten the officer(s) involved","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Try to escape/flee from custody","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Use weapon threaten/assault officer(s)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Verbally threaten 
other(s) including law","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Ways Decedent Attempted To Injure Others",General Information ** Did any other law enforcement agencies respond to calls for service related to this incident? ** Other Agencies Respond?,General Information ** Injuries of Decedent ** Injured By,"General Information ** Type of restraint ** Other device, specify",General Information ** Type of restraint ** Type of Restraint,General Information ** Was the deceased under restraint in the time leading up to the death or the events causing the ** Under Restraint,"General Information ** Was the deceased under restraint in the time leading up to the death or the events causing the death? ** Other device, specify",General Information ** What were the most serious offense(s) with which the deceased was (or would have been) ** Offense 1,General Information ** What were the most serious offense(s) with which the deceased was (or would have been) charged with at the time of death? ** Offense 2,General Information ** What were the most serious offense(s) with which the deceased was (or would have been) charged with at the time of death? ** Offense 3,General Information ** What were the most serious offense(s) with which the deceased was (or would have been) charged with at the time of death? ** Were the Charges:,General Information ** What were the types of charges or reason for contact? (Hold CTRL to select all that apply) ** Type of Offense,"General Information ** What were the types of charges or reason for contact? (Hold CTRL to select all that apply) ** Type of Offense, Other",General Information ** What were the types of charges or reason for contact? ** Type of Offense,"General Information ** What were the types of charges or reason for contact? 
** Type of Offense, Other",Location / Custody Information ** Specific type of custody/facility: ** Custody Type Facility,Location / Custody Information ** Specific type of custody/facility: ** Facility/Detention Center,Location / Custody Information ** Specific type of custody/facility: ** Specific Type of Custody/Facility,Location / Custody Information ** Specific type of custody/facility: ** TDCJ - Specify Unit,Location / Custody Information ** What location category best describes where the event causing the death occurred? ** Location Category,Location / Custody Information ** What location category best describes where the event causing the death occurred? ** Other Location Category,Location / Custody Information ** What type of custody/facility was the Decedent in at the time of death: ** Type of Custody,Location / Custody Information ** What was the time and date of the deceased's entry into the law enforcement facility where ** Entry Date Time,Location / Custody Information ** What was the time and date of the deceased's entry into the law enforcement facility where the death occurred (mm/dd/yyyy hh:mm AM/PM): ** Entry Date Time N/A,Location / Custody Information ** Where did the death occur? ** Death Location,Location / Custody Information ** Where did the death occur? ** Death Location Elsewhere,Location / Custody Information ** Where did the event causing the death occur? ** City,Location / Custody Information ** Where did the event causing the death occur? ** County,Location / Custody Information ** Where did the event causing the death occur? ** Street Address,Location / Custody Information ** Where did the event causing the death occur? 
** Zip,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death ** Contributory Condition: Acute bronchopneumonia Medical Treatment,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death ** Medical Treatment,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Abdominal discomfort was taking the following medications,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Enalapril-Blood Pressure; Tramterene-Blood Pressure,"Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** End Liver Disease, Hepatitis C, Hypertension diabetes Hyperlipoidemia",Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Medical Treatment Description,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Medications,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Prostate Cancer,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** The deceased had been receiving the following medications,Manner / Cause of Death ** Has a medical examiner or coroner conducted an evaluation to determine a cause of death? 
** Medical Examinor/Coroner Evalution?,"Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (Hold CTRL to select ** Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (Hold CTRL to select ** Type of weapon that caused death?","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (Hold CTRL to select all that apply) ** Other weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Conducted energy device (e.g. Taser); Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Not Applicable; Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Other weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Type of Death Weapon","Manner / Cause of Death ** If death was an accident, homicide or suicide, what was the means of death? ** Means of Death","Manner / Cause of Death ** If death was an accident, homicide or suicide, what was the means of death? ** Means of Death Other","Manner / Cause of Death ** If death was an accident, homicide or suicide, who caused the death? ** Death Causer Other","Manner / Cause of Death ** If death was an accident, homicide or suicide, who caused the death? 
** Who caused the death?",Manner / Cause of Death ** Medical Cause of Death: ** Medical Cause of Death,Manner / Cause of Death ** Was the cause of death the result of a pre-existing medical condition or did the decedent ** Pre existing medical condition?,Manner / Cause of Death ** What was the manner of death? (select only one) ** Death Reason,Manner / Cause of Death ** What was the manner of death? (select only one) ** Manner of Death,Manner / Cause of Death ** What was the manner of death? (select only one) ** Manner of Death Description,Other Information ** Other Information ** Code of Charges,Other Information ** Other Information ** Custody Code,Other Information ** Other Information ** Custody or Incident,Other Information ** Other Information ** Date/Time of Custody or Incident,Other Information ** Other Information ** Death Code,Other Information ** Other Information ** Death Date and Time,Other Information ** Other Information ** Exhibit any medical problems?,Other Information ** Other Information ** Exhibit any mental health problems?,Other Information ** Other Information ** Intoxicated,Other Information ** Other Information ** Make suicidal statements?,Other Information ** Other Information ** Manner of Death Description,Other Information ** Other Information ** Medical Treatment,Other Information ** Other Information ** Medical Treatment Description,Other Information ** Other Information ** Type of Custody,"Summary of Incident ** Summary of How the Death Occurred: (max. 30,000 characters) ** Summary",filename,COUNT Agency Information,COUNT CDR Information,COUNT CDR Summary,COUNT Decedent Info,COUNT Decedent Information,COUNT General Information,COUNT Location / Custody Information,COUNT Manner / Cause of Death,COUNT Other Information,COUNT Summary of Incident,COUNT filename
4,,,,,,,,,,,,,,,,,,,,,,,HOUSTON,HARRIS COUNTY SHERIFF'S DEPT.,,,,PA86103CJ,7/30/1986 12:00 AM,,,,0.0,1/9/1942,,WALPHA,SCOTT,,E,,,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,UNDETERMINED,UNDETERMINED,,5/6/1986 12:00 AM,NATURAL CAUSES,1/9/1942 12:00 AM,,,,,AIDS,Not Applicable,,County Jail,,19420109_a2C31000001uzEOEAY.pdf,0,4,0,6,0,0,0,0,8,0,1
5,,,,,,,,,,,,,,,,,,,,,,,HUNTSVILLE,TX DEPT. OF CRIMINAL JUSTICE,,,,PA84001P,4/27/1989 12:00 AM,,,DATE OF CUSTODY IS INCORRECT - DATE IS UNKNOWN,,,,LARRY,COOK,,E,,Anglo,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,CONVICTED,TEXAS DEPARTMENT OF CORRECTIONS,,1/8/1944 1:00 AM,UNDETERMINED,1/8/1944 1:00 AM,,,,,SEIZURE,Not Applicable,,Penitentiary,,19440108_a2C31000001uyFxEAI.pdf,0,4,1,5,0,0,0,0,8,0,1
6,,,,,,,,,,,,,,,,,,,,,,,,TX DEPT. OF CRIMINAL JUSTICE,,,,PA95321P,9/26/1995 12:00 AM,,,,-22.0,9/18/1972,ADVANCED DEGREE,RENAE,GUERRA,UNKNOWN,,,Hispanic,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,UNDETERMINED,TEXAS DEPARTMENT OF CORRECTIONS,,,HOMICIDE,6/25/1951 7:30 PM,No,No,UNKNOWN,No,SUSPECTED HOMICIDE,No,,Penitentiary,,19510625_a2C31000001uz4tEAA.pdf,0,3,0,8,0,0,0,0,11,0,1
7,,,,,,,,,,,,,,,,,,,,,,,HUNTSVILLE,TX DEPT. OF CRIMINAL JUSTICE,,,,PA85136P,9/10/1985 12:00 AM,,,DATE OF CUSTODY IS INCORRECT - DATE UNKNOWN,0.0,12/4/1954,,ARTURO,AGUILAR,,,,,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,UNDETERMINED,UNDETERMINED,,12/4/1954 12:00 AM,UNDETERMINED,12/4/1954 12:00 AM,,,,,HYPOVOLOMIA SHOCK,Not Applicable,,Penitentiary,,19541204_a2C31000001uy0eEAA.pdf,0,4,1,5,0,0,0,0,8,0,1
8,,,,,,,,,,,,,,,,,,,,,,,CEDAR HILL,CEDAR HILL POLICE DEPARTMENT,,,,PA84084MJ,6/15/1984 12:00 AM,,,DATE OF CUSTODY IS UNKNOWN,,,,RODGER,VESTAL,,T,,Anglo,Male,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,UNDETERMINED,JAIL - MULTIPLE OCCUPANCY CELL,,6/5/1984 12:00 AM,SUICIDE,6/8/1958 12:00 AM,,,,,HANGING,Not Applicable,,Municipal Jail,,19580608_a2C31000001uzGcEAI.pdf,0,4,1,5,0,0,0,0,8,0,1


In [82]:
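# Records with no "Agency Information" fields. Filenames appear to start with
# a YYYYMMDD date, so min() gives the earliest file per COUNT CDR Summary value.
frame = df[df['COUNT Agency Information'] == 0]
frame.groupby('COUNT CDR Summary')['filename'].min()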

COUNT CDR Summary
0    19420109_a2C31000001uzEOEAY.pdf
1    19440108_a2C31000001uyFxEAI.pdf
Name: filename, dtype: object

In [83]:
# Latest filename in the same subset, per COUNT CDR Summary value.
frame.groupby('COUNT CDR Summary')['filename'].max()

COUNT CDR Summary
0    20040831_a2C31000001uzHsEAI.pdf
1    20041230_a2C31000001uz4EEAQ.pdf
Name: filename, dtype: object
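
The two cells above can be collapsed into a single aggregation. The following is only a sketch: `sample` is a hypothetical stand-in for `df` built from the four filenames shown in the outputs, and reading the first eight characters of `filename` as a YYYYMMDD date is an assumption based on how the names look, not something the dataset documents.

```python
import pandas as pd

# Hypothetical stand-in for `df`, using only the filenames printed above.
sample = pd.DataFrame({
    'filename': [
        '19420109_a2C31000001uzEOEAY.pdf',
        '19440108_a2C31000001uyFxEAI.pdf',
        '20040831_a2C31000001uzHsEAI.pdf',
        '20041230_a2C31000001uz4EEAQ.pdf',
    ],
    'COUNT Agency Information': [0, 0, 0, 0],
    'COUNT CDR Summary': [0, 1, 0, 1],
})

legacy = sample[sample['COUNT Agency Information'] == 0]

# Earliest and latest filename per group in one call.
filename_range = legacy.groupby('COUNT CDR Summary')['filename'].agg(['min', 'max'])

# Assumption: the filename prefix is a YYYYMMDD date.
dates = pd.to_datetime(legacy['filename'].str[:8], format='%Y%m%d')
date_range = dates.groupby(legacy['COUNT CDR Summary']).agg(['min', 'max'])

print(filename_range)
print(date_range)
```

Comparing parsed dates rather than raw strings gives the same ordering here, but it would fail loudly on any filename whose prefix is not a valid date, which is a useful check in itself.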

In [86]:
# Records that do have "Agency Information" fields; inspect a random sample.
frame = df[df['COUNT Agency Information'] != 0]
frame.sample(10)

Unnamed: 0,Agency Information ** ** CDR Number,Agency Information ** ** PA Number,Agency Information ** ** Report Date,Agency Information ** ** Status,Agency Information ** ** Version Type,Agency Information ** Agency/Facility Information ** Agency Address,Agency Information ** Agency/Facility Information ** Agency City,Agency Information ** Agency/Facility Information ** Agency County,Agency Information ** Agency/Facility Information ** Agency Name,Agency Information ** Agency/Facility Information ** Agency Number,Agency Information ** Agency/Facility Information ** Agency Phone,Agency Information ** Agency/Facility Information ** Agency State,Agency Information ** Agency/Facility Information ** Agency Zip,Agency Information ** Agency/Facility Information ** Department ID,Agency Information ** Agency/Facility Information ** Department Type,Agency Information ** Director Information ** Director First Name,Agency Information ** Director Information ** Director Last Name,Agency Information ** Director Information ** Director Middle Name,Agency Information ** Director Information ** Director Salutation,Agency Information ** Director Information ** Reporter Email,Agency Information ** Director Information ** Reporter Name,CDR Information ** CDR Information from mainframe data ** Agency Address,CDR Information ** CDR Information from mainframe data ** Agency City,CDR Information ** CDR Information from mainframe data ** Agency Name,CDR Information ** CDR Information from mainframe data ** Agency Phone,CDR Information ** CDR Information from mainframe data ** Agency Zip,CDR Information ** CDR Information from mainframe data ** County,CDR Information ** CDR Information from mainframe data ** PA Number,CDR Information ** CDR Information from mainframe data ** Report Date,CDR Information ** CDR Information from mainframe data ** Reporter Name Original CDR,CDR Information ** CDR Information from mainframe data ** Reporter Title,CDR Summary ** Summary ** Summary,Decedent Info 
** Decedent Information ** Age At Time Of Death,Decedent Info ** Decedent Information ** Date of Birth,Decedent Info ** Decedent Information ** Education,Decedent Info ** Decedent Information ** First Name,Decedent Info ** Decedent Information ** Last Name,Decedent Info ** Decedent Information ** Marital Status,Decedent Info ** Decedent Information ** Middle Name,Decedent Info ** Decedent Information ** Occupation,Decedent Info ** Decedent Information ** Race,Decedent Info ** Decedent Information ** Sex,"Decedent Information ** Date/Time of Custody (arrest, incarceration) (mm/dd/yyyy hh:mm AM/PM): ** Date/Time of Custody or Incident",Decedent Information ** Date/Time of Death (mm/dd/yyyy hh:mm AM/PM): ** Death Date and Time,Decedent Information ** Identity of Deceased ** Age At Time Of Death,Decedent Information ** Identity of Deceased ** Date of Birth,Decedent Information ** Identity of Deceased ** Ethnicity,Decedent Information ** Identity of Deceased ** Ethnicity Other,Decedent Information ** Identity of Deceased ** First Name,Decedent Information ** Identity of Deceased ** Last Name,Decedent Information ** Identity of Deceased ** Middle Name,Decedent Information ** Identity of Deceased ** Race,Decedent Information ** Identity of Deceased ** Sex,Decedent Information ** Identity of Deceased ** Suffix,"General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent ** Decedent display/use of weapons","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Decedent Display or Use Weapon Details","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? 
** Discharged firearm; Displayed other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Displayed other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Displayed other weapon, specify:; Used other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Specify Weapon Displayed","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Specify Weapon Used","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent display or use a weapon? ** Used other weapon, specify","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Appear intoxicated (alcohol or drugs)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Attempt gain possession officer's weapon","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Attempt to Injure Others?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Barricade self or initiate standoff?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Escape or attempt to escape/flee custody","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Exhibit any medical problems?","General Information ** At any time during the incident 
and/or entry into the law enforcement facility, did the decedent: ** Exhibit any mental health problems?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Gain possession of officer's weapon","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Grab, hit or fight with the officer(s)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Make suicidal statements?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Other Behavior","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Physically attempt/assault officer(s)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Resist being handcuffed or arrested?","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Specify Other Behavior","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Specify weapon used to threaten/assault","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Threaten the officer(s) involved","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Try to escape/flee from custody","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Use weapon threaten/assault officer(s)","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Verbally threaten 
other(s) including law","General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Ways Decedent Attempted To Injure Others",General Information ** Did any other law enforcement agencies respond to calls for service related to this incident? ** Other Agencies Respond?,General Information ** Injuries of Decedent ** Injured By,"General Information ** Type of restraint ** Other device, specify",General Information ** Type of restraint ** Type of Restraint,General Information ** Was the deceased under restraint in the time leading up to the death or the events causing the ** Under Restraint,"General Information ** Was the deceased under restraint in the time leading up to the death or the events causing the death? ** Other device, specify",General Information ** What were the most serious offense(s) with which the deceased was (or would have been) ** Offense 1,General Information ** What were the most serious offense(s) with which the deceased was (or would have been) charged with at the time of death? ** Offense 2,General Information ** What were the most serious offense(s) with which the deceased was (or would have been) charged with at the time of death? ** Offense 3,General Information ** What were the most serious offense(s) with which the deceased was (or would have been) charged with at the time of death? ** Were the Charges:,General Information ** What were the types of charges or reason for contact? (Hold CTRL to select all that apply) ** Type of Offense,"General Information ** What were the types of charges or reason for contact? (Hold CTRL to select all that apply) ** Type of Offense, Other",General Information ** What were the types of charges or reason for contact? ** Type of Offense,"General Information ** What were the types of charges or reason for contact? 
** Type of Offense, Other",Location / Custody Information ** Specific type of custody/facility: ** Custody Type Facility,Location / Custody Information ** Specific type of custody/facility: ** Facility/Detention Center,Location / Custody Information ** Specific type of custody/facility: ** Specific Type of Custody/Facility,Location / Custody Information ** Specific type of custody/facility: ** TDCJ - Specify Unit,Location / Custody Information ** What location category best describes where the event causing the death occurred? ** Location Category,Location / Custody Information ** What location category best describes where the event causing the death occurred? ** Other Location Category,Location / Custody Information ** What type of custody/facility was the Decedent in at the time of death: ** Type of Custody,Location / Custody Information ** What was the time and date of the deceased's entry into the law enforcement facility where ** Entry Date Time,Location / Custody Information ** What was the time and date of the deceased's entry into the law enforcement facility where the death occurred (mm/dd/yyyy hh:mm AM/PM): ** Entry Date Time N/A,Location / Custody Information ** Where did the death occur? ** Death Location,Location / Custody Information ** Where did the death occur? ** Death Location Elsewhere,Location / Custody Information ** Where did the event causing the death occur? ** City,Location / Custody Information ** Where did the event causing the death occur? ** County,Location / Custody Information ** Where did the event causing the death occur? ** Street Address,Location / Custody Information ** Where did the event causing the death occur? 
** Zip,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death ** Contributory Condition: Acute bronchopneumonia Medical Treatment,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death ** Medical Treatment,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Abdominal discomfort was taking the following medications,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Enalapril-Blood Pressure; Tramterene-Blood Pressure,"Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** End Liver Disease, Hepatitis C, Hypertension diabetes Hyperlipoidemia",Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Medical Treatment Description,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Medications,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** Prostate Cancer,Manner / Cause of Death ** Had the decedent been receiving treatment for the medical condition that caused the death after admission to your jail's jurisdiction? ** The deceased had been receiving the following medications,Manner / Cause of Death ** Has a medical examiner or coroner conducted an evaluation to determine a cause of death? 
** Medical Examinor/Coroner Evalution?,"Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (Hold CTRL to select ** Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (Hold CTRL to select ** Type of weapon that caused death?","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (Hold CTRL to select all that apply) ** Other weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Conducted energy device (e.g. Taser); Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Not Applicable; Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Other Weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Other weapon, specify","Manner / Cause of Death ** If a weapon caused the death, what type of weapon caused the death? (mark all that apply) ** Type of Death Weapon","Manner / Cause of Death ** If death was an accident, homicide or suicide, what was the means of death? ** Means of Death","Manner / Cause of Death ** If death was an accident, homicide or suicide, what was the means of death? ** Means of Death Other","Manner / Cause of Death ** If death was an accident, homicide or suicide, who caused the death? ** Death Causer Other","Manner / Cause of Death ** If death was an accident, homicide or suicide, who caused the death? 
** Who caused the death?",Manner / Cause of Death ** Medical Cause of Death: ** Medical Cause of Death,Manner / Cause of Death ** Was the cause of death the result of a pre-existing medical condition or did the decedent ** Pre existing medical condition?,Manner / Cause of Death ** What was the manner of death? (select only one) ** Death Reason,Manner / Cause of Death ** What was the manner of death? (select only one) ** Manner of Death,Manner / Cause of Death ** What was the manner of death? (select only one) ** Manner of Death Description,Other Information ** Other Information ** Code of Charges,Other Information ** Other Information ** Custody Code,Other Information ** Other Information ** Custody or Incident,Other Information ** Other Information ** Date/Time of Custody or Incident,Other Information ** Other Information ** Death Code,Other Information ** Other Information ** Death Date and Time,Other Information ** Other Information ** Exhibit any medical problems?,Other Information ** Other Information ** Exhibit any mental health problems?,Other Information ** Other Information ** Intoxicated,Other Information ** Other Information ** Make suicidal statements?,Other Information ** Other Information ** Manner of Death Description,Other Information ** Other Information ** Medical Treatment,Other Information ** Other Information ** Medical Treatment Description,Other Information ** Other Information ** Type of Custody,"Summary of Incident ** Summary of How the Death Occurred: (max. 30,000 characters) ** Summary",filename,COUNT Agency Information,COUNT CDR Information,COUNT CDR Summary,COUNT Decedent Info,COUNT Decedent Information,COUNT General Information,COUNT Location / Custody Information,COUNT Manner / Cause of Death,COUNT Other Information,COUNT Summary of Incident,COUNT filename
6112,,PA07011P,1/25/2007 8:30 AM,Submitted,,"2503 Lake Road, Suite 5",Huntsville,Travis,Texas Department Of Criminal Justice,TX236065C,9364380000.0,TX,77340,848.0,STAGENCY,Brad,Livingston,,Mr.,delana.wilson@tdcj.state.tx.us,Delana Wilson,,,,,,,,,,,,,,,,,,,,,,7/24/1999 12:00 AM,1/11/2007 3:24 AM,,10/8/1937,Anglo,,Phillip,Harwell,Coleman,,Male,,,,,,,,,,No,,,,,,,,No,,,,No,,,No,No,0.0,,,,Injured by NA,,,No,,Assault w/Intent to Commit Rape,Felony DWI,,Convicted,,,,,Estelle,,TDCJ,,,,Penitentiary,7/24/1999 12:00 AM,,At law enforcement facility,,Huntsville,Walker,264 FM 3478,,,Not Applicable,,,,,,,,"Yes, results are available",,,,,,,,Not Applicable,Not applicable; cause of death was intoxicatio...,,,"Not applicable; cause of death was suicide, in...",Cardiac arrhythmia due to severe coronary arte...,Don't know,Not applicable,Natural Causes/Illness,Cardiac arrhythmia,,,,,,,,,,,,,,,Phillip Harwell was found on the floor of his ...,20070111_a2C31000001ujwCEAQ.pdf,18,0,0,0,8,11,8,10,0,1,1
9400,,PA15054P,2/5/2015 4:58 PM,Submitted,,"2503 Lake Road, Suite 5",Huntsville,Walker,Texas Department Of Criminal Justice,TX236065C,9364375116.0,TX,77340,848.0,STAGENCY,Brad,Livingston,,Mr.,Analou.Sievers@tdcj.texas.gov,Analou Sievers,,,,,,,,,,,,,,,,,,,,,,3/15/1990 12:00 AM,2/4/2015 11:44 AM,,10/27/1946,African-American,,Danny,Anderson,,,Male,,,,,,,,,,No,,,,,,,,No,,,,No,,,No,No,0.0,,,,Injured by NA,,,No,,Murder,,,Convicted,,,,,Michael Unit,,TDCJ,,,,Penitentiary,3/15/1990 12:00 AM,,At law enforcement facility,,Tennessee Colony,Anderson,2664 FM 2054,,,Yes,,,,Treatment for Rectal Cancer Medications: Aceta...,,,,"Yes, results are available",,,,,,,,Not Applicable,Not applicable; cause of death was intoxicatio...,,,"Not applicable; cause of death was suicide, in...",Stage IV metastatic colrectal carcinoma,Don\'t know,Not applicable,Natural Causes/Illness,Rectal Cancer,,,,,,,,,,,,,,,"On February 4, 2015, Offender Anderson was pro...",20150204_a2C31000001usjJEAQ.pdf,18,0,0,0,7,10,8,11,0,1,1
9474,,PA15111P,3/19/2015 10:37 AM,Submitted,,"2503 Lake Road, Suite 5",Huntsville,Walker,Texas Department Of Criminal Justice,TX236065C,9364375116.0,TX,77340,848.0,STAGENCY,Brad,Livingston,,Mr.,Analou.Sievers@tdcj.texas.gov,Analou Sievers,,,,,,,,,,,,,,,,,,,,,,1/13/2014 12:00 AM,3/16/2015 5:27 AM,,12/4/1960,Hispanic,,Richard,Ortega,,,Male,,,,,,,,,,No,,,,,,,,No,,,,No,,,No,No,0.0,,,,Injured by NA,,,No,,Sexual Assault of a Child,,,Convicted,,,,,Hospital Galveston,,TDCJ,,,,Penitentiary,1/13/2014 12:00 AM,,At medical facility,,Galveston,Galveston,809 Harborside Drive,,,Yes,,,,"Treatment for Hepatitis C Medications: Invanz,...",,,,"No, evaluation not planned",,,,,,,,Not Applicable,Not applicable; cause of death was intoxicatio...,,,"Not applicable; cause of death was suicide, in...","Respiratory Failure, Decompensated Liver Cirrh...",Don\'t know,Medical condition only (e.g. heart attack),Natural Causes/Illness,Respiratory Failure,,,,,,,,,,,,,,,Offender Ortega was admitted to hospital for t...,20150316_a2C31000001ussLEAQ.pdf,18,0,0,0,7,10,8,11,0,1,1
7640,,PA11264CJ,10/4/2011 12:00 AM,Submitted,,P. O. Box 401,Marlin,Falls,Falls County Sheriff's Dept.,TX0730000,2548830000.0,TX,76661,266.0,SHERIFF,Ben,Kirk,,Sheriff,renee.gray@oag.state.tx.us,Sheriff Ben Kirk,,,,,,,,,,,,,,,,,,,,,,10/4/2011 2:00 AM,10/4/2011 4:02 AM,,1/4/1979,African-American,,Andrea,Yantis,Kathleen,,Female,,,,,,,,,,No,,,,,,,,No,,,,No,,,No,No,0.0,,,,Injured by NA,,,No,,Possession of controlled substance,,,Filed,,,,,,,Jail - holding cell,,,,County Jail,10/4/2011 2:00 AM,,At law enforcement facility,,Marlin,Falls,2847 State Highway 6,,,Not Applicable,,,,,,,,"Yes, results are available",,,,,,,,Not Applicable,"Hanging, strangulation",,,"Not applicable; cause of death was suicide, in...",Hanging,Not Applicable; cause of death was accidental ...,Not applicable,Suicide,,,,,,,,,,,,,,,,SUPPLEMENTAL OFFENSE REPORT Case Number 11-10-...,20111004_a2C31000001uoVhEAI.pdf,18,0,0,0,8,10,7,9,0,1,1
10159,,PA16152C,4/13/2016 3:23 PM,Submitted,,1200 Travis,Houston,Harris,Houston Police Dept.,TXHPD0000,7133081600.0,TX,77002,399.0,POLICE,Martha,Montalvo,I.,Acting Chief,stephanie.reyes@houstonpolice.org,Off SA Reyes (Info provided by Lt. KJ Deese),,,,,,,,,,,,,,,,,,,,,,3/17/2016 6:45 PM,3/17/2016 6:55 PM,,8/21/1986,African-American,,Scott,Bennett,Lance,,Male,,,,,,,,,,No,,,,,,,,No,,,,No,Reached toward waistband,,No,Yes,0.0,,,,Injured by Officer,,,No,,Aggravated Robbery,,,Not filed at time of death,,,,,,,Custody of Peace Officer during/fleeing arrest,,,,Police Custody (pre-booking),,,At the crime/arrest scene,,Houston,Harris,11314 North Fwy,,,Not Applicable,,,,,,,,"Yes, results are available",,,,,,,,Rifle/Shotgun,Firearm,,,Law enforcement/correctional staff,"Gunshot Wounds of the neck, back, chest, left ...",Not Applicable; cause of death was accidental ...,Injuries only,Justifiable Homicide,,,,,,,,,,,,,,,,"On Thursday March 17, 2016, at about 10:00 am,...",20160317_a2C31000001uuaOEAQ.pdf,19,0,0,0,8,11,6,9,0,1,1
6033,,PA06270C,11/21/2006 3:57 PM,Submitted,,1200 Travis,Houston,Harris,Houston Police Dept.,TXHPD0000,7133080000.0,TX,77002,399.0,POLICE,Harold,Hurtt,L,Chief,Henry.Gaw2@cityofhouston.net,Henry J. Gaw,,,,,,,,,,,,,,,,,,,,,,10/23/2006 1:15 AM,10/23/2006 1:15 AM,,9/19/1960,African-American,,Ronald,Taylor,Joseph,,Male,,,,,,,,,,No,,,,,,,,No,,,,Yes,,,Yes,Yes,1.0,,,,Injured by Officer,,,No,,Attempted Capital Murder of a Peace Officer,,,Not filed at time of death,,,,,,,Custody of Peace Officer during/fleeing arrest,,,,Police Custody (pre-booking),,,At the crime/arrest scene,,Houston,Harris,4323 Osby,,,Not Applicable,,,,,,,,"Yes, results are available",,,,,,,,Handgun,Firearm,,,Law enforcement/correctional staff,Gun shot wound of Chest,Not Applicable; cause of death was accidental ...,Injuries only,Justifiable Homicide,,,,,,,,,,,,,,,,"In the early morning hours of October 23, 2006...",20061023_a2C31000001ujnjEAA.pdf,19,0,0,0,8,10,6,9,0,1,1
10643,16-64-P,,1/12/2017 2:05 PM,Submitted,AMENDED,"2503 Lake Road, Suite 5",Huntsville,,TDCJ/Office of the Inspector General,,,TX,77340,,,John,West,,Other,analou.sievers@tdcj.texas.gov,Analou Sievers,,,,,,,,,,,,,,,,,,,,,,2/1/2013 12:00 AM,12/26/2016 4:26 PM,53.0,3/18/1963,,,Ronny,Smitley,,Anglo or White,Male,,No,,,,,,,,,,No,,,,,,,,,,,,,,,,,,No,,,,No,,Indecency with a Child,Fail to Comply with Sex Offender Registration,,Convicted,Crimes Against Child(ren),,,,,,"TDCJ, specify",Skyview Unit,Law Enforcement Facility,,Penitentiary,2/1/2013 12:00 AM,,Medical facility,,Rusk,Cherokee,379 FM 2972 W,75785.0,,No,,,,,,,,"Yes, results are available",,Not Applicable,,,,,,,"Not applicable, cause of death was illness/nat...",,,Not applicable,Sudden Cardiac Death,Deceased developed condition after admission,,Natural,,,,,,,,,,,,,,,,"On December 25, 2016, security staff discovere...",20161226_a2C31000001uxUOEAY.pdf,14,0,0,0,8,8,10,8,0,1,1
10628,16-51-P,,1/9/2017 2:23 PM,Submitted,AMENDED,"2503 Lake Road, Suite 5",Huntsville,,TDCJ/Office of the Inspector General,,,TX,77340,,,John,West,,Other,analou.sievers@tdcj.texas.gov,Analou Sievers,,,,,,,,,,,,,,,,,,,,,,6/16/2014 12:00 AM,12/16/2016 12:58 PM,31.0,11/5/1985,,,Michael,Brock,,Anglo or White,Male,,No,,,,,,,,No,No,No,No,No,No,No,No,,No,,No,No,,,,,,No,,No,,,,No,,Aggravated Assault with a Deadly Weapon,,,Convicted,Violent Crime Against Persons,,,,,,"TDCJ, specify",Telford Unit,Law Enforcement Facility,,Penitentiary,6/16/2014 12:00 AM,,Law enforcement facility/booking center,,New Boston,Bowie,3899 State Highway 98 South,75570.0,,Unknown,,,,,,,,"Yes, results are available",,Not Applicable,,,,,,,Unknown,,,Not applicable,Atherosclerotic Cardiovascular Disease,Could not be determined,,Natural,,,,,,,,,,,,,,,,"On December 16, 2016, Offender Brock was found...",20161216_a2C31000001uxDJEAY.pdf,14,0,0,0,8,18,10,8,0,1,1
5511,,PA05037C,4/18/2005 11:04 AM,Submitted,,P. O. Box 2000,Lubbock,LUBBOCK,Lubbock Police Dept.,TX1520200,8067750000.0,TX,79457,537.0,POLICE,Kenneth,Walker,A.,Chief,renee.gray@oag.state.tx.us,Renee Gray,,,,,,,,,,,,,,,,,,,,,,2/13/2005 1:10 AM,2/15/2005 1:30 PM,,11/30/1961,Hispanic,,Pedro,Alcorte,Briseno,,Male,Jr.,,,,,,,,,Yes,,,,,,,,No,,,,No,,,No,Yes,0.0,,,,Injured by Self Accident,,,No,,Alcohol/Drug Offense,,,Not filed at time of death,,,,,,,Custody of Peace Officer during/fleeing arrest,,,,Police Custody (pre-booking),,,At medical facility,,Lubbock,Lubbock,1100 4th Street,,,Not Applicable,,,,,,,,"Yes, results are available",,,,,,,,Not Applicable,Other,,,Deceased,Cranio Cerebral Injuries due to Blunt Force Tr...,Not Applicable; cause of death was accidental ...,Injuries only,Accidental injury to self,,,,,,,,,,,,,,,,"While on patrol, Officer Dustin Tucker observe...",20050215_a2C31000001uiVYEAY.pdf,19,0,0,0,9,10,6,9,0,1,1
10620,16-47-P,,1/9/2017 11:40 AM,Submitted,AMENDED,"2503 Lake Road, Suite 5",Huntsville,,TDCJ/Office of the Inspector General,,,TX,77340,,,John,West,,Other,analou.sievers@tdcj.texas.gov,Analou Sievers,,,,,,,,,,,,,,,,,,,,,,6/14/2012 12:00 AM,12/13/2016 11:55 AM,66.0,4/25/1950,,,Gary,Myre,,Anglo or White,Male,,No,,,,,,,,Unknown,No,No,No,No,Unknown,Unknown,No,,Unknown,,No,No,,,,,,No,,No,,,,No,,Felony Driving While Intoxicated,,,Convicted,Alcohol / drug offense,,,,,,"TDCJ, specify",Michael Unit,Law Enforcement Facility,,Penitentiary,6/14/2012 12:00 AM,,Medical facility,,Tennessee Colony,Anderson,2664 FM 2064,75886.0,,Unknown,,,,,,,,"Yes, results are available",,Unknown,,,,,,,"Not applicable, cause of death was illness/nat...",,,Not applicable,Sudden cardiac death due to coronary heart dis...,Could not be determined,,Natural,,,,,,,,,,,,,,,,"On December 13, 2016, Offender Myre was having...",20161213_a2C31000001uxBhEAI.pdf,14,0,0,0,8,18,10,8,0,1,1


In [87]:
MHEALTH = 'General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Exhibit any mental health problems?'

In [88]:
print(frame[MHEALTH].isnull().sum())
frame[MHEALTH].value_counts()

5239


Unknown    527
No         293
Yes         87
Name: General Information ** At any time during the incident and/or entry into the law enforcement facility, did the decedent: ** Exhibit any mental health problems?, dtype: int64
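
The cell above reports the null count and the value tally separately. As a minimal sketch (not part of the original notebook), both can be folded into one table by labelling missing entries before counting. The helper name `counts_with_missing` and the `'(missing)'` label are hypothetical, and the toy series stands in for `frame[MHEALTH]`.

```python
import numpy as np
import pandas as pd


def counts_with_missing(series, label='(missing)'):
    """Tally every value in `series`, counting NaN under `label`."""
    # fillna replaces NaN with a visible label so value_counts keeps it
    return series.fillna(label).value_counts()


# Toy stand-in for frame[MHEALTH]; the real column is not reproduced here
toy = pd.Series(['Yes', 'No', np.nan, 'Unknown', np.nan, 'No'])
summary = counts_with_missing(toy)
print(summary)
```

On the real data this would be called as `counts_with_missing(frame[MHEALTH])`, assuming `frame` is the DataFrame used in the surrounding cells.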

In [90]:
frame['Agency Information **  ** Version Type'].isnull().mean()

0.8301334201106411

In [91]:
frame['Agency Information **  ** Version Type'].value_counts()

ORIGINAL VERSION    754
AMENDED             290
Name: Agency Information **  ** Version Type, dtype: int64

In [98]:
# Fraction of missing values in each column, printed as "fraction column-name"
for k, v in frame.isnull().mean().items():
    print("%.2f" % v, k)

0.83 Agency Information **  ** CDR Number
0.17 Agency Information **  ** PA Number
0.00 Agency Information **  ** Report Date
0.00 Agency Information **  ** Status
0.83 Agency Information **  ** Version Type
0.00 Agency Information ** Agency/Facility Information ** Agency Address
0.00 Agency Information ** Agency/Facility Information ** Agency City
0.17 Agency Information ** Agency/Facility Information ** Agency County
0.00 Agency Information ** Agency/Facility Information ** Agency Name
0.17 Agency Information ** Agency/Facility Information ** Agency Number
0.17 Agency Information ** Agency/Facility Information ** Agency Phone
0.01 Agency Information ** Agency/Facility Information ** Agency State
0.00 Agency Information ** Agency/Facility Information ** Agency Zip
0.17 Agency Information ** Agency/Facility Information ** Department ID
0.17 Agency Information ** Agency/Facility Information ** Department Type
0.00 Agency Information ** Director Information ** Director First Name
0.00 Ag
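
The column names follow a `Section ** Subsection ** Field` pattern, so the per-column null fractions can be rolled up by section to make the long listing easier to scan. The following is a sketch under that assumption. The function name `null_fraction_by_section` is hypothetical, and the toy frame uses made-up column names rather than the real ones.

```python
import numpy as np
import pandas as pd


def null_fraction_by_section(df, sep=' ** '):
    """Average the per-column null fraction within each top-level section."""
    frac = df.isnull().mean()
    # The section is whatever precedes the first separator in the column name
    sections = pd.Series([str(col).split(sep)[0] for col in frac.index],
                         index=frac.index)
    return frac.groupby(sections).mean().sort_values(ascending=False)


# Toy frame with the same naming pattern (column names are illustrative only)
toy = pd.DataFrame({
    'A ** x ** f1': [1.0, np.nan],
    'A ** x ** f2': [np.nan, np.nan],
    'B ** y ** f3': [1.0, 2.0],
})
by_section = null_fraction_by_section(toy)
print(by_section)
```

This assumes column names are unique. Sections that are mostly empty for a given form version should then stand out at the top of the result.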

In [99]:
# Cross-tabulate presence of the two identifiers: rows = has CDR Number, columns = has PA Number
pd.crosstab(frame['Agency Information **  ** CDR Number'].notnull(), frame['Agency Information **  ** PA Number'].notnull())

Agency Information ** ** PA Number,False,True
Agency Information ** ** CDR Number,Unnamed: 1_level_1,Unnamed: 2_level_1
False,10,5093
True,1043,0
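
The crosstab shows that no row carries both identifiers: 5093 rows have only a PA Number, 1043 have only a CDR Number, and 10 have neither. One possible follow-up, sketched here as an assumption rather than as the notebook's actual next step, is to coalesce the two into a single identifier and record where it came from. The output column names `report_id` and `id_source` are hypothetical, and the toy rows are illustrative.

```python
import numpy as np
import pandas as pd

CDR = 'Agency Information **  ** CDR Number'
PA = 'Agency Information **  ** PA Number'

# Toy rows covering the three observed cases: CDR only, PA only, neither
toy = pd.DataFrame({
    CDR: ['16-64-P', np.nan, np.nan],
    PA: [np.nan, 'PA07011P', np.nan],
})

# Prefer the CDR Number and fall back to the PA Number where CDR is missing
toy['report_id'] = toy[CDR].combine_first(toy[PA])

# Record which field supplied the identifier
toy['id_source'] = np.select(
    [toy[CDR].notnull(), toy[PA].notnull()],
    ['CDR', 'PA'],
    default='none',
)
print(toy[['report_id', 'id_source']])
```

Because the two fields never overlap in this dataset, the order of preference in `combine_first` does not change the result; it would matter only if a future export contained rows with both.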
