# 0. Introduction

This notebook carries out the ***data preprocessing stage of the KDD process***, applied here to the dataset of clinical disease cases, so that the data can later be analysed with the aim of diagnosing infectious diseases from the recorded symptoms.

# 1. Libraries

Before starting any task, we must make sure that the required libraries are correctly imported.

In [None]:
# Data loading and handling
from google.colab import files
import io

# Library for working with DataFrames
import pandas as pd

# 2. Loading the data

The first step is to upload all the CSV files needed for this process. Here we load the clinical-case database produced by the data selection stage, together with an auxiliary database that gives us access to the full list of symptoms. To do so, we use the tools that Google Colab provides for uploading files.
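The cells in this section read the CSV files by name, which assumes they are already present in the Colab working directory. A minimal sketch of the upload step itself is given here. It relies on `files.upload()` returning a dictionary that maps each file name to its raw bytes; so that the sketch also runs outside Colab, that dictionary is simulated with a tiny made-up CSV, and `demo_casos` is a hypothetical name used only for this illustration.

```python
import io
import pandas as pd

# In Colab this would be:
#   from google.colab import files
#   uploaded = files.upload()   # dict: file name -> file content as bytes
# Simulated here with a tiny, made-up file so the sketch runs anywhere.
uploaded = {
    'casos_infecciosas.csv': (
        b',Disease,Symptom_1,Symptom_2\n'
        b'0,Fungalinfection,itching,skin_rash\n'
        b'1,Fungalinfection,skin_rash,\n'
    )
}

# Wrap the bytes in a file-like object so pandas can parse them
demo_casos = pd.read_csv(io.BytesIO(uploaded['casos_infecciosas.csv']), sep=',')

# The saved row index comes back as an 'Unnamed: 0' column, so drop it
demo_casos = demo_casos.drop(['Unnamed: 0'], axis=1)
```

This is also where the `io` import from the previous section would be used.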

In [None]:
casos = pd.read_csv('casos_infecciosas.csv', sep=',')
casos = casos.drop(['Unnamed: 0'], axis = 1)
casos = casos.fillna(0)
casos

Unnamed: 0,Disease,Symptom_1,Symptom_2,Symptom_3,Symptom_4,Symptom_5,Symptom_6,Symptom_7,Symptom_8,Symptom_9,Symptom_10,Symptom_11,Symptom_12,Symptom_13,Symptom_14,Symptom_15,Symptom_16,Symptom_17
0,Fungalinfection,itching,skin_rash,nodal_skin_eruptions,dischromic_patches,0,0,0,0,0,0,0,0,0,0,0,0,0
1,Fungalinfection,skin_rash,nodal_skin_eruptions,dischromic_patches,0,0,0,0,0,0,0,0,0,0,0,0,0,0
2,Fungalinfection,itching,nodal_skin_eruptions,dischromic_patches,0,0,0,0,0,0,0,0,0,0,0,0,0,0
3,Fungalinfection,itching,skin_rash,dischromic_patches,0,0,0,0,0,0,0,0,0,0,0,0,0,0
4,Fungalinfection,itching,skin_rash,nodal_skin_eruptions,0,0,0,0,0,0,0,0,0,0,0,0,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
3235,Pharyngotonsillitis,high_fever,red_tonsils,malaise,swollen_tonsils,cough,0,0,0,0,0,0,0,0,0,0,0,0
3236,Pharyngotonsillitis,malaise,swollen_tonsils,cough,mild_fever,0,0,0,0,0,0,0,0,0,0,0,0,0
3237,Pharyngotonsillitis,swollen_tonsils,red_tonsils,high_fever,cough,0,0,0,0,0,0,0,0,0,0,0,0,0
3238,Pharyngotonsillitis,high_fever,cough,swollen_tonsils,0,0,0,0,0,0,0,0,0,0,0,0,0,0


In [None]:
casos_sintomas = pd.read_csv('Symptom-severity.csv', sep=',')
casos_sintomas

Unnamed: 0,Symptom,weight
0,itching,1
1,skin_rash,3
2,nodal_skin_eruptions,4
3,continuous_sneezing,4
4,shivering,5
...,...,...
128,inflammatory_nails,2
129,blister,4
130,red_sore_around_nose,2
131,yellow_crust_ooze,3


# 3. Data preprocessing

The goal of preprocessing is to transform the current database into another one with as many columns as there are symptoms and as many rows as there are clinical-case records in the dataset.
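To make the target layout concrete, here is a small sketch of the encoding for a single clinical case, in plain Python. The three-symptom vocabulary and the helper name `encode_case` are hypothetical and serve only as an illustration; the notebook itself works on the full symptom list with pandas.

```python
def encode_case(symptoms, vocabulary):
    """Return a 0/1 list with one position per symptom in `vocabulary`."""
    present = set(symptoms)
    return [1 if name in present else 0 for name in vocabulary]

# Hypothetical three-symptom vocabulary, for illustration only
vocabulary = ['itching', 'skin_rash', 'cough']

# One clinical case in the original layout: a short list of symptom names
case = ['skin_rash', 'itching']

encoded = encode_case(case, vocabulary)
print(encoded)  # [1, 1, 0]
```

Every case ends up with the same number of positions, regardless of how many symptoms it originally listed.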

The first step, therefore, is to store every existing symptom in a list.

In [None]:
# Start with the label column, then add every symptom from the auxiliary table
lista_sintomas = ['Enfermedad']
lista_sintomas.extend(casos_sintomas['Symptom'].tolist())

# Also add the symptoms we created that were not in the clinical database
lista_sintomas.extend([
    "red_and_dry_tongue", "red_skin", "vascular_complications",
    "respiratory_insufficiency", "immunosuppression", "lung_pain",
    "insomia", "itchy_anus", "swollen_tonsils", "red_tonsils", "wheeze",
])

With this list, we create a new dataframe that has one column for each symptom in the list.

In [None]:
dataframe_casos = pd.DataFrame(columns = lista_sintomas)
dataframe_casos

Unnamed: 0,Enfermedad,itching,skin_rash,nodal_skin_eruptions,continuous_sneezing,shivering,chills,joint_pain,stomach_pain,acidity,ulcers_on_tongue,muscle_wasting,vomiting,burning_micturition,spotting_urination,fatigue,weight_gain,anxiety,cold_hands_and_feets,mood_swings,weight_loss,restlessness,lethargy,patches_in_throat,irregular_sugar_level,cough,high_fever,sunken_eyes,breathlessness,sweating,dehydration,indigestion,headache,yellowish_skin,dark_urine,nausea,loss_of_appetite,pain_behind_the_eyes,back_pain,constipation,...,increased_appetite,polyuria,family_history,mucoid_sputum,rusty_sputum,lack_of_concentration,visual_disturbances,receiving_blood_transfusion,receiving_unsterile_injections,coma,stomach_bleeding,distention_of_abdomen,history_of_alcohol_consumption,fluid_overload,blood_in_sputum,prominent_veins_on_calf,palpitations,painful_walking,pus_filled_pimples,blackheads,scurring,skin_peeling,silver_like_dusting,small_dents_in_nails,inflammatory_nails,blister,red_sore_around_nose,yellow_crust_ooze,prognosis,red_and_dry_tongue,red_skin,vascular_complications,respiratory_insufficiency,immunosuppression,lung_pain,insomia,itchy_anus,swollen_tonsils,red_tonsils,wheeze


We add as many rows as there are clinical-case records; for now, every symptom is filled with the value 0.

In [None]:
dataframe_casos["Enfermedad"] = casos["Disease"]
dataframe_casos = dataframe_casos.fillna(0)
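As an alternative sketch, the zero-filled table can be built in a single step by passing a scalar together with an explicit index and column list to the `DataFrame` constructor. The toy frames and names (`toy_casos`, `toy_sintomas`, `toy_frame`) are made up for illustration and are not part of the notebook.

```python
import pandas as pd

# Toy stand-ins for `casos` and the symptom list (made-up values)
toy_casos = pd.DataFrame({'Disease': ['Fungalinfection', 'Pharyngotonsillitis']})
toy_sintomas = ['itching', 'cough']

# One row per clinical case, one integer column per symptom, all set to 0
toy_frame = pd.DataFrame(0, index=toy_casos.index, columns=toy_sintomas)

# Put the disease label in the first column
toy_frame.insert(0, 'Enfermedad', toy_casos['Disease'])
```

Built this way, the symptom columns hold integers from the start rather than being filled in afterwards.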

Each record is now checked to see which symptoms it presents. The symptoms that a record has are set to 1.

In [None]:
def funcion(col):
  # For each clinical case, mark the symptom stored in column `col` (if any) with 1
  for i in casos.index:
    sintoma = casos.at[i, col]
    if sintoma != 0:
      dataframe_casos.at[i, sintoma] = 1

# The 17 symptom columns of the original dataset: Symptom_1 ... Symptom_17
col_sintomas = [f'Symptom_{n}' for n in range(1, 18)]

for col in col_sintomas:
  funcion(col)
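The same marking step can also be expressed without explicit loops. The sketch below is one possible vectorised variant, not the notebook's method: it reshapes the symptom columns into a single long column with `melt`, builds indicator columns with `pd.get_dummies`, and collapses them back to one row per case. The toy data and the four-symptom vocabulary are made up.

```python
import pandas as pd

# Toy data in the same layout as `casos` (made-up rows, 0 marks an empty slot)
toy = pd.DataFrame({
    'Disease':   ['Fungalinfection', 'Pharyngotonsillitis'],
    'Symptom_1': ['itching', 'cough'],
    'Symptom_2': ['skin_rash', 0],
})
vocabulary = ['itching', 'skin_rash', 'cough', 'wheeze']

# Reshape to one symptom per row, keeping the original row index
long = toy.melt(id_vars='Disease',
                value_vars=['Symptom_1', 'Symptom_2'],
                ignore_index=False)['value']
long = long[long != 0]  # discard the empty slots

# One indicator column per symptom, then collapse back to one row per case
indicators = pd.get_dummies(long).groupby(level=0).max().astype(int)

# Align with the full vocabulary; symptoms never seen become all-zero columns
indicators = indicators.reindex(index=toy.index, columns=vocabulary, fill_value=0)
```

On a few thousand rows the loop version is fast enough, so this is mainly a matter of style.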

In [None]:
dataframe_casos

Unnamed: 0,Enfermedad,itching,skin_rash,nodal_skin_eruptions,continuous_sneezing,shivering,chills,joint_pain,stomach_pain,acidity,ulcers_on_tongue,muscle_wasting,vomiting,burning_micturition,spotting_urination,fatigue,weight_gain,anxiety,cold_hands_and_feets,mood_swings,weight_loss,restlessness,lethargy,patches_in_throat,irregular_sugar_level,cough,high_fever,sunken_eyes,breathlessness,sweating,dehydration,indigestion,headache,yellowish_skin,dark_urine,nausea,loss_of_appetite,pain_behind_the_eyes,back_pain,constipation,...,increased_appetite,polyuria,family_history,mucoid_sputum,rusty_sputum,lack_of_concentration,visual_disturbances,receiving_blood_transfusion,receiving_unsterile_injections,coma,stomach_bleeding,distention_of_abdomen,history_of_alcohol_consumption,fluid_overload,blood_in_sputum,prominent_veins_on_calf,palpitations,painful_walking,pus_filled_pimples,blackheads,scurring,skin_peeling,silver_like_dusting,small_dents_in_nails,inflammatory_nails,blister,red_sore_around_nose,yellow_crust_ooze,prognosis,red_and_dry_tongue,red_skin,vascular_complications,respiratory_insufficiency,immunosuppression,lung_pain,insomia,itchy_anus,swollen_tonsils,red_tonsils,wheeze
0,Fungalinfection,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
1,Fungalinfection,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
2,Fungalinfection,1,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
3,Fungalinfection,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
4,Fungalinfection,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
3235,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0
3236,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0
3237,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0
3238,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0


Next, we check whether any symptom is absent from every clinical case; if so, that symptom is removed.

In [None]:
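lista_borrar = []
# DataFrame.iteritems() was removed in pandas 2.0; items() is the equivalent
for (colname, colval) in dataframe_casos.items():
    # Keep the column name if the value 1 never appears in it
    if 1 not in colval.values:
        lista_borrar.append(colname)

# The label column never contains 1, so take it back out of the list
lista_borrar.remove('Enfermedad')
print(lista_borrar)
len(lista_borrar)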

['shivering', 'stomach_pain', 'acidity', 'ulcers_on_tongue', 'muscle_wasting', 'burning_micturition', 'spotting_urination', 'anxiety', 'patches_in_throat', 'irregular_sugar_level', 'fluid_overload', 'swelling_of_stomach', 'blurred_and_distorted_vision', 'weakness_in_limbs', 'pain_during_bowel_movements', 'pain_in_anal_region', 'bloody_stool', 'irritation_in_anus', 'neck_pain', 'cramps', 'bruising', 'obesity', 'swollen_legs', 'swollen_blood_vessels', 'extra_marital_contacts', 'drying_and_tingling_lips', 'slurred_speech', 'knee_pain', 'hip_joint_pain', 'stiff_neck', 'swelling_joints', 'movement_stiffness', 'spinning_movements', 'loss_of_balance', 'unsteadiness', 'weakness_of_one_body_side', 'bladder_discomfort', 'foul_smell_ofurine', 'continuous_feel_of_urine', 'altered_sensorium', 'watering_from_eyes', 'increased_appetite', 'polyuria', 'mucoid_sputum', 'lack_of_concentration', 'visual_disturbances', 'distention_of_abdomen', 'history_of_alcohol_consumption', 'fluid_overload', 'prominent_

60
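A compact way to find the unused symptoms is to sum each symptom column and keep those whose total is 0. The sketch below shows the idea on a made-up table; `sin_uso` is a hypothetical name, and it assumes the symptom columns contain only 0 and 1.

```python
import pandas as pd

# Toy indicator table (made-up values)
toy = pd.DataFrame({
    'Enfermedad': ['Fungalinfection', 'Pharyngotonsillitis'],
    'itching':    [1, 0],
    'shivering':  [0, 0],
    'cough':      [0, 1],
})

# Look only at the symptom columns, leaving the label column aside
sintomas = toy.drop(columns='Enfermedad')

# A symptom is unused when its column sums to 0 across all cases
sin_uso = sintomas.columns[sintomas.sum(axis=0) == 0].tolist()
print(sin_uso)  # ['shivering']
```

Because the label column is set aside first, there is no need to remove `'Enfermedad'` from the result afterwards.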

In [None]:
dataframe_casos = dataframe_casos.drop(lista_borrar,axis=1)
dataframe_casos

Unnamed: 0,Enfermedad,itching,skin_rash,nodal_skin_eruptions,continuous_sneezing,chills,joint_pain,vomiting,fatigue,weight_gain,cold_hands_and_feets,mood_swings,weight_loss,restlessness,lethargy,cough,high_fever,sunken_eyes,breathlessness,sweating,dehydration,indigestion,headache,yellowish_skin,dark_urine,nausea,loss_of_appetite,pain_behind_the_eyes,back_pain,constipation,abdominal_pain,diarrhoea,mild_fever,yellow_urine,yellowing_of_eyes,acute_liver_failure,swelled_lymph_nodes,malaise,phlegm,throat_irritation,...,fast_heart_rate,dizziness,puffy_face_and_eyes,enlarged_thyroid,brittle_nails,swollen_extremeties,excessive_hunger,muscle_weakness,loss_of_smell,passage_of_gases,internal_itching,toxic_look_(typhos),depression,irritability,muscle_pain,red_spots_over_body,belly_pain,abnormal_menstruation,dischromic_patches,family_history,rusty_sputum,receiving_blood_transfusion,receiving_unsterile_injections,coma,stomach_bleeding,blood_in_sputum,blister,red_sore_around_nose,yellow_crust_ooze,red_and_dry_tongue,red_skin,vascular_complications,respiratory_insufficiency,immunosuppression,lung_pain,insomia,itchy_anus,swollen_tonsils,red_tonsils,wheeze
0,Fungalinfection,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
1,Fungalinfection,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
2,Fungalinfection,1,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
3,Fungalinfection,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
4,Fungalinfection,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
3235,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0
3236,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,1,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0
3237,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0
3238,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0


In addition, at this point the expert is consulted about the symptoms and diseases present in the database. Regarding the different types of hepatitis, the expert recommends keeping only the clinical cases of hepatitis A, since it is the most common type in children and predominates over the others.

The expert also recommends removing some symptoms that are either too severe or not applicable to children (coma, blood in the sputum, blood transfusions, abnormal menstruation, depression), or that appear infrequently and are also not something a person can observe when reporting their symptoms.

In [None]:
# Remove the diseases that are not needed (all hepatitis types except A)
hepatitis_descartadas = ['HepatitisB', 'HepatitisC', 'HepatitisD', 'HepatitisE']
dataframe_casos = dataframe_casos[~dataframe_casos['Enfermedad'].isin(hepatitis_descartadas)]

In [None]:
# Remove unsuitable symptoms
lista = ['depression','abnormal_menstruation','family_history','receiving_blood_transfusion',
         'coma','rusty_sputum','blood_in_sputum','blister','back_pain','lung_pain','fast_heart_rate',
         'brittle_nails','yellow_crust_ooze','pain_behind_the_eyes','passage_of_gases','sunken_eyes','yellowing_of_eyes']

dataframe_casos = dataframe_casos.drop(lista,axis=1)

dataframe_casos

Unnamed: 0,Enfermedad,itching,skin_rash,nodal_skin_eruptions,continuous_sneezing,chills,joint_pain,vomiting,fatigue,weight_gain,cold_hands_and_feets,mood_swings,weight_loss,restlessness,lethargy,cough,high_fever,breathlessness,sweating,dehydration,indigestion,headache,yellowish_skin,dark_urine,nausea,loss_of_appetite,constipation,abdominal_pain,diarrhoea,mild_fever,yellow_urine,acute_liver_failure,swelled_lymph_nodes,malaise,phlegm,throat_irritation,redness_of_eyes,sinus_pressure,runny_nose,congestion,chest_pain,dizziness,puffy_face_and_eyes,enlarged_thyroid,swollen_extremeties,excessive_hunger,muscle_weakness,loss_of_smell,internal_itching,toxic_look_(typhos),irritability,muscle_pain,red_spots_over_body,belly_pain,dischromic_patches,receiving_unsterile_injections,stomach_bleeding,red_sore_around_nose,red_and_dry_tongue,red_skin,vascular_complications,respiratory_insufficiency,immunosuppression,insomia,itchy_anus,swollen_tonsils,red_tonsils,wheeze
0,Fungalinfection,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0
1,Fungalinfection,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0
2,Fungalinfection,1,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0
3,Fungalinfection,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0
4,Fungalinfection,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
3235,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0
3236,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0
3237,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0
3238,Pharyngotonsillitis,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0
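After these removals it can be useful to confirm that the table is in the expected state. The following is a hedged sketch of such a check, not part of the original notebook: the helper `check_indicator_table` and its parameters are hypothetical, and the toy table is made up so that both checks fail on purpose.

```python
import pandas as pd

def check_indicator_table(frame, label_col='Enfermedad', excluded=()):
    """Return a list of problems found; an empty list means all checks passed."""
    problems = []
    sintomas = frame.drop(columns=label_col)
    # Every symptom cell should be exactly 0 or 1
    if not sintomas.isin([0, 1]).all().all():
        problems.append('non-binary value in a symptom column')
    # Diseases the expert asked to discard should no longer appear
    left = set(frame[label_col]) & set(excluded)
    if left:
        problems.append(f'excluded diseases still present: {sorted(left)}')
    return problems

# Toy table (made-up values) with one invalid cell and one excluded disease
toy = pd.DataFrame({
    'Enfermedad': ['HepatitisA', 'HepatitisB'],
    'nausea': [1, 1],
    'dark_urine': [0, 2],
})
problems = check_indicator_table(toy, excluded=['HepatitisB', 'HepatitisC'])
```

In the notebook, the call would take `dataframe_casos` and the list of discarded hepatitis types, and an empty result would indicate that both checks passed.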


# 4. Exporting the data

Finally, we convert the dataframe to a CSV file so that it can be downloaded.

In [None]:
dataframe_casos.to_csv('casos_clinicos_preprocesado.csv')
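By default `to_csv` also writes the row index, which is why an `Unnamed: 0` column shows up when such a file is read back (as happened when loading `casos_infecciosas.csv`). The sketch below compares the default export with `index=False` on a made-up one-row table, using in-memory buffers instead of real files.

```python
import io
import pandas as pd

toy = pd.DataFrame({'Enfermedad': ['Fungalinfection'], 'itching': [1]})

# Default export: the row index is written as an unnamed first column
with_index = io.StringIO()
toy.to_csv(with_index)
with_index.seek(0)
cols_default = pd.read_csv(with_index).columns.tolist()

# Export without the index: only the data columns are written
without_index = io.StringIO()
toy.to_csv(without_index, index=False)
without_index.seek(0)
cols_clean = pd.read_csv(without_index).columns.tolist()

print(cols_default)  # ['Unnamed: 0', 'Enfermedad', 'itching']
print(cols_clean)    # ['Enfermedad', 'itching']
```

In Colab, the saved file can then be sent to the browser with `files.download('casos_clinicos_preprocesado.csv')`; that call is only available inside Colab and is not shown in the original notebook.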