<a href="https://colab.research.google.com/github/fermuba/Titanic/blob/main/notebooks/02.EDA_sweetviz.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Exploratory Analysis with SweetViz

**SweetViz** is an exploratory data analysis (EDA) tool that generates interactive reports for a dataset. It is well suited to getting a quick, broad overview of the data: descriptive statistics, distributions, and associations between features.

The report provides:
- A visual summary of each column's distribution.
- Comparisons between categories or classes.
- An analysis of the relationships between features.

The reports in this notebook were generated by running **SweetViz** on the Titanic dataset.


In [None]:
# Install the library
!pip install sweetviz



In [None]:
# Import the libraries
import pandas as pd
import numpy as np
import sweetviz as sv

In [None]:
# Load the dataset (the file uses ';' as the field separator)
df = pd.read_csv('/content/Titanic_Research_v6_clean.csv', sep=';')

df.head()

Unnamed: 0,pclass,survived,name,sex,age,sibsp,parch,ticket,embarked,home.dest
0,1,1,"Allen, Miss. Elisabeth Walton",female,29,0.0,0,24160,S,USA
1,1,1,"Allison, Master. Hudson Trevor",male,1,1.0,2,113781,S,CANADA
2,1,0,"Allison, Miss. Helen Loraine",female,2,1.0,2,113781,S,CANADA
3,1,0,"Allison, Mr. Hudson Joshua Creighton",male,30,1.0,2,113781,S,CANADA
4,1,0,"Allison, Mrs. Hudson J C (Bessie Waldo Daniels)",female,25,1.0,2,113781,S,CANADA
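The `Unnamed: 0` column in the output above repeats the row index, and SweetViz would profile it as a feature like any other column. The sketch below shows one way to avoid that with the `index_col` parameter of `pd.read_csv`. It assumes the first column of the CSV has an empty header, which is the usual reason pandas assigns the name `Unnamed: 0`. The inline sample reuses two rows from the output above in place of the real file.

```python
import io

import pandas as pd

# Two rows copied from the df.head() output, written with ';' as separator.
# The leading ';' in the header line is an assumption about the real file.
sample_csv = (
    ";pclass;survived;name;sex;age;sibsp;parch;ticket;embarked;home.dest\n"
    "0;1;1;Allen, Miss. Elisabeth Walton;female;29;0.0;0;24160;S;USA\n"
    "1;1;1;Allison, Master. Hudson Trevor;male;1;1.0;2;113781;S;CANADA\n"
)

# Default read: the unnamed first column becomes a regular column 'Unnamed: 0'
raw = pd.read_csv(io.StringIO(sample_csv), sep=";")

# index_col=0 uses that column as the row index instead of a feature
clean = pd.read_csv(io.StringIO(sample_csv), sep=";", index_col=0)

print(list(raw.columns)[:3])
print(list(clean.columns)[:3])
```

If the file cannot be re-read, `df.drop(columns="Unnamed: 0")` on the loaded frame has the same effect.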


In [None]:
# Generate the report
report = sv.analyze(df)

report.show_notebook(h='full')


In [None]:
# Generate a standalone HTML file, independent of the notebook
report.show_html('Titanic_full.html')

Report Titanic_full.html was generated! NOTEBOOK/COLAB USERS: the web browser MAY not pop up, regardless, the report IS saved in your notebook/colab files.


In [None]:
# Run an analysis relative to a target feature
report_class = sv.analyze(df, target_feat='age')

report_class.show_html('Titanic_age.html')
report_class.show_notebook()


Report Titanic_age.html was generated! NOTEBOOK/COLAB USERS: the web browser MAY not pop up, regardless, the report IS saved in your notebook/colab files.


In [None]:
# Generate a report without the heatmap (pairwise associations)
report = sv.analyze(df, pairwise_analysis='off')

report.show_html('Titanic_simple.html')
report.show_notebook()


Report Titanic_simple.html was generated! NOTEBOOK/COLAB USERS: the web browser MAY not pop up, regardless, the report IS saved in your notebook/colab files.
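With `pairwise_analysis='off'` the report leaves out the associations view. If a quick look at numeric relationships is still wanted, pandas can compute Pearson correlations directly. The sketch below is not a reproduction of SweetViz's associations, which also cover categorical features. It uses only the numeric columns of the five rows shown in the `df.head()` output, so the values are illustrative and say nothing reliable about the full dataset.

```python
import pandas as pd

# Numeric columns from the five rows of the df.head() output
head = pd.DataFrame({
    "survived": [1, 1, 0, 0, 0],
    "age": [29, 1, 2, 30, 25],
    "sibsp": [0.0, 1.0, 1.0, 1.0, 1.0],
    "parch": [0, 2, 2, 2, 2],
})

# Pairwise Pearson correlation between the numeric columns
corr = head.corr(method="pearson")

print(corr.round(2))
```

On the real data, `df.select_dtypes("number").corr()` gives the same kind of table for every numeric column.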


In [None]:
# Generate a comparison report (male vs. female passengers)
report = sv.compare_intra(df, df['sex'] == 'male', ['Male', 'Female'])

report.show_html('Titanic_compare.html')
report.show_notebook()


Report Titanic_compare.html was generated! NOTEBOOK/COLAB USERS: the web browser MAY not pop up, regardless, the report IS saved in your notebook/colab files.
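`compare_intra` takes a boolean condition and two names, so the order of the names matters. The sketch below reproduces the split in plain pandas. It assumes, as the SweetViz README describes, that rows where the condition is `True` form the first named group and the remaining rows form the second. The `sex` and `survived` values are the five rows from the `df.head()` output.

```python
import pandas as pd

# 'sex' and 'survived' for the five rows shown in df.head()
head = pd.DataFrame({
    "sex": ["female", "male", "female", "male", "female"],
    "survived": [1, 1, 0, 0, 0],
})

# Same condition as in the compare_intra call
condition = head["sex"] == "male"

# True rows correspond to the first name, False rows to the second
groups = {
    "Male": head[condition],
    "Female": head[~condition],
}

for name, group in groups.items():
    print(name, len(group))
```

Under that assumption, passing `['Female', 'Male']` with the same condition would swap the labels in the report, so the names should follow the condition.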
