# Example: Importing data to visualize patterns of missing values

This example shows three ways to import data and visualize its patterns of missing values. The set data is held in a Membership object, which stores the sets that make up each intersection and the combination of sets that each record belongs to.
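To make the idea of sets and intersections concrete, here is a minimal sketch using only pandas. It illustrates the concept and is not how the Membership class is implemented. The column names and values are hypothetical toy data, not the dataset used in this example. Each column is treated as a set containing the records with a missing value in that column, and each distinct combination of missing columns forms one intersection.

```python
import pandas as pd

# Hypothetical toy data (not the dataset used in this example).
df = pd.DataFrame(
    {
        "DIAG_01": ["A01", None, "B02", None],
        "DIAG_02": [None, None, "C03", None],
        "DIAG_03": ["D04", "E05", "F06", "G07"],
    }
)

# True where a value is missing. Each column acts as a "set" of records.
is_missing = df.isna()

# Map each combination of missing columns (an intersection) to the
# indices of the records that have exactly that combination.
intersections = {}
for idx, row in is_missing.iterrows():
    combo = tuple(col for col in is_missing.columns if row[col])
    intersections.setdefault(combo, []).append(idx)

for combo, records in intersections.items():
    print(combo, records)
```

Records with no missing values fall under the empty combination `()`, so every record belongs to exactly one intersection.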

## Includes: setvis and other libraries

In [1]:
import pandas as pd

from pace import Membership
from pace.plots import PlotSession

## Visualizing patterns of missing values

The following three data import methods produce identical visualizations.

In [2]:
input_file = "../examples/datasets/Synthetic_APC_DIAG_Fields.csv"

### Create a Membership object directly from the input file

In [3]:
missing1 = Membership.from_csv(input_file)
# PlotSession is the core class for analysing and exploring the
# missingness patterns found in a dataset.
missing_session1 = PlotSession(missing1)
# To visualise the dataset, call add_plot() and give the plot a name.
# The name matters: it lets you refer later to any interactive selection made in the plot.
# The result is a Bokeh widget with several tabs, each showing a different visualisation of the missingness data.
missing_session1.add_plot(name="example")

### Import the file as a data frame and then create a Membership object

In [4]:
df2 = pd.read_csv(input_file)
missing2 = Membership.from_data_frame(df2)
# Create a PlotSession object and visualise the patterns of missing values
missing_session2 = PlotSession(missing2)
missing_session2.add_plot(name="example")

### Create a PlotSession object directly from a data frame

In [5]:
df3 = pd.read_csv(input_file)
missing_session3 = PlotSession(df3)
missing_session3.add_plot(name="example")