<a href="https://colab.research.google.com/github/fengfrankgthb/Demonstrations/blob/main/LIT_sentiment_classifier.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Using the Learning Interpretability Tool in Notebooks

This notebook demonstrates the [Learning Interpretability Tool](https://pair-code.github.io/lit) (LIT) on a binary classifier that labels the sentiment of a statement (0 for negative, 1 for positive).

The `LitWidget` constructor takes a dict mapping model names to model objects and a dict mapping dataset names to dataset objects. These are the models and datasets displayed in LIT. Running the constructor starts the LIT server in the background, which loads the models and datasets and serves the UI.

To render the LIT UI in an output cell, call the `render` method on the `LitWidget` object. The UI can be rendered multiple times in separate cells if desired. The widget also provides a `stop` method to shut down the LIT server.

Copyright 2020 Google LLC.
SPDX-License-Identifier: Apache-2.0

In [1]:
# Installing lit-nlp with pip also installs the prerequisite packages needed by the core LIT package.
!pip install lit-nlp

Collecting lit-nlp
  Downloading lit_nlp-1.3.1-py3-none-any.whl.metadata (26 kB)
Collecting annoy>=1.17.3 (from lit-nlp)
  Downloading annoy-1.17.3.tar.gz (647 kB)
  Preparing metadata (setup.py) ... done
Collecting Levenshtein>=0.21.1 (from lit-nlp)
  Downloading levenshtein-0.27.1-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (3.6 kB)
Collecting ml-collections>=0.1.1 (from lit-nlp)
  Downloading ml_collections-1.1.0-py3-none-any.whl.metadata (22 kB)
Collecting numpy<2.0.0,>=1.24.1 (from lit-nlp)
  Downloading numpy-1.26.4-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (61 kB)
Collecting rouge-score>=0.1.2 (from lit-nlp)
  Downloading rouge_score-0.1.2.tar.gz (17 kB)
  Preparing metadata 

In [2]:
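# Upgrade NumPy to 2.x. lit-nlp 1.3.1 declares numpy<2.0.0, so pip reports a dependency conflict in the output below.
!pip install "numpy>=2.0.0,<3.0.0"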

Collecting numpy<3.0.0,>=2.0.0
  Downloading numpy-2.2.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (62 kB)
Downloading numpy-2.2.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (16.4 MB)
Installing collected packages: numpy
  Attempting uninstall: numpy
    Found existing installation: numpy 1.26.4
    Uninstalling numpy-1.26.4:
      Successfully uninstalled numpy-1.26.4
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
lit-nlp 1.3.1 requires numpy<2.0.0,>=1.24
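
The conflict reported above can also be checked from Python. The following is a minimal sketch, not part of LIT: the helper names `parse_release` and `in_range` are hypothetical, and the bounds are copied from the `numpy<2.0.0,>=1.24.1` requirement shown in the first install log. It compares only the numeric release segments, so it is a simplification of full version-specifier matching; the `packaging` library would be a more robust choice if it is available.

```python
from importlib import metadata
from itertools import takewhile


def parse_release(version: str) -> tuple:
    """Turn '1.26.4' into (1, 26, 4), stopping at the first non-numeric piece."""
    release = []
    for piece in version.split("."):
        digits = "".join(takewhile(str.isdigit, piece))
        if not digits:
            break
        release.append(int(digits))
    return tuple(release)


def in_range(version: str, lower: str, upper: str) -> bool:
    """True if lower <= version < upper, comparing numeric release tuples."""
    return parse_release(lower) <= parse_release(version) < parse_release(upper)


# Bounds assumed from the requirement printed in the install log above.
LOWER, UPPER = "1.24.1", "2.0.0"

try:
    installed = metadata.version("numpy")
    status = "within" if in_range(installed, LOWER, UPPER) else "outside"
    print(f"numpy {installed} is {status} the range >={LOWER},<{UPPER}")
except metadata.PackageNotFoundError:
    print("numpy is not installed")
```

Running this after each install cell makes it clear which NumPy version the kernel will pick up on the next restart.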

In [1]:
from lit_nlp import notebook
from lit_nlp.examples.glue import data
from lit_nlp.examples.glue import models

# Hide INFO and lower logs. Comment this out for debugging.
from absl import logging
logging.set_verbosity(logging.WARNING)

In [2]:
# Fetch the trained model weights
!wget https://storage.googleapis.com/what-if-tool-resources/lit-models/sst2_tiny.tar.gz
!tar -xvf sst2_tiny.tar.gz

--2025-05-17 04:15:12--  https://storage.googleapis.com/what-if-tool-resources/lit-models/sst2_tiny.tar.gz
Resolving storage.googleapis.com (storage.googleapis.com)... 172.253.63.207, 142.250.31.207, 142.251.111.207, ...
Connecting to storage.googleapis.com (storage.googleapis.com)|172.253.63.207|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 16362834 (16M) [application/octet-stream]
Saving to: ‘sst2_tiny.tar.gz’


2025-05-17 04:15:13 (107 MB/s) - ‘sst2_tiny.tar.gz’ saved [16362834/16362834]

./
./tokenizer_config.json
./tf_model.h5
./config.json
./train.history.json
./vocab.txt
./special_tokens_map.json
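
Before constructing the model, it may help to confirm that the archive unpacked as expected. The sketch below is an optional check, not part of LIT: `missing_model_files` is a hypothetical helper, and the file list is copied from the `tar` listing above rather than from any documented requirement of the model class.

```python
from pathlib import Path

# File names copied from the tar listing above; whether the model loader
# needs every one of them is an assumption, not something LIT documents here.
EXPECTED_FILES = (
    "tokenizer_config.json",
    "tf_model.h5",
    "config.json",
    "train.history.json",
    "vocab.txt",
    "special_tokens_map.json",
)


def missing_model_files(model_dir, expected=EXPECTED_FILES):
    """Return the expected file names that are not present in model_dir."""
    root = Path(model_dir)
    return [name for name in expected if not (root / name).is_file()]


missing = missing_model_files("./")
if missing:
    print("Missing files:", missing)
else:
    print("All files from the archive listing are present.")
```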


In [None]:
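# Create the LIT widget with the model and dataset to analyze.
# Use distinct variable names so the imported `models` module is not shadowed
# and this cell can be re-run.
lit_datasets = {'sst_dev': data.SST2Data('validation')}
lit_models = {'sst_tiny': models.SST2Model('./')}

widget = notebook.LitWidget(lit_models, lit_datasets, port=8890)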

In [None]:
# Render the widget
widget.render(height=1000)

If you've found interesting examples using the LIT UI, you can access these in Python using `widget.ui_state`:

In [None]:
widget.ui_state.primary  # the main selected datapoint

In [None]:
widget.ui_state.selection  # the full selected set, if you have multiple points selected

In [None]:
widget.ui_state.pinned  # the pinned datapoint, if you use the 📌 icon or comparison mode

Note that these include some metadata; the bare example is in the `['data']` field for each record:

In [None]:
widget.ui_state.primary['data']

In [None]:
[ex['data'] for ex in widget.ui_state.selection]
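
The same extraction can be wrapped in a small helper so that selected examples can be saved or reused later. This is a minimal sketch under stated assumptions: `bare_examples` is a hypothetical helper, and the sample records below are made up to mimic the shape described above (a record with a `data` field); the `id`, `sentence`, and `label` names are illustrative and may not match what LIT returns.

```python
import json


def bare_examples(records):
    """Return the `data` dict from each record, skipping empty records."""
    return [rec["data"] for rec in records if rec and "data" in rec]


# Hypothetical records standing in for `widget.ui_state.selection`.
# The field names inside each record are assumptions for illustration only.
sample_selection = [
    {"id": "a1", "data": {"sentence": "a gripping film", "label": "1"}},
    None,  # defensive case: an empty slot is skipped
    {"id": "b2", "data": {"sentence": "dull and lifeless", "label": "0"}},
]

examples = bare_examples(sample_selection)
print(json.dumps(examples, indent=2))
```

In the notebook itself, `bare_examples(widget.ui_state.selection)` would take the place of the sample list.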