Code for paper: Estimating Semantic Alphabet Size for LLM Uncertainty Quantification (ICLR '26).

Installation

Clone our repo:

>>> git clone https://github.com/lucasmccabe/esa.git

Install dependencies:

>>> pip install -r requirements.txt
>>> pip install -r requirements_data.txt

Example

A simple example, running from the root directory, is shown below:

from src import EntropyEstimator, TextPassages

text_passages = TextPassages(
    passages=[
        "The sky is blue.",
        "It's blue.",
        "Today, it is green!",
        "Sometimes, it is orange"
    ],
    question = "What color is the sky?"
)

entropy_estimator = EntropyEstimator(
    text_passages=text_passages,
    cluster_ids=[0,0,1,2]
)

print(f"Plugin entropy estimate: {entropy_estimator.get_entropy():.2f}")
print(f"Hybrid entropy estimate: {entropy_estimator.get_entropy("hybrid"):.2f}")

Output:

Plugin entropy estimate: 1.04
Hybrid entropy estimate: 1.76

Repo Overview

Scripts

The scripts found in scripts and src/experiments/collect_data pertain to LLM response generation and LLM-as-judge evaluation. These require running the Docker container for vLLM, as defined as compose.yaml. Semantic equivalence class labels for LLM responses are obtained via src/experiments/semantic_labeling.py. src/experiments/collect_alphabet.py collects alphabet size estimation information and packages it in a CSV that is more convenient for analysis. Similarly, src/experiments/collect_uncertainty.py does the same for uncertainty measures.

Figures and Tables

With the exception of Figure 1, which is an illustration, the underlying content for the figures and tables in the paper are generated via notebooks found in src/experiments.

src/experiments/Uncertainty Estimation.ipynb corresponds to the content of Figures 2, 7, 8 and Table 1;
src/experiments/Incorrectness.ipynb corresponds to the content of Figures 3, 4, 10 and Table 3;
src/experiments/MEAT and POTATO.ipynb corresponds to the content of Figure 5 and Table 2;
src/experiments/Information Gain.ipynb corresponds to the content of Figure 6;
src/experiments/Gemma-3-12B POTATO Examination.ipynb corresponds to the content of Figure 9;
src/experiments/RBMCI vs Empirical.ipynb corresponds to the content of Table 4; and
src/experiments/Alphabet Size Estimation.ipynb corresponds to the content of Figure 11.

Citation

To cite this work, please use the following:

@inproceedings{
    mccabe2026estimating,
    title={Estimating Semantic Alphabet Size for {LLM} Uncertainty Quantification},
    author={Lucas Hurley McCabe and Rimon Melamed and Thomas Hartvigsen and H Howie Huang},
    booktitle={The Fourteenth International Conference on Learning Representations},
    year={2026},
    url={https://openreview.net/forum?id=uYK6GPVg1O}
}

License

This project is licensed under GPL-3.0.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
compose.yaml		compose.yaml
example.env		example.env
requirements.txt		requirements.txt
requirements_data.txt		requirements_data.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code for paper: Estimating Semantic Alphabet Size for LLM Uncertainty Quantification (ICLR '26).

Table of Contents

Installation

Example

Repo Overview

Scripts

Figures and Tables

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Languages

Folders and files

Latest commit

History

Repository files navigation

Code for paper: Estimating Semantic Alphabet Size for LLM Uncertainty Quantification (ICLR '26).

Table of Contents

Installation

Example

Repo Overview

Scripts

Figures and Tables

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Languages

Packages