ReAnalogy Dataset

pip install reanalogy

ReAnalogy is a dataset curated from regular expressions used in open-source Python projects on GitHub, combined with three previously available datasets: KB13, NL-RX-Turk, and Lingua Franca.

We use ReAnalogy to test the inductive reasoning ability of Large Language Models.

What are regular expressions?


Regular expressions are a formal language: a program evaluates whether a sequence of characters matches a given expression. The same expression can also be used to generate matching text.
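As a quick illustration (a minimal sketch: Python's built-in re module covers the evaluation direction, and the third-party exrex package used for the generation direction is an assumption here, not part of ReAnalogy):

import re
import exrex  # third-party: pip install exrex (assumption, not part of ReAnalogy)

pattern = r"ab+c"
# Evaluation: decide whether a sequence of characters is accepted by the pattern.
assert re.fullmatch(pattern, "abbbc") is not None
# Generation: sample a string that the same pattern accepts.
print(exrex.getone(pattern))  # e.g. "abbc"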

Why ReAnalogy?

ReAnalogy has more complex expressions than the previously mentioned datasets, and its regular expressions contain natural language. For example:

  • the ReAnalogy regex applying.*?jquery.*?script generates applying+WjqueryaK-6py|w9$script
  • whereas the KB13 regex (.*[0-9].*){5,} generates Er9?=0mQL92:?$)\\BzG 1

The quasi-natural language in ReAnalogy makes it a benchmark that:

  1. Is closer to natural language
  2. Is challenging
  3. Contains a ground truth

Additionally, it is an unbiased benchmark for evaluating the reasoning of a language model, since the ground truth lets us check whether the model has reasoned successfully. For example, we can evaluate whether a generated Fact (text) is matched by a Rule (regex).
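A minimal sketch of this evaluation idea using only Python's re module (the fact_accuracy helper and the use of fullmatch semantics are illustrative assumptions; the dataset's own score and validate methods shown below are the supported interface):

import re

def fact_accuracy(rule: str, facts: list[str]) -> float:
    # Fraction of generated Facts (strings) that the Rule (regex) accepts.
    pattern = re.compile(rule)
    return sum(pattern.fullmatch(fact) is not None for fact in facts) / len(facts)

# Hypothetical model outputs for the Rule applying.*?jquery.*?script
facts = ["applying+WjqueryaK-6py|w9$script", "unrelated text"]
print(fact_accuracy(r"applying.*?jquery.*?script", facts))  # 0.5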

Usage

from reanalogy.dataset import ReAnalogy
from torch.utils.data import DataLoader

data_path = "data"  # directory where the dataset files are stored (adjust to your setup)
ds = ReAnalogy(
    data_path,
    split="train",
    return_regex=True,
    dataset_name="reanalogy",
    n_examples=5,
)
dl = DataLoader(ds, batch_size=128, num_workers=0)

The examples are generated on the fly. We recommend setting num_workers > 0 to enable multi-processing and avoid the example-generation process becoming a bottleneck.

Additional supported datasets are: dataset_name = "deep" and dataset_name = "kb13".
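For example, the same loader can be configured with parallel workers or pointed at one of the other datasets (a minimal sketch; the num_workers value and the reuse of data_path from above are illustrative choices):

# Parallel example generation (any value > 0; 4 is an arbitrary choice).
dl = DataLoader(ds, batch_size=128, num_workers=4)

# The same interface with the KB13 regexes instead of ReAnalogy.
kb13_ds = ReAnalogy(
    data_path,
    split="train",
    return_regex=True,
    dataset_name="kb13",
    n_examples=5,
)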

Decoding and Scoring

idx = 0
# Ground-truth regex (the Rule) for this dataset entry.
regex = ds.dataset[idx]
# Encoded sequence of generated examples; the regex is appended to the
# end of the sequence when `return_regex=True`.
seqs = ds[idx]
examples = ds.decode_examples(seqs)
if ds.return_regex:
    assert ds.decode_regex(seqs) == str(regex)
# Score and validate the decoded examples against the ground-truth regex.
score = ds.score(regex, examples).mean()
validity = ds.validate(regex, examples).mean()

NOTE: We use ␀ (\u2400) to represent unknown tokens.

Cite

@inproceedings{fostiropoulos2023probing,
  title={Probing Reasoning of Language Models with Inductive In-Context Learning},
  author={Fostiropoulos, Iordanis and Itti, Laurent},
  booktitle={International Joint Conference on Artificial Intelligence 2023 Workshop on Knowledge-Based Compositional Generalization},
  year={2023}
}

About

Code for our dataset, published at the IJCAI 2023 Workshop on Knowledge-Based Compositional Generalization.
