This repository provides the pipeline for constructing the LogiConBench dataset, a large-scale benchmark for evaluating logical consistency in LLMs. The dataset contains 280K samples distributed across four difficulty levels.
The dataset is built through five main steps, each implemented in a corresponding Jupyter notebook:
1. **Generate logical graphs and sample nodes**
   - Notebook: `step1_generate logical graph and select.ipynb`
   - Construct logical graphs where nodes are symbolic propositions and edges denote reasoning relations.
   - Sample subsets of nodes for downstream processing.
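Step 1 can be sketched roughly as follows. This is an illustrative assumption, not the notebook's actual code: the function names (`build_graph`, `sample_nodes`) and the random acyclic-graph construction are hypothetical.

```python
import random

def build_graph(n_props: int, n_edges: int, seed: int = 0):
    """Build a toy logical graph: nodes are propositions p0..p{n-1};
    an edge (u, v) denotes the reasoning relation 'u implies v'.
    Hypothetical sketch -- the real construction may differ."""
    rng = random.Random(seed)
    nodes = [f"p{i}" for i in range(n_props)]
    edges = set()
    while len(edges) < n_edges:
        i, j = rng.sample(range(n_props), 2)
        if i < j:  # orient edges forward so the graph stays acyclic
            edges.add((nodes[i], nodes[j]))
    return nodes, sorted(edges)

def sample_nodes(nodes, k: int, seed: int = 0):
    """Sample a subset of k nodes for downstream processing."""
    return sorted(random.Random(seed).sample(nodes, k))

nodes, edges = build_graph(n_props=6, n_edges=5)
subset = sample_nodes(nodes, k=3)
```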
2. **Reorder edges into walk sequences**
   - Notebook: `step2_reorder.ipynb`
   - Reorder the edges of sampled nodes into sequential walk structures to ensure consistent reasoning paths.
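One plausible reading of "reorder into walk sequences" is that each edge should appear only after its source proposition has already been reached, so the edge list reads as a coherent chain of reasoning steps. The sketch below implements that interpretation; `reorder_edges` and the DFS-based ordering are assumptions, not the notebook's actual logic.

```python
from collections import defaultdict

def reorder_edges(edges):
    """Reorder edges into a walk: an edge (u, v) is emitted only after
    its source u has been visited, starting from root premises that
    have no incoming edge. Illustrative sketch only."""
    adj = defaultdict(list)
    targets = set()
    for u, v in edges:
        adj[u].append(v)
        targets.add(v)
    roots = [u for u in adj if u not in targets]  # premises with no incoming edge
    walk, visited = [], set()
    stack = list(reversed(roots))
    while stack:
        u = stack.pop()
        if u in visited:
            continue
        visited.add(u)
        for v in adj[u]:
            walk.append((u, v))
            stack.append(v)
    return walk

edges = [("p2", "p4"), ("p0", "p1"), ("p1", "p2"), ("p1", "p3")]
walk = reorder_edges(edges)
```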
3. **Label consistency sets and count inconsistent nodes**
   - Notebook: `step3_label and count.ipynb`
   - Assign consistency labels to sampled nodes.
   - Count how many nodes admit inconsistent sets.
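A minimal sketch of the labeling step, under the assumption that a node's set is inconsistent when it contains both a proposition and its negation (written `~p`). The labeling rule and function names here are hypothetical stand-ins for the notebook's actual criteria.

```python
def is_consistent(literal_set):
    """A set of literals is consistent iff it never contains both a
    proposition p and its negation ~p. Illustrative rule only."""
    return not any(
        ("~" + lit) in literal_set
        for lit in literal_set
        if not lit.startswith("~")
    )

def label_and_count(node_sets):
    """Label each node's literal set and count how many nodes
    admit an inconsistent set."""
    labels = {node: is_consistent(lits) for node, lits in node_sets.items()}
    n_inconsistent = sum(1 for ok in labels.values() if not ok)
    return labels, n_inconsistent

node_sets = {
    "n0": {"p0", "p1"},
    "n1": {"p0", "~p0", "p2"},  # contains p0 and its negation
}
labels, n_bad = label_and_count(node_sets)
```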
4. **Rewrite equivalent logical expressions**
   - Notebook: `step4_rewrite.ipynb`
   - Apply symbolic rewriting rules to generate multiple logically equivalent variants of each node, enhancing structural diversity.
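The rewriting idea can be illustrated with two standard equivalences on a simple implication: the contrapositive and material implication. The rule set below is a toy example; the notebook's actual rewrite rules may be richer.

```python
def rewrite_variants(expr):
    """Generate logically equivalent variants of a simple implication
    'A -> B' via standard rewrite rules. Toy sketch only."""
    a, b = (s.strip() for s in expr.split("->"))

    def neg(x):
        # Negate a literal, cancelling a leading double negation.
        return x[1:] if x.startswith("~") else "~" + x

    return [
        expr,                      # original:             A -> B
        f"{neg(b)} -> {neg(a)}",   # contrapositive:       ~B -> ~A
        f"{neg(a)} | {b}",         # material implication: ~A | B
    ]

variants = rewrite_variants("p0 -> p1")
```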
5. **Translate into natural language**
   - Notebook: `step5_nl_template.ipynb`
   - Convert symbolic expressions into natural language statements using predefined templates and WordNet-based lexical substitutions.
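A minimal sketch of template-based translation. The templates are invented for illustration, and a small hard-coded synonym table stands in for the WordNet-based lexical substitutions (which would normally come from e.g. NLTK's WordNet interface).

```python
import random

# Hypothetical templates; the notebook's actual templates may differ.
TEMPLATES = ["If {a}, then {b}.", "{a} implies {b}."]

# Toy substitution table standing in for WordNet-derived synonyms.
SYNONYMS = {"implies": ["entails", "leads to"]}

def to_natural_language(a: str, b: str, seed: int = 0) -> str:
    """Render the implication a -> b as an English sentence using a
    random template, with optional lexical substitution."""
    rng = random.Random(seed)
    sent = rng.choice(TEMPLATES).format(a=a, b=b)
    for word, alts in SYNONYMS.items():
        if word in sent and rng.random() < 0.5:
            sent = sent.replace(word, rng.choice(alts))
    return sent

sentence = to_natural_language("it rains", "the ground is wet")
```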
- All generated data is stored under the `data/` directory.
- The dataset includes 280K samples, distributed across four difficulty levels:
  - `data/k=2/`
  - `data/k=3/`
  - `data/k=4/`
  - `data/k=5/`
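Given that layout, the four levels can be traversed as follows. The per-level file pattern (`*.jsonl` here) is an assumption; only the `data/k=2/` … `data/k=5/` directory names come from the repository description.

```python
from pathlib import Path

def iter_sample_files(root: str = "data"):
    """Yield (difficulty_level, file_path) pairs across the four
    difficulty levels data/k=2 .. data/k=5. The *.jsonl file pattern
    is an illustrative guess about the storage format."""
    for k in range(2, 6):
        level_dir = Path(root) / f"k={k}"
        for path in sorted(level_dir.glob("*.jsonl")):
            yield k, path
```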