This project explores the capabilities of Large Language Models (LLMs) in understanding, translating, and reasoning with predicate logic.
The repository contains experiments analyzing how LLMs perform in translating natural language sentences into formal first-order logic (FOL) notation. The analysis includes both Chain of Thought (CoT) and In-Context Learning (ICL) approaches.
Notebooks:
- LLM_Predicate_Logic_Project.ipynb: Natural language to logic translation
- LLM_Predicate_Logic_CoT.ipynb: Chain of Thought experiments
- LLM_Predicate_Logic_ICL.ipynb: In-Context Learning experiments
- analysis_cot.ipynb: Analysis of CoT results
- analysis_icl.ipynb: Analysis of ICL results
- analysis.ipynb: General analysis
- data_visualization.ipynb: Visualizations of results
Data:
- dataset/data.csv: Source dataset
- results_cot.csv: Results from Chain of Thought experiments
- results_icl.csv: Results from In-Context Learning experiments
- results.csv: Aggregated results
- llm_text2log_outputs.csv: Model outputs translating text to logic
The project investigates the performance of LLMs in understanding and translating natural language sentences (like "Some men are wolves") into first-order logic expressions (such as "exists x1.(_man(x1) & exists x2.(_wolf(x2) & (x1 = x2)))").
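One simple way such translations can be scored is by comparing the model's output formula against the gold formula after normalizing whitespace. This is only an illustrative sketch (the notebooks' actual evaluation metric may be more sophisticated, e.g. tolerant of variable renaming):

```python
import re

def normalize_fol(formula: str) -> str:
    """Collapse all whitespace so FOL strings that differ
    only in spacing compare equal."""
    return re.sub(r"\s+", "", formula)

gold = "exists x1.(_man(x1) & exists x2.(_wolf(x2) & (x1 = x2)))"
pred = "exists x1.( _man(x1) & exists x2.( _wolf(x2) & (x1=x2) ) )"

print(normalize_fol(gold) == normalize_fol(pred))  # True
```

Exact string matching after normalization is strict: two logically equivalent formulas written differently would still count as a mismatch.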
Two main approaches are explored:
- Chain of Thought (CoT): Prompting the model to explain its reasoning process step by step
- In-Context Learning (ICL): Providing examples to guide the model's translations
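The two prompting styles can be sketched as simple prompt builders. The templates and example pairs below are hypothetical; the actual prompts live in the notebooks:

```python
# Hypothetical prompt builders illustrating the two styles;
# the real prompts used in the experiments may differ.

COT_TEMPLATE = (
    "Translate the sentence into first-order logic. "
    "Explain your reasoning step by step, then give the final formula.\n"
    "Sentence: {sentence}\n"
)

# One in-context example pair (sentence, gold FOL) from the task.
ICL_EXAMPLES = [
    ("Some men are wolves",
     "exists x1.(_man(x1) & exists x2.(_wolf(x2) & (x1 = x2)))"),
]

def build_cot_prompt(sentence: str) -> str:
    """CoT: ask the model to reason aloud before answering."""
    return COT_TEMPLATE.format(sentence=sentence)

def build_icl_prompt(sentence: str) -> str:
    """ICL: prepend worked examples, then the target sentence."""
    shots = "\n".join(f"Sentence: {s}\nFOL: {f}" for s, f in ICL_EXAMPLES)
    return f"{shots}\nSentence: {sentence}\nFOL:"
```

The key contrast: CoT changes the *instruction* (elicit reasoning), while ICL changes the *context* (supply demonstrations).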
The notebooks contain detailed analysis of model performance, including:
- Accuracy metrics for different types of logical statements
- Comparison between CoT and ICL approaches
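A per-method accuracy comparison of this kind can be computed with a pandas groupby. The column names below are assumptions for illustration; the real results CSVs may use a different schema:

```python
import pandas as pd

# Toy stand-in for the results data; assumed columns
# "method" and "correct" are hypothetical.
results = pd.DataFrame({
    "method":  ["CoT", "CoT", "ICL", "ICL"],
    "correct": [1, 0, 1, 1],
})

# Mean of the 0/1 correctness flag per method = accuracy.
accuracy = results.groupby("method")["correct"].mean()
print(accuracy)
```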
Setup:
- Clone this repository
- Install the required dependencies (see requirements in notebooks)
- Run the notebooks in your preferred environment
Requirements:
- Python 3.x
- Pandas
- Jupyter
- LLM API access (specifics in notebooks)