GitHub - Estrellajer/Knots

【ADVEI】Knots: A Large-Scale Multi-Agent Enhanced Expert-Annotated Dataset and LLM Prompt Optimization for NOTAM Semantic Parsing

Code for the Knots Dataset introduced in the Advanced Engineering Informatics paper 'Knots: A Knowledge-Guided Self-Evolving Optimization Framework with LLMs for NOTAM Interpretation'.

If you like our project, please give us a star ⭐ on GitHub for the latest updates.

[![arXiv](https://img.shields.io/badge/Arxiv-Paper-b31b1b.svg?logo=arXiv)](https://arxiv.org/abs/2511.12630)
[![License](https://img.shields.io/badge/License-Apache%202.0-yellow)](https://github.com/Estrellajer/Knots/blob/main/LICENSE) [![GitHub issues](https://img.shields.io/github/issues/Estrellajer/Knots?color=critical&label=Issues)](https://github.com/Estrellajer/Knots/issues?q=is%3Aopen+is%3Aissue)

Category and subcategory distribution of Q-codes within the NOTAM dataset

🙏 Acknowledgment

Great thanks to the Beijing Natural Science Foundation for funding the Qiyuan Research Program, and to the Aviation Data Communication Corporation for providing data, as well as to all the teachers and classmates who have offered their help.

📝 Citation

If you find this paper useful, please consider staring 🌟 this repo and citing 📑 our paper:

@article{liu2025notam,
  title={Knots: A Large-Scale Multi-Agent Enhanced Expert-Annotated Dataset and LLM Prompt Optimization for NOTAM Semantic Parsing},
  author={Liu, Maoqi and Fang, Quan and Yang, Yang and Zhao, Can and Cai, Kaiquan},
  journal={arXiv preprint arXiv:2511.12630},
  year={2025}
}

Project Structure

├── config/                 # Configuration files
│   ├── prompts.py         # Traditional prompt definitions
│   └── settings.py        # System settings
├── src/                   # Source code
│   ├── agents.py         # Intelligent agents
│   ├── api_manager.py    # API manager
│   ├── debate.py         # Debate mechanism
│   ├── mining.py         # Data mining
│   ├── models.py         # Data models
│   ├── post_processor.py # Post processor
│   └── utils.py          # Utility functions
├── config.yaml           # Main configuration file
├── main.py               # Main program entry
└── requirements.txt      # Dependencies list

Environment Setup

Installing Dependencies with uv

We recommend using uv to manage Python dependencies:

# Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Create virtual environment and install dependencies
uv venv
source .venv/bin/activate  # Linux/macOS
# or .venv\Scripts\activate  # Windows

# Install project dependencies
uv pip install -r requirements.txt

Environment Variables Configuration

Create a .env file to configure API keys:

# DMX API
DMXAPI_API_KEY=sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

# Other API keys...

Usage

Basic Command Format

uv run main.py <input_file> <output_file> --prompt <prompt> [options]

Example Commands

1. Airport Data Processing

uv run main.py data/output/airport.json data/output/airport_icl_gpt-4.1-nano.json \
  --prompt AIRPORT_PROMPT_ICL \
  --provider dmxapi \
  --api-key sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx \
  --model gpt-4.1-nano \
  --base-url https://www.dmxapi.com/v1 \
  --evaluate

2. Runway Data Processing (with Self-consistency)

uv run main.py data/output/runway.json data/output/runway_processed.json \
  --prompt RUNWAY_PROMPT_ICL \
  --provider deepseek \
  --api-key sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx \
  --model deepseek-chat \
  --self-consistency \
  --consistency-rounds 5 \
  --consistency-strategy majority_vote \
  --evaluate

3. Light Data Processing (with Different Temperature)

uv run main.py data/output/light.json data/output/light_processed.json \
  --prompt LIGHT_PROMPT_A_COT \
  --provider qwen \
  --api-key sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx \
  --model qwen3-8b \
  --temperature 0.2 \
  --evaluate

4. Taxiway Data Processing

uv run main.py data/output/taxiway.json data/output/taxiway_processed.json \
  --prompt TAXIWAY_PROMPT_ICL \
  --provider qwen \
  --api-key sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx \
  --model qwen3-8b \
  --evaluate

5. Navigation Data Processing

uv run main.py data/output/navigation.json data/output/navigation_processed.json \
  --prompt NAVIGATION_PROMPT_COT \
  --provider deepseek \
  --api-key sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx \
  --model deepseek-chat \
  --temperature 0.1 \
  --evaluate

6. POML Mode Processing

uv run main.py data/output/airport.json data/output/airport_poml_processed.json \
  --use-poml \
  --poml-file config/Airport.poml \
  --provider qwen \
  --api-key sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx \
  --model qwen3-8b \
  --evaluate

7. POML Mode with Self-consistency

uv run main.py data/output/runway.json data/output/runway_poml_processed.json \
  --use-poml \
  --poml-file config/Airport.poml \
  --provider deepseek \
  --api-key sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx \
  --model deepseek-chat \
  --self-consistency \
  --consistency-rounds 3 \
  --consistency-strategy majority_vote \
  --evaluate

Command Line Arguments

Required Parameters

input_file: Input JSON file path
output_file: Output JSON file path
--prompt: Prompt content or predefined prompt name (required in traditional mode)

POML Mode Parameters

--use-poml: Enable POML (Prompt Optimization Markup Language) mode
--poml-file: Path to POML file (required when --use-poml is set)

API Configuration Parameters

--provider: API provider (qwen/deepseek/dmxapi/openai)
--api-key: API key
--model: Model name to use
--base-url: API base URL
--temperature: Temperature parameter, controls output randomness (0.0-1.0)

Self-consistency Parameters

--self-consistency: Enable self-consistency validation
--consistency-rounds: Number of self-consistency rounds (default: 3)
--consistency-strategy: Strategy selection
- majority_vote: Majority voting (default)
- first_success: First success
- most_confident: Highest confidence

Other Options

--evaluate: Show evaluation report after processing
--evaluate_only FILE: Only evaluate specified file, no processing

Predefined Prompts

The system provides various predefined prompts covering different data types and reasoning strategies:

Data Types

AIRPORT_PROMPT_*: Airport-related data
RUNWAY_PROMPT_*: Runway-related data
LIGHT_PROMPT_*: Light-related data
TAXIWAY_PROMPT_*: Taxiway-related data
AIRWAY_PROMPT_*: Airway-related data
AREA_PROMPT_*: Area-related data
STAND_PROMPT_*: Stand-related data
NAVIGATION_PROMPT_*: Navigation-related data
PROCEDURE_PROMPT_*: Procedure-related data
STANDARD_PROMPT_*: Standard-related data
RVR_PROMPT_*: RVR-related data

Reasoning Strategies

*_Vanilla: Basic prompts
*_ICL: In-context learning with examples
*_COT: Chain-of-thought reasoning
*_POML: POML (Prompt Optimization Markup Language) mode

Post-processing(SRCV)

For further optimization and validation of parsed data:

uv run src/post_processor.py input_file.json output_file.json \
  --provider qwen \
  --api-key sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx \
  --model qwen3-8b \
  --evaluate

Configuration Files

config.yaml

Main configuration file containing API settings, path configurations, and processing parameters:

api:
  default_provider: "deepseek"
  providers:
    deepseek:
      api_key: "${DEEPSEEK_API_KEY}"
      base_url: "https://api.deepseek.com/v1"
      model: "deepseek-chat"

paths:
  data_dir: "./data"
  output_dir: "./data/output"

processing:
  default_sample_size: 100
  max_workers: 4

Logging and Monitoring

Log files located in logs/ directory
Real-time progress display and success rate statistics
API call statistics and error tracking
Automatic evaluation report generation after processing completion

Supported Data Formats

Input Format

{
  "records": [
    {
      "raw_text": "NOTAM raw text...",
      "category": "runway",
      "telex": "A001/23"
    }
  ]
}

Output Format

{
  "metadata": {
    "total_records": 100,
    "success_count": 95,
    "processing_time": "2023-01-01T12:00:00"
  },
  "records": [
    {
      "raw_text": "Raw text",
      "parse_fields": {
        "airport": "ZBAA",
        "runway": "18L/36R",
        "status": "closed"
      }
    }
  ],
  "api_stats": {
    "total_requests": 100,
    "successful_requests": 95
  }
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
config		config
data/output		data/output
images		images
src		src
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
config.yaml		config.yaml
main.py		main.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

【ADVEI】Knots: A Large-Scale Multi-Agent Enhanced Expert-Annotated Dataset and LLM Prompt Optimization for NOTAM Semantic Parsing

Code for the Knots Dataset introduced in the Advanced Engineering Informatics paper 'Knots: A Knowledge-Guided Self-Evolving Optimization Framework with LLMs for NOTAM Interpretation'.

If you like our project, please give us a star ⭐ on GitHub for the latest updates.

🙏 Acknowledgment

📝 Citation

Project Structure

Environment Setup

Installing Dependencies with uv

Environment Variables Configuration

Usage

Basic Command Format

Example Commands

1. Airport Data Processing

2. Runway Data Processing (with Self-consistency)

3. Light Data Processing (with Different Temperature)

4. Taxiway Data Processing

5. Navigation Data Processing

6. POML Mode Processing

7. POML Mode with Self-consistency

Command Line Arguments

Required Parameters

POML Mode Parameters

API Configuration Parameters

Self-consistency Parameters

Other Options

Predefined Prompts

Data Types

Reasoning Strategies

Post-processing(SRCV)

Configuration Files

config.yaml

Logging and Monitoring

Supported Data Formats

Input Format

Output Format

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages