# Context-Agent

This project introduces Context-Agent and the Non-linear Task Multiturn Dialogue (NTM) benchmark.
## News

- 🎉 Apr, 2026: Happy birthday! Context-Agent was accepted to ACL 2026 Findings.
- 🔥 Dec, 2025: We released Context-Agent.
## Table of Contents

- News
- Requirements
- Ollama Setup (for local models)
- Dataset Format
- Quickstart
- Output Layout
- Evaluation
- Citation
## Requirements

- Python 3.10+
- For local models, refer to the Ollama Setup section.
- For cloud models, ensure you have the corresponding API keys set as environment variables:
  - `OPENAI_API_KEY` for OpenAI
  - `ZAI_API_KEY` for Zhipu

Alternatively, you can place a `.env` file in the project root; `main.py` will load it automatically if present.
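Before a cloud run, you can sanity-check that the expected key is visible to Python. This is an illustrative sketch, not part of the project code (`check_key` is a hypothetical helper); only the variable names come from the list above.

```python
import os

# Map each cloud backend to the environment variable it requires
# (names taken from the Requirements list above).
REQUIRED_KEYS = {"openai": "OPENAI_API_KEY", "zhipu": "ZAI_API_KEY"}

def check_key(backend: str) -> bool:
    """Return True if the API key for the given backend is set and non-empty."""
    return bool(os.environ.get(REQUIRED_KEYS[backend]))
```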
Install Python dependencies:

```bash
pip install -r requirements.txt
```

## Ollama Setup (for local models)

- Install Ollama: Follow the official instructions at ollama.com.
- Pull a model: Before running `main.py` with Ollama, ensure you have pulled the desired model. For example:

  ```bash
  ollama pull qwen3:30b
  ```

- Ensure Ollama is running: The script expects the Ollama server to be available at its default address, `http://localhost:11434`.
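If runs fail with connection errors, you can probe the server from Python. This is an illustrative check (`ollama_available` is a hypothetical helper); the default address matches the one the script expects, and a running Ollama server answers its root endpoint with `200 OK`.

```python
import urllib.request

def ollama_available(base_url: str = "http://localhost:11434", timeout: float = 2.0) -> bool:
    """Probe the Ollama server's root endpoint; it responds 200 OK when running."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as resp:
            return resp.status == 200
    except OSError:  # connection refused, DNS failure, or timeout
        return False
```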
## Dataset Format

Each line in the dataset is a JSON object with the following structure:
```json
{
  "conversation_id": "<string|number>",
  "user_turns": [
    { "turn_id": 1, "content": "..." },
    { "turn_id": 2, "content": "..." }
  ],
  "metadata": { "optional": true }
}
```

Example datasets are provided under `input/`.
## Quickstart

The main entry point for the project is `main.py`. It processes JSONL datasets and generates conversation JSON files under `output/...`.
Minimal run (local Ollama, smart context ON by default):

```bash
python main.py
```

🚀 A quick start for most users: run the command above, then inspect the generated JSON files in `output/`.
Common flags:
- `--use-model {ollama|openai|zhipu}`: Specifies the backend model to use.
- `--smart-context` / `--no-smart-context`: Enables/disables dynamic tree context management.
- `--input-path <path>`: Path to a JSONL file or a folder containing JSONL files.
- `--ollama-model <name>`: E.g., `qwen3:30b`.
- `--openai-model <name>`: E.g., `gpt-4.1` or any gateway model ID.
- `--zhipu-model <name>`: E.g., `GLM-4`.
## Output Layout

Output files are saved based on the `--smart-context` flag:

- Smart Context: `output/smart/<model_name>/`
- Direct Context: `output/direct/<model_name>/`
Filename convention: `<backend>-<S|D>-<conversation_id>.json`, where `S` = smart and `D` = direct. Each output JSON contains:

```json
{
  "conversation_id": "...",
  "metadata": { },
  "turns": [
    { "role": "user", "turn_id": 1, "content": "..." },
    { "role": "assistant", "content": "...", "context_tokens": 123 }
  ]
}
```

`context_tokens` counts tokens for the context portion only (excluding the current query and reply), computed via `src/token_counter.py`.
## Evaluation

Use `evaluate.py` to score conversations with a judge model. It pairs each conversation's final assistant reply with a checkpoint question from `input/dataset-full.jsonl` and asks the judge to output the number of satisfied goals as "x/y".
Basic usage:
```bash
# Evaluate a specific output folder with OpenAI judge
python evaluate.py --mode output/smart/gpt-4.1 --judge openai
```

Key flags:

- `--mode`: One of `direct`, `smart`, `rag`, a subfolder under `output/direct`, or any folder path containing conversation JSONs.
- `--judge {openai|ollama}` and `--judge-model <name>`.
- `--openai-base-url <url>` to point to OpenAI-compatible gateways.
- `--limit <N>` to evaluate a subset.
Artifacts:

- CSV with per-conversation rows: `evaluation_result/<label>_<suffix>_scores.csv`
- Summary JSON without per-conversation details: `evaluation_result/<label>_<suffix>_score.json`
## Citation

If you find this work useful for your research, please cite our paper:
```bibtex
@article{hu2026context,
  title={Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue},
  author={Hu, Junan and Guo, Shudan and Liu, Wenqi and Yin, Jianhua and Wei, Yinwei},
  journal={arXiv preprint arXiv:2604.05552},
  year={2026}
}
```