This is the official code of SCAN.
```shell
conda create -n SCAN python=3.12
conda activate SCAN
pip install -r requirements.txt
```

You can choose to use the taxonomy we provide. See: `visualization_and_analysis/cata_tree.json`.
(Optional) You can also build your own custom taxonomy, since SCAN is highly extensible. Guidelines for building your customized taxonomy tree can be found in: Build Customized Taxonomy Tree.
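The exact schema of `cata_tree.json` is defined by the repository; as a purely hypothetical illustration, a taxonomy tree is commonly represented as nested categories, which you can traverse to enumerate leaf categories:

```python
# Hypothetical illustration only: the real schema of cata_tree.json is defined
# by this repository. Category names below are made up for the example.
taxonomy = {
    "name": "root",
    "children": [
        {"name": "math", "children": [{"name": "algebra", "children": []}]},
        {"name": "coding", "children": []},
    ],
}

def leaf_categories(node):
    """Collect leaf category names by depth-first traversal."""
    if not node["children"]:
        return [node["name"]]
    leaves = []
    for child in node["children"]:
        leaves += leaf_categories(child)
    return leaves

print(leaf_categories(taxonomy))  # ['algebra', 'coding']
```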
You can choose to use the evaluation dataset we provide. See: `evaluation/outputs/evaluation_dataset.jsonl`.
If you want to use the criteria and baseline model we provide, you can directly use the criteria we generated.
(Optional) You can also create your own custom evaluation dataset using our RealMix. Guidelines for building your customized evaluation dataset can be found in: Generate New Queries.
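If you are working with the provided dataset, note that JSONL files hold one JSON record per line. A minimal sketch of a loader (the field name `query` below is a placeholder; the actual fields depend on the dataset file):

```python
import io
import json

def load_jsonl(f):
    """Parse a JSONL stream: one JSON record per non-empty line."""
    return [json.loads(line) for line in f if line.strip()]

# Demo with an in-memory stream; real usage would open the dataset file, e.g.
# open("evaluation/outputs/evaluation_dataset.jsonl", encoding="utf-8").
sample = io.StringIO('{"query": "q1"}\n{"query": "q2"}\n')
records = load_jsonl(sample)
print(len(records))  # 2
```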
If you simply want to test our Visualization and Analysis Toolkit, you can directly use the generation and evaluation results we provide in `evaluation/outputs`.
When you reach this step, we first recommend preparing several models:
- Model to be evaluated: The model you want to evaluate.
- (Optional) Model for pre-comparison: Our evaluation method requires several models to generate their responses to assist in extracting more effective evaluation criteria. This can be any model. We adopt gpt-4o, deepseek-v3, and doubao-1-5-pro in our paper. (If you're using the criteria we generated, you do not need to prepare this model.)
- (Optional) Baseline model: The model that serves as the baseline in the evaluation. Our evaluation results are relative to its performance. We adopt gpt-4o in our paper. (If you're using the baseline model we use, you do not need to prepare this model.)
- Evaluation model: This model is used to generate criteria and evaluate other models. We recommend using more advanced models, especially reasoning models. We adopt DeepSeek-R1 in our paper.
Note that your models need to be served in an OpenAI-compatible format. Evaluation requires three things for each model: the model name, the base URL, and an API key.
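The three required items can be sketched as a minimal config check. All names and values below are placeholders; they map directly onto the arguments of an OpenAI-compatible client (e.g. `OpenAI(base_url=..., api_key=...)` in the official `openai` Python package):

```python
# Placeholder config for one model; replace each value with your own service.
model_config = {
    "model": "your-model-name",
    "base_url": "https://your-provider.example.com/v1",
    "api_key": "YOUR_API_KEY",
}

def is_complete(cfg):
    """Return True if all three required fields are present and non-empty."""
    return all(cfg.get(key) for key in ("model", "base_url", "api_key"))

print(is_complete(model_config))  # True
```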
After you have prepared these services, you can follow the guidance in Evaluate Models to perform the evaluation.
- Place the evaluation results obtained from the previous step into the `visualization_and_analysis/evaluation_source_data` directory.
- Enter the directory:

  ```shell
  cd ./visualization_and_analysis
  ```

- Run the following command to process the data obtained above:

  ```shell
  python source_result_processing.py
  ```

- Run the following command to get the analysis results:

  ```shell
  python auto_analysing.py
  ```

- Then, you can run the visualization and analysis tools locally:

  ```shell
  python -m http.server 8103
  ```

For more details, refer to: Visualization and Analysis.