
FlyCatcher: Neural Inference of Runtime Checkers from Tests

This repository contains the implementation of FlyCatcher and supplementary material for the paper "FlyCatcher: Neural Inference of Runtime Checkers from Tests".

Execution Environment

Follow the requirements and installation instructions in ENVIRONMENT.md to set up the execution environment.

Replication Guide

To reproduce the results from the paper, follow these instructions. The outputs of these steps are also provided in the Available Data section below, so you can inspect them there to skip some of the steps.

Get Target Tests and Related Static Information:

  1. Install CodeQL for VS Code: https://docs.github.com/en/code-security/codeql-for-vs-code/getting-started-with-codeql-for-vs-code/installing-codeql-for-vs-code

  2. Create the CodeQL database to analyze using the CodeQL CLI (recommended when targeting a specific project version):

    1. Install CodeQL CLI: https://docs.github.com/en/code-security/codeql-cli/getting-started-with-the-codeql-cli/setting-up-the-codeql-cli

    2. Get the project:

       cd data/projects
       git clone *project* (e.g., git clone https://github.com/apache/zookeeper)
       cd *project* && git checkout *version* (e.g., git checkout tags/release-3.4.11)

    3. Create the database:

       codeql database create *project*-db --language=java --source-root=*project root* (e.g., codeql database create zk-db --language=java --build-mode=none --source-root=.)

  3. Run a query: Select one of the queries in the ./codeql-custom-queries-java folder and run it. Then, save the output as a CSV file to ./data/static/*project*/raw/*id*.csv, where *id* is the name of the .ql file without the get- prefix, e.g., tests-and-methods-called-by-them.

  4. Join the query results:

    python ProcessStaticData.py --files_root=*path* --project_root=*path* (python ProcessStaticData.py --files_root=data/static/zookeeper/raw --project_root=data/projects/zookeeper/)

  5. Build the target system:

    cd data/projects/hbase && mvn package -DskipTests -Dhadoop.profile=3.0 -Dhadoop-three.version=3.2.2
    
  6. Copy and fix dependencies:

    mvn dependency:copy-dependencies -DoutputDirectory=./lib -DincludeScope=runtime,compile,test
    
    find . -type f -name "asm*.jar" -delete
    find . -type f -name "jopt-simple*.jar" -delete
    
  7. Run each test class and add the failing ones to ./GetTargetTests_system.py

  8. Randomly select 100 targets:

    python GetTargetTests_system.py --input tests_and_their_info.csv (python GetTargetTests_hb.py --input ./data/static/hbase/tests_and_their_info.csv)
    
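The raw-CSV naming rule from step 3 (query file name minus the get- prefix) can be sketched as a small helper. This is an illustration of the convention, not code from the repository; the function name and the data/static root are assumptions based on the paths above.

```python
from pathlib import Path

def raw_csv_path(ql_file: str, project: str, static_root: str = "data/static") -> str:
    """Map a query file such as 'get-tests-and-methods-called-by-them.ql'
    to the raw CSV location expected by step 3 (hypothetical helper)."""
    stem = Path(ql_file).stem            # drop the .ql extension
    if stem.startswith("get-"):
        stem = stem[len("get-"):]        # the id is the file name without the 'get-' prefix
    return f"{static_root}/{project}/raw/{stem}.csv"

print(raw_csv_path("get-tests-and-methods-called-by-them.ql", "zookeeper"))
# -> data/static/zookeeper/raw/tests-and-methods-called-by-them.csv
```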

Generate and Validate Checkers:

python Main.py --input=./data/static/cassandra/tests_and_their_info.csv --target=./data/target_tests_cs.csv --project_root=./data/projects/cassandra --dependencies_cp ./data/projects/cassandra/build/classes/main:./data/projects/cassandra/build/classes/thrift:./data/projects/cassandra/build/test/classes:./data/projects/cassandra/lib/*:./data/projects/cassandra/build/lib/jars/*
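The same long --dependencies_cp value is passed to both Main.py and the cross-validation script below. A minimal sketch of assembling it once, so the two invocations cannot drift apart (the helper name is hypothetical; the classpath entries are taken from the command above):

```python
def cassandra_classpath(project_root: str = "./data/projects/cassandra") -> str:
    """Build the Cassandra --dependencies_cp string from its classpath entries."""
    entries = [
        "build/classes/main",
        "build/classes/thrift",
        "build/test/classes",
        "lib/*",
        "build/lib/jars/*",
    ]
    # Java classpath entries are joined with ':' on Unix-like systems
    return ":".join(f"{project_root}/{e}" for e in entries)

print(cassandra_classpath())
```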

Cross-Validate Checkers:

python ./evaluation/CrossValidation.py --results ./data/results_claude_cs.csv --instrumented_classes ./data/output/instrumented_classes_all/ --target ./data/target_tests_cs.csv --dependencies_cp ./data/projects/cassandra/build/classes/main:./data/projects/cassandra/build/classes/thrift:./data/projects/cassandra/build/test/classes:./data/projects/cassandra/lib/*:./data/projects/cassandra/build/lib/jars/*

Mutation Analysis

python ./evaluation/MutationAnalysis.py --input=./data/static/zookeeper/tests_and_their_info.csv --results=./data/results_claude_zk.csv --src_dir=./data/projects/zookeeper/src --src_dir_byte=./data/projects/bytecode/zookeeper-3.4.11 --tests_dir_byte=./data/projects/bytecode/zookeeper-3.4.11-tests --dependencies_cp dependencies

Study on Feedback contributions

python ./evaluation/GetFeedbackContributions.py

Costs of the Approach

  1. Measure the execution time with and without the checkers activated:

     python ./evaluation/MeasureExecutionTime.py

  2. Summarize the results:

     python ./evaluation/SummarizeExecutionTime.py
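One plausible way to summarize the two timing runs is as a relative slowdown; this is an illustrative sketch, not the actual computation in SummarizeExecutionTime.py:

```python
def overhead(with_checkers: list, without_checkers: list) -> float:
    """Relative slowdown of the checker-enabled run over the baseline,
    e.g. 0.25 means the checkers add 25% to the mean execution time."""
    mean_with = sum(with_checkers) / len(with_checkers)
    mean_without = sum(without_checkers) / len(without_checkers)
    return mean_with / mean_without - 1.0

# Hypothetical timings in seconds: three runs with checkers, three without
print(f"{overhead([1.2, 1.3, 1.25], [1.0, 1.0, 1.0]):.2f}")  # -> 0.25
```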

Available Data

The data from our experiments can be found in the following folders in the artifact:

  1. The target tests data can be found at:
./data/target_tests_*.csv
./data/static
  2. The prompts, model responses, instrumented classes, validation and cross-validation results for each target test, checker and model are available at:
./data/output/
  3. The mutation analysis is available at:
./data/mutation_analysis_zk
  4. The overhead analysis is available at:
./data/overhead_analysis
