Skip to content

iGUITest/LLMCluster

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LLMCluster: LLM-Based Crowdsourced Test Report Clustering

0. Dataset Prepare

The dataset we construct is available here.

Each csv file contains all the crowdsourced test reports for an app, with the following format in each line: <report-id>, <description>, <screenshot-url>, <category-label>.

Constructing new dataset based on the same format is also feasible.

Notice: screenshots are part of the crowdsourced test reports but are not used in LLMCluster.

1. Environment Setup

$ pip install openai scikit-learn evaluate

2. LLM API Settings

Set OpenAI API key in Environment Variable "OPENAI_API_KEY".

Make sure the "GPT-4o" model is available by running python llm.py.

Change the LLM service in llm.py is also feasible.

3. Run & Evaluate

$ python main.py <dataset-root> <app-id>

<app-id> is derived from the csv file name: app2.csv => app-id is 2.

For example, python main.py /workspace/llmcluster/reports 17

4. Code Explanation

main.py: implementation of the LLM-based crowdsourced test report clustering

eval.py: implementation of the evaluation of clustering result and generated summaries

llm.py: managing the LLM querying service

prompt.py: managing the task prompts for querying

About

LLM-based Crowdsourced Test Report Clustering

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages