This is the online appendix for our paper "Large Language Models for Code Analysis: Do LLMs Really Do Their Job?".
The dataset we use consists of:
- Non-Obfuscated Code
  - C: selected code samples from the POJ-104 dataset and classic C benchmarks (e.g., Linpack);
  - JavaScript: the Octane benchmark and several web apps from GitHub;
  - Python: selected code samples from Google's CodeSearchNet dataset;
- Obfuscated Code
  - Obfuscated JavaScript code, obtained by applying different obfuscation techniques to the JavaScript portion of our non-obfuscated dataset;
  - Winning entries from the International Obfuscated C Code Contest (IOCCC).
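To illustrate the kind of transformation involved in constructing the obfuscated JavaScript samples, here is a toy identifier-renaming pass, one of the simplest obfuscation techniques. This is only an illustrative sketch, not the actual toolchain used to build the dataset (see the paper for those details); the function and its regex-based approach are our own simplification.

```javascript
// Toy obfuscation sketch: rename the given identifiers in a source string
// to opaque names like _0x1, _0x2, ... (a simplified stand-in for real
// obfuscators, which operate on the parsed AST rather than raw text).
function renameIdentifiers(source, names) {
  let out = source;
  names.forEach((name, i) => {
    const opaque = `_0x${(i + 1).toString(16)}`;
    // Word boundaries keep us from rewriting substrings of longer names.
    out = out.replace(new RegExp(`\\b${name}\\b`, "g"), opaque);
  });
  return out;
}

const original = "function add(a, b) { return a + b; }";
const obfuscated = renameIdentifiers(original, ["add", "a", "b"]);
console.log(obfuscated);
// → function _0x1(_0x2, _0x3) { return _0x2 + _0x3; }
```

Real obfuscators additionally apply control-flow flattening, string encoding, and dead-code injection, which is why obfuscated samples stress an analyzer far more than renamed ones.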
The results of our analysis include the responses of different models to each code sample.
If you use our dataset or results, please cite:

```bibtex
@article{fang2023large,
  title={Large language models for code analysis: Do {LLMs} really do their job?},
  author={Fang, Chongzhou and Miao, Ning and Srivastav, Shaurya and Liu, Jialin and Zhang, Ruoyu and Fang, Ruijie and Asmita, Asmita and Tsang, Ryan and Nazari, Najmeh and Wang, Han and others},
  journal={arXiv preprint arXiv:2310.12357},
  year={2023}
}
```