Skip to content

Code for paper "An Empirical Study on Noisy Label Learning for Program Understanding" (in ICSE 2024)

Notifications You must be signed in to change notification settings

jacobwwh/noise_SE

Repository files navigation

An Empirical Study on Noisy Label Learning for Program Understanding

Requirements

pytorch

transformers

DGL

Datasets:

Code classification: CodeNet Link

Vulnerability detection: Devign link, please follow the instructions in link for preprocessing the data.

Code summarization: TLC Link

The samples with human evaluation results are stored in data/

Models

Trained-from-scratch models: LSTM, GNN, Transformer (summarization)

Pre-trained models: CodeBERT, GraphCodeBERT, UniXCoder, PLBART (summarization)

How to Run

Program classification and vulnerability detection: python run_classification_xxx.py

Code summarization: see code_sum/

About

Code for paper "An Empirical Study on Noisy Label Learning for Program Understanding" (in ICSE 2024)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published