Skip to content

yrahul3910/ghost-dl

Repository files navigation


about :: code :: license :: contribute :: issues :: contact

       


On the Value of Oversampling for Deep Learning in Software Defect Prediction

This repository is the reference implementation for the paper in the title above.

GHOST (Goal-oriented Hyperparameter Optimization for Scalable Training) is a paradigm for fast training and tuning of deep learners for software engineering. We use DODGE for hyperparameter optimziation; as such, the code is based off the DODGE repository. We use our lab's internal package, raise-utils, for standardized, high-quality code.

Usage

  • Use pip install -r requirements.txt to install all the python3 dependencies.
  • Run python3 steps.py

Within steps.py, you will see a dictionary of options that can be turned on or off. This was used to perform the ablation study. You can choose how you wish to run GHOST using this dictionary as well as the helper functions below. To run the Scott-Knott tests, we used our package's implementation (see the docs here).

Paper

Our paper was accepted to IEEE Transactions on Software Engineering 2021. You can view it here.

Cite this

@article{yedida2021value,
  title={On the Value of Oversampling for Deep Learning in Software Defect Prediction},
  author={Yedida, Rahul and Menzies, Tim},
  journal={IEEE Transactions on Software Engineering},
  year={2021},
  publisher={IEEE}
}

About

GHOST paradigm for deep learning in SE

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages