Skip to content

BoShiangKe/InfluenceAUC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

influenceAUC Package

CRAN_Status_Badge Downloads License R build status

Description

influenceAUC is an R package that focuses on identifying influential observations from the perspective of model diagnostics in binary classification. Most of the related research is based on the assumption that positive instances tend to have higher values, then the sources of influential cases can categorize as follows:

  • negative cases with comparatively higher scores
  • positive cases with relatively lower scores

The proposed methods rely on the area under the receiver operating characteristic curve (AUC) and cumulative lift chart (CLC), which indirectly facilitate the methods suitable to any classifiers with continuous score outputs. The theoretical approaches evaluate the influences of observations to the overall AUC, and adjusted CLCs offer the existence and approximate locations of those influential cases through data visualization. Because each method may have its pros and cons, we suggest end-users to apply all of them together to reach reliable results. Please see the reference for more information.

AUC

  • Influence function focuses on case-deletion diagnostics with potential masking effect limitations.
  • Local influence quantifies influences of all observations simultaneously to alleviate the masking effect, but the results may be misleading due to the well-known imbalanced data effect.

CLC

  • Positive CLC reveals negative cases with comparatively higher scores
  • Negative CLC uncovers positive cases with relatively lower scores

These modified CLCs disclose influential observations without masking and imbalanced data effects but lack quantitative values for further comparison.

Author

Bo-Shiang Ke, Yuan-chin Ivan Chang and Wen-Ting Wang

Maintainer

Bo-Shiang Ke

Reference

Ke, B. S., Chiang, A. J., & Chang, Y. C. I. (2018). Influence Analysis for the Area Under the Receiver Operating Characteristic Curve. Journal of biopharmaceutical statistics, 28(4), 722-734.

License

GPL-3

Citation

  • To cite package influenceAUC in publications use:

    Ke B, Chang Y, Wang W (2020). influenceAUC: Identify Influential Observations in Binary Classification. R package version 0.1.2, https://CRAN.R-project.org/package=influenceAUC.

  • A BibTeX entry for LaTeX users is

    @Manual{, title = {influenceAUC: Identify Influential Observations in Binary Classification}, author = {Bo-Shiang Ke and Yuan-chin Ivan Chang and Wen-Ting Wang}, year = {2020}, note = {R package version 0.1.2}, url = {https://CRAN.R-project.org/package=influenceAUC}, }

About

R package - identify influential observations through AUC and CLC.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages