An R package for analyzing categorical variable splitting patterns in decision trees and random forests

halleewong/cofa

CoFA

This R package provides an implementation of the methods described in:

Categorical Co-Frequency Analysis: Clustering Diagnosis Codes to Predict Hospital Readmissions
Hallee E. Wong, Brianna C. Heggeseth, Steven J. Miller

Installation

To install and load this package in R from GitHub, run the following commands:

install.packages("devtools")
library(devtools) 
install_github("halleewong/cofa")
library(cofa)
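
As an alternative to the commands above, the following is a minimal sketch that assumes the lighter-weight remotes package (not mentioned in this README) is acceptable in place of devtools. It installs remotes only when it is missing, installs cofa from GitHub, and then lists whatever the package exports, which can serve as a quick check that the installation worked. The set of exported names is not documented here, so the sketch simply prints them.

```r
# Install 'remotes' only if it is not already available (assumption: using
# remotes instead of devtools; both provide install_github()).
if (!requireNamespace("remotes", quietly = TRUE)) {
  install.packages("remotes")
}

# Install the package from the GitHub repository named in this README.
remotes::install_github("halleewong/cofa")

# Attach the package and list the objects it exports as a sanity check.
library(cofa)
ls("package:cofa")
```

Calling functions with the `remotes::` prefix avoids attaching an extra package to the session just to perform the installation.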

The script test/test_adult.R shows an example that applies functions from this package to the Adult data set from the UCI Machine Learning Repository.

Development

This package is under active development and may change substantially with each commit.

Reference

If you use our code, please cite our paper.

@misc{wong2019categorical,
    title={Categorical Co-Frequency Analysis: Clustering Diagnosis Codes to Predict Hospital Readmissions},
    author={Hallee E. Wong and Brianna C. Heggeseth and Steven J. Miller},
    year={2019},
    eprint={1909.00306},
    archivePrefix={arXiv},
    primaryClass={stat.AP}
}
