Skip to content

📝 ML Paper implementation of machine learning paper, chimerge

Notifications You must be signed in to change notification settings

Anylee2142/ChiMerge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Outlines

  1. Implementation of ChiMerge (https://www.aaai.org/Papers/AAAI/1992/AAAI92-019.pdf)

Works sklearn way

  1. Supervised discretization using target, Chi2 statistics & test
  2. Can be configured to multiprocess (n_jobs)
from discretization.chi_merge import *
chi_merge = ChiMerge(con_features=X.columns, significance_level=0.1, n_jobs=-3)
chi_merge.fit_transform(X)

Concept

It follows below rules.

  • If continuous feature is discretized,
    1. Within interval, class frequency is stable.
    2. Two adjacent intervals should have no similar class frequencies.
  • This is tested by Chi2 test

Random thoughts

  • What if considering k-adjacent, not 2-adjacent ?

    Should be normalized and reflected to formula in paper

References

TODO

  • dataset to s3

About

📝 ML Paper implementation of machine learning paper, chimerge

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published