Java Machine Learning / Optimization package


Rudder is a meant to provide a machine learning framework and package. Using rudder you can extend a base record class and use annotations to define your features and labels, then pass those records into an analytics algorithm without having to write any additional connectors.

There will also be a few utilities that will come in and out of existence as this project grows and is refined.

Everything in this framework should be thread safe, if you find something that is not, please post an issue, a patch, or a unit test illustrating the issue.

Example of running clustering

class MyRecordClass  {
    double myFeature1
    double myFeature2
    double myFeature3
    String myLabel; 
    Object nonFeatureItem;
    String nonFeatureOrLabelString;

    void setLabel(String label);

    //Only use non-primitives in the set method signature
    void setLabel(Double doubleLabel);

    //Other stuff I want to do

    //This API uses equals often, so your object should override it
    boolean equals(Object o);

class MyClusterRunner {
    public static void main(String[] args) {
        double epsilon = 4d;  //Maximum distance between records in a cluster
        int minClusterSize = 3;  //Minimum number of records in a cluster
        IRudderList<MyRecordClass> myRecords = new RudderList<MyRecordClass>();
        //populate your object  
        DBScan dbscan = new DBScan(epsilon, minClusterSize);
        ArrayList<Cluster> clusters = db.getClusters(); 
        //do stuff with your clusters i.e. print record label and cluster
        for(int i=0; i<clusters.size(); i++) {
            for(MyRecordClass r : clusters.getElements()) {
                System.out.println("Cluster " + i+ ": " + myRecords.getStringLabel(r));