Java Machine Learning Project By Brendan C Reidy
For C++/CUDA GPU implementation of this project click here
The Java DNN program can be used to train models using FP32, Binary Neural Network (BNN), or Ternary Neural Network (TNN) weights. These are the same networks implemented in our paper "An In-Memory Analog Computing Co-Processor for Energy-Efficient CNN Inference on Mobile Devices". The examples in this repository are specifically for the MNIST dataset, but the code can be used with any dataset.
Instructions for downloading and formatting the MNIST dataset can be found here
Files in CSV format are loaded into two-dimensional floating point arrays using the MatrixIO object. Example:
float[][] trainingData = MatrixIO.readTrainingData("FILE_NAME", NUM_LINES); // Loads csv file to trainingData variable
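For context, loading a CSV file into a `float[][]` amounts to something like the following. This is a minimal standalone sketch for illustration only, not the project's actual MatrixIO implementation; the `CsvMatrix` class and its `read` method are hypothetical names:

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;

public class CsvMatrix {
    // Reads up to numLines comma-separated rows into a float[][].
    // Hypothetical helper; the real MatrixIO.readTrainingData may
    // handle headers, labels, or missing values differently.
    public static float[][] read(Reader source, int numLines) throws IOException {
        float[][] rows = new float[numLines][];
        try (BufferedReader br = new BufferedReader(source)) {
            for (int r = 0; r < numLines; r++) {
                String line = br.readLine();
                if (line == null) break; // file has fewer lines than requested
                String[] cells = line.split(",");
                rows[r] = new float[cells.length];
                for (int c = 0; c < cells.length; c++)
                    rows[r][c] = Float.parseFloat(cells[c].trim());
            }
        }
        return rows;
    }
}
```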
To create a neural network, start by instantiating a NeuralNetwork object:
NeuralNetwork network = new NeuralNetwork(); // Initializes object
Next you want to set the training and validation (test) data for the network:
network.setTrainingData(trainingData, trainLabels);
network.setTestingData(testingData, testLabels);
Next you can begin defining the topology of the network. Start with the input layer:
network.addLayer(new Input2D(28, 28)); // Creates input layer; MNIST's dimensions are 28x28
Next you can begin defining the intermediate layers (you can define as many or as few as you like):
network.addLayer(new FullyConnectedBinary(16, "reverse sigmoid")); // Creates a hidden layer with 16 hidden neurons and uses 'reverse sigmoid' as the activation function (see 'Activation functions')
Next define the output layer:
network.addLayer(new FullyConnectedBinary(10, "reverse sigmoid")); // 10 neurons, reverse sigmoid activation
Now that the neural network has been created, you can run the network for one epoch with:
network.train();
Run for multiple epochs with:
for(int i=0; i<numEpochs; i++) {
    network.train();
}
You can find the validation accuracy by adding the following line before running the train method:
network.addDatasetAnaylsisModel(new MaxOutputAccuracy());
You can save the neural network to a file using the following:
network.saveToFile("OUTPUT_DIRECTORY_NAME/");
This will auto-generate a README with info about the network, a file with the training and validation cost over time, and all of the floating point and ternary/binary weights and biases throughout the network.
The in-circuit neural network uses 'reverse sigmoid' as the activation function, which is why it is used in training as well.
Activation functions can be easily implemented by modifying the following template (we'll use Sigmoid as an example):
public class Sigmoid implements ActivationFunction {
    String name = "Sigmoid";

    // Standard logistic function: 1 / (1 + e^-x)
    public float sigmoid(float x)
    {
        return 1 / (1 + (float) Math.exp(-x));
    }

    // Derivative of the sigmoid, expressed in terms of its output y
    public float inverseSigmoidDerivative(float y)
    {
        return y*(1-y);
    }

    // Applies the activation element-wise to a layer's outputs
    public float[] activate(float[] aLayer)
    {
        for(int i=0; i<aLayer.length; i++)
            aLayer[i] = sigmoid(aLayer[i]);
        return aLayer;
    }

    // Element-wise derivative of the activation, used during backpropagation
    public float[] activationError(float[] aLayer)
    {
        for(int i=0; i<aLayer.length; i++)
            aLayer[i] = inverseSigmoidDerivative(aLayer[i]);
        return aLayer;
    }

    public String getName()
    {
        return this.name;
    }

    public String toString()
    {
        return this.name;
    }
}
Change the "activate" method to your activation function and the "activationError" method to you error created by that layer (propogated through the model)
If you use this code, please cite:
@INPROCEEDINGS{9516756,
author={Elbtity, Mohammed and Singh, Abhishek and Reidy, Brendan and Guo, Xiaochen and Zand, Ramtin},
booktitle={2021 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)},
title={An In-Memory Analog Computing Co-Processor for Energy-Efficient CNN Inference on Mobile Devices},
year={2021},
volume={},
number={},
pages={188-193},
doi={10.1109/ISVLSI51109.2021.00043}}