
improve speed of SparseAutoencoder and make it more flexible #451

Merged
merged 63 commits into mlpack:master on Jan 30, 2016

Conversation

stereomatchingkiss
Contributor

This commit intends to accomplish two things:

1 : improve the speed of SparseAutoencoder
I cache the computation result in a data member so that the algorithm does not need to compute it twice. Since the member function is const, I declared the cached data member as mutable. Does anyone think making the member function non-const would be better? (A rough sketch of the caching pattern is shown after item 2.)

2 : make the SparseAutoencoder more versatile
I added two template parameters as follows:

template<typename HiddenLayer, typename OutputLayer>
class SparseAutoencoderFunction;

This allows users to use different layers from ann to compute the features.
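For illustration, here is a minimal sketch of both changes (the member names, the layer interface, and the objective below are assumptions for the sketch, not the actual implementation):

#include <mlpack/core.hpp>

// Minimal sketch: the function class is parameterized on the hidden and
// output layer types, and the forward-pass results computed inside the const
// Evaluate() are cached in mutable members so that Gradient() does not have
// to repeat the forward pass.
template<typename HiddenLayer, typename OutputLayer>
class SparseAutoencoderFunctionSketch
{
 public:
  explicit SparseAutoencoderFunctionSketch(const arma::mat& data) :
      data(data) { }

  double Evaluate(const arma::mat& parameters) const
  {
    // Forward pass; HiddenLayer and OutputLayer are assumed to expose a
    // static fn(input, output), like the activation functions in ann.
    HiddenLayer::fn(parameters * data, hiddenActivations);
    OutputLayer::fn(parameters.t() * hiddenActivations, outputActivations);

    // Squared reconstruction error as a stand-in for the real objective.
    return arma::accu(arma::square(outputActivations - data)) / data.n_cols;
  }

  void Gradient(const arma::mat& parameters, arma::mat& gradient) const
  {
    // The backward pass would start from the cached hiddenActivations and
    // outputActivations here, instead of recomputing the forward pass.
    gradient.zeros(arma::size(parameters));
  }

 private:
  const arma::mat& data;

  // mutable so that the const member functions above can update the cache.
  mutable arma::mat hiddenActivations;
  mutable arma::mat outputActivations;
};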

@rcurtin
Member

rcurtin commented Sep 23, 2015

So you mentioned in IRC that this implementation is faster than the implementation in src/mlpack/methods/sparse_autoencoder; can I get a little more information on how you calculated that? I don't doubt your results or anything, I just want to run some tests of my own and see if I can replicate the speedup on other interesting architectures and systems. :)

// Sparse autoencoder function (greedy).
using SAEFG = ann::SparseAutoencoderFunction<FSigmoidLayer, FSigmoidLayer, std::true_type>;

BOOST_AUTO_TEST_SUITE(SparseAutoencoderTest2);
Member

Just a note -- when this is merged, we can remove the old sparse_autoencoder_test.cpp and revert this to SparseAutoencoderTest instead of SparseAutoencoderTest2.

@stereomatchingkiss
Contributor Author

1 : can I get a little more information on how you calculated that?
Of course; the file sparse_autoencoder_function already provides an example (I wrote it as part of the class comments). I hope this is a correct way to measure the performance. (A minimal timing sketch is also included at the end of this comment.)

2 : This file looks like it's heavily based on the code in mlpack/methods/sparse_autoencoder/sparse_autoencoder_function.hpp
Yes, I put it in ann mainly because this class breaks the old API, and SparseAutoencoder looks like a member of the neural network family (I am not an expert on neural networks, please correct me if I am wrong). For backward compatibility I did not change the original version.

3 : But maybe it would be a better idea to make this class work with the Trainer class in src/mlpack/methods/ann/
I do not know how to make it work yet, but if it can, it should provide a unified interface. I will wait for zoq's comments.
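The timing sketch mentioned above could look something like this (just an illustration, not the exact example from the class comments; Evaluate() is assumed to be the function being benchmarked):

#include <armadillo>

// Time a single objective evaluation with arma::wall_clock; comparing this
// number for the old and new implementations on the same data gives the
// speedup figure.
template<typename FunctionType>
double TimeEvaluate(FunctionType& f, const arma::mat& parameters)
{
  arma::wall_clock timer;
  timer.tic();
  f.Evaluate(parameters);  // the call being benchmarked
  return timer.toc();      // elapsed wall-clock time in seconds
}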

a : add Train functions
b : add a new constructor
c : add Serialize

2 : move it from ann to methods/sparse_autoencoder
(because submat of arma is a proxy object)
2 : make the code meet the requirements of the style guide
// Logistic (sigmoid) activation, applied elementwise, without overflow checks.
template<typename InputVecType, typename OutputVecType>
static void fn(const InputVecType& x, OutputVecType& y)
{
  y = (1.0 / (1 + arma::exp(-x)));
}
Member

Originally my intention with the overflow checks in the LogisticFunction class was to avoid strange issues during the training process. But I intend to remove the checks from the LogisticFunction class; if you agree, we can remove the LazyLogisticFunction class and just use the LogisticFunction class.
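For context, the kind of overflow guard being discussed looks roughly like this (an illustrative sketch, not necessarily the exact mlpack code):

#include <cmath>
#include <armadillo>

// Guarded logistic function: for very large |x| the saturated 0/1 limit is
// returned directly so that std::exp() cannot overflow.  The unguarded
// version in the snippet above simply evaluates 1 / (1 + exp(-x)).
static double GuardedLogistic(const double x)
{
  if (x < arma::Datum<double>::log_max)
  {
    if (x > -arma::Datum<double>::log_max)
      return 1.0 / (1.0 + std::exp(-x));

    return 0.0;  // x is very negative: exp(-x) would overflow, so saturate at 0.
  }

  return 1.0;    // x is very large: the logistic saturates at 1.
}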

Contributor Author

I am totally fine with that.

2 : add static functions to initialize weights
3 : fix bug: parameters were not initialized correctly
@stereomatchingkiss
Contributor Author

After some experiments, I found that not all of the ann layers are suitable for SparseAutoencoder; some need modification (e.g. you cannot just call the dropout function, you need to call an activation function like sigmoid or ReLU after calling Forward). I would like to open two new folders

1 : SparseAutoencoder/layers
2 : SparseAutoencoder/activation_functions

to collect the usable layers and activation functions; the implementation will be based on the layers and activation functions of ann.

Besides, I think the function "GetNewFeatures" of SparseAutoencoder should call the activation function of the hidden layer rather than the sigmoid.
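A minimal sketch of what that could look like (the parameter names and the ActivationFunction typedef on the hidden layer are assumptions, not the actual code):

// Map the input through the hidden layer's own activation instead of a
// hard-coded sigmoid; the layer is assumed to expose its activation as
// HiddenLayer::ActivationFunction.
template<typename HiddenLayer>
void GetNewFeatures(const arma::mat& w1,    // hidden-layer weight matrix
                    const arma::mat& b1,    // hidden-layer bias column
                    const arma::mat& data,  // one column per data point
                    arma::mat& features)
{
  // Affine transform of the input, then the hidden layer's activation.
  HiddenLayer::ActivationFunction::fn(
      w1 * data + arma::repmat(b1, 1, data.n_cols), features);
}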

I will remove LazyLogisticFunction and remove the checks in LogisticFunction, as zoq mentioned.

Any suggestions?

Edit :
After these changes, the FineTuneFunction (another pull request) has to accept the SparseAutoencoder functions rather than the input data, since the GetNewFeatures function depends on the policy of the HiddenLayer.

@stereomatchingkiss
Contributor Author

Sorry for the slow response

No worries; thanks for the review. I know it takes time to do these things.

I guess since the forward and backward functions provide the same output as the old code, the test should run without any problems. If that's the case I'll be happy to merge the changes.

If you think so, then I will write a simple test case to check the output of Forward and Backward later on. After that is done, we can move on to the next problem.

@zoq
Member

zoq commented Jan 23, 2016

No need to write another test for the forward and backward function, if the existing sparse autoencoder test works with the new code it's absolutely fine. I guess sparse_autoencoder_test_2.cpp is the modified test and we can remove the sparse_autoencoder_test.cpp file and rename the other test.

@stereomatchingkiss
Contributor Author

I guess sparse_autoencoder_test_2.cpp is the modified test and we can remove the sparse_autoencoder_test.cpp file and rename the other test.

I would like to do that too, but the problem is that the old SparseAutoencoder and the new one do not have the same API, so the tests need some changes.

@stereomatchingkiss
Contributor Author

I uploaded the test file; the name is "sparse_autoencoder_test_3.cpp".
If you think it is OK, you could rename it to "sparse_autoencoder_test".

The API of FFN is different from the original SparseAutoencoder, so
we need to make some changes to the original code.

@stereomatchingkiss
Contributor Author

Finally, all the work is done. I made some changes to FFN (it no longer takes a reference to the outputLayer, but instead exposes a public API to users; this makes serialization easier).

I use FeedForward to get the Evaluate value (the Evaluate API does not give the value).
I call FeedForward and FeedBackward to get the gradient value.
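Roughly, the flow looks like this (just a sketch; the member function names and signatures below are assumptions based on the description above, not the exact FFN API):

// Evaluate the objective and the gradient through the network; the names
// FeedForward, FeedBackward, and Gradient used here are hypothetical.
template<typename NetworkType>
double EvaluateSketch(NetworkType& network,
                      const arma::mat& input,
                      arma::mat& gradient)
{
  arma::mat error;

  // Forward pass: for an autoencoder the target is the input itself, and the
  // returned objective serves as the Evaluate() value.
  const double objective = network.FeedForward(input, input, error);

  // Backward pass, then collect the gradient for the current parameters.
  network.FeedBackward(input, error);
  network.Gradient(gradient);

  return objective;
}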

Please check the test cases; they test with the same data (except for the gradient part) but with a different API. I hope my implementation meets the semantic requirements.

If you think everything is in order, you can merge it now. Thanks for taking the time to review.

Edit :
After this is finished, let us finish the serialization part. Besides, does the convolution layer provide padding options?

@rcurtin
Member

rcurtin commented Jan 25, 2016

When the merge is done, I'll go ahead and add a reverse-compatibility layer for the 2.x.x releases and then remove the old SparseAutoencoder code.

zoq added a commit that referenced this pull request Jan 30, 2016

Improve speed of SparseAutoencoder and make it more flexible.
@zoq zoq merged commit 33082f0 into mlpack:master Jan 30, 2016
@zoq
Member

zoq commented Feb 1, 2016

Thanks for the contribution. I made a couple of changes:

  • moved the main sparse autoencoder into a separate folder in 4ad39f8
  • minor formatting and comment fixes in 896937d03c5eb32a4e980
  • modified the sparse autoencoder test in 443ecdc, so that it uses the SparseAutoencoder class you already implemented. Since we already have a gradient test for each activation function, I removed the gradient sparse autoencoder test.

Let me know if I messed anything up.

Since we have this nice SparseAutoencoder class, it should be easy to provide a reverse-compatibility layer for the 2.x.x releases. I'll go and write the necessary code if nobody else really wants to do it.

We should also think about a test case that tests the code in combination with an optimizer; I ran into a couple of problems when I tested the code with the existing trainer class, and I solved the issues in f34ae33. Another test could also check the ability to work with additional layers. We only test the standard sparse autoencoder model structure (input layer, hidden layer, output layer), which is fine since the former code uses this static model structure, but since we build the sparse autoencoder using the ann modules, we have the ability to add a bunch of interesting layers, e.g. a Dropout layer. (A rough sketch of an optimizer-based test follows below.)
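Such a test might look roughly like this (the class name, namespaces, and constructor arguments below are assumptions rather than the merged API):

// Optimize a small sparse autoencoder with L-BFGS and check that the
// objective actually decreases; the names below are assumed, not the merged API.
BOOST_AUTO_TEST_CASE(SparseAutoencoderOptimizerSketchTest)
{
  // Small random dataset: 10-dimensional points, 5 hidden units.
  arma::mat data = arma::randu<arma::mat>(10, 200);
  mlpack::nn::SparseAutoencoderFunction saf(data, 10, 5);

  mlpack::optimization::L_BFGS<mlpack::nn::SparseAutoencoderFunction>
      optimizer(saf);

  arma::mat parameters = saf.GetInitialPoint();
  const double initialObjective = saf.Evaluate(parameters);
  const double finalObjective = optimizer.Optimize(parameters);

  BOOST_REQUIRE_LT(finalObjective, initialObjective);
}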

@zoq
Member

zoq commented Feb 1, 2016

Also, I think a command-line program would be a neat feature.

@stereomatchingkiss
Contributor Author

Thanks for the fix.

I will provide a command-line program with different activations (ReLU, tanh, sigmoid) and dropout (if it works) after serialization of the FFN is done.
