
Support for autoencoders in greedy layer-wise pre-training #50

Closed
wants to merge 7 commits into aigamedev:master from leconteur:master

Conversation

leconteur

This pull request presents a prototype to include autoencoder support in scikit-neuralnetwork. It does not include tests, but it does include a working example.

@coveralls

Coverage Status

Coverage decreased (-11.96%) to 88.04% when pulling 345a28d on leconteur:master into 61dca5c on aigamedev:master.

@alexjc
Member

alexjc commented May 15, 2015

OK, this is looking promising! I got it to work with minor changes after merging the latest code.

I'd suggest we iterate on the external API first. How does this look to you:

    # Setup as normal, with some additional parameters.
    nn = mlp.Classifier(
        layers=[
            L("Rectifier", units=32, pretrain_type='denoising', pretrain_corruption=0.5)],
        n_iter=100,
    )

    # Creates unsupervised trainer automatically, copies over weights when done.
    nn.pretrain(X_digits, layers=1)

    # Works as normal, no changes.
    nn.fit(X_digits, y_digits)

Things like tied_weights should be on by default, no? Also, act_enc and act_dec should always be the same, and match the activation used in the original layer, no?

Thoughts welcome!

@ssamot
Contributor

ssamot commented May 16, 2015

My two cents:

Autoencoders can be used to compress/denoise stuff, a nice "transform" operation in sklearn terms. The transformed output (top layer inputs * weights) can then be thrown to any classifier/regressor to learn on top of it, which should be pretty cool. So I think it has to be a separate module/class (or whatever), and if you want to mix with the MLPs you can do as in the example; if not, you can use them, say, in a pipeline with any classifier/regressor on top, which should be nice. I am not sure the MLP class should have any knowledge of this.
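
Roughly what I mean, as a self-contained sketch (the EncoderTransform class here is purely illustrative, not sknn or pylearn2 code):

    # Minimal sketch (illustrative only): an encoder exposing the sklearn
    # transformer interface so it can be stacked under any classifier in a
    # Pipeline. A real autoencoder would learn self.weights_ by minimising
    # reconstruction error; fit() below only illustrates the contract.
    import numpy as np
    from sklearn.base import BaseEstimator, TransformerMixin
    from sklearn.pipeline import Pipeline
    from sklearn.linear_model import LogisticRegression

    class EncoderTransform(BaseEstimator, TransformerMixin):
        def __init__(self, n_components=32):
            self.n_components = n_components

        def fit(self, X, y=None):
            # Placeholder for unsupervised training; just random weights here.
            rng = np.random.RandomState(0)
            self.weights_ = rng.normal(size=(X.shape[1], self.n_components))
            return self

        def transform(self, X):
            # "Top layer inputs * weights", squashed through the activation.
            return np.tanh(np.dot(X, self.weights_))

    pipeline = Pipeline([('encode', EncoderTransform(n_components=16)),
                         ('clf', LogisticRegression())])
    # pipeline.fit(X_digits, y_digits) then works like any other estimator.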

@alexjc
Member

alexjc commented May 17, 2015

The problem I noticed is that autoencoders only seem to support sigmoid and tanh activations, so other activations may benefit much less from pre-training, and could even give worse results?

@alexjc
Member

alexjc commented May 17, 2015

Based on @ssamot's comment, I started an autoencoder branch to add support for AEs first; then we can figure out the pre-training.

alexjc added a commit that referenced this pull request May 17, 2015
@leconteur
Author

I agree with @ssamot that the autoencoder should implement the transform interface of sklearn. However, I also think it is a good idea for the mlp class to accept a PretrainedLayer in its constructor.

The transform could probably be implemented by using the "encode" method of the pylearn2 autoencoder class that forms the last layer of the network.
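
Roughly like this (just a sketch; it assumes pylearn2's symbolic encode() method and compiles it into a callable with Theano):

    # Sketch: wrap pylearn2's symbolic encode() into a numeric transform().
    # Assumes `autoencoder` is a trained pylearn2 autoencoder instance.
    import theano
    import theano.tensor as T

    def make_transform(autoencoder):
        X_sym = T.matrix('X')
        encode_fn = theano.function([X_sym], autoencoder.encode(X_sym))

        def transform(X):
            # Encode inputs using the autoencoder's learned weights.
            return encode_fn(X)
        return transform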

@alexjc
Member

alexjc commented May 18, 2015

I'm considering it now... One problem I see currently with PretrainedLayers is that we'd have to support serialization separately from the way we do now, since they are not "regular" layers that had their weights copied from elsewhere.

@leconteur
Author

I don't think we need serialization at first. However, all that is needed for a pretrained layer is to pass it the pylearn2 layer in its constructor.

The reason I think this is important is that it facilitates fine-tuning a layer that could have been trained in a variety of ways. Pretrained layers are also already implemented in pylearn2.

If you do want to implement them, I think it should be done in a way similar to this: the autoencoder should have a method that returns a list of its autoencoder layers wrapped in an sknn layer with a type of 'pretrained'. The mlp should then have a condition in its create_layer method that calls the right pylearn2 constructor.
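
Something along these lines (a hypothetical sketch; the 'pretrained' layer type and the pylearn2_layer attribute are made up for illustration, only pylearn2's PretrainedLayer is an existing class):

    # Hypothetical sketch of the dispatch described above. PretrainedLayer is
    # pylearn2's wrapper for reusing an already-trained model as an MLP layer;
    # the 'pretrained' type and `pylearn2_layer` attribute are illustrative.
    from pylearn2.models.mlp import PretrainedLayer

    def create_layer(name, layer):
        if layer.type == 'pretrained':
            # Reuse the already-trained pylearn2 autoencoder for fine-tuning.
            return PretrainedLayer(layer_name=name,
                                   layer_content=layer.pylearn2_layer)
        # ...otherwise construct a fresh pylearn2 layer as before.
        raise NotImplementedError("only the 'pretrained' case is sketched")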

I do understand, however, that the use case where I need this feature does not map cleanly onto the scikit-learn API and its usual usage. The main problem I have is that this kind of algorithm is hard to fit into the scikit-learn pipeline paradigm.

@coveralls

Coverage Status

Coverage decreased (-13.06%) to 86.94% when pulling 8eb245c on leconteur:master into d221c57 on aigamedev:master.

@alexjc
Member

alexjc commented May 19, 2015

About the features in the auto-encoder, do your final neural networks also use sigmoid and tanh? ReLU doesn't seem to be supported out of the box, but could be added... otherwise it seems like you'd be better off training a full MLP in an unsupervised style.

@leconteur
Author

I don't think I'll need ReLU activation. My use case is very similar to the example I pushed, except that the pretraining is done on another dataset.

Sorry about the other changes; I seem to have misunderstood some details about pull requests.

@alexjc
Member

alexjc commented May 19, 2015

No problem about the Pull Request. All changes in that branch are automatically posted.

We won't merge this PR since it contains your IDE files too :-)

@alexjc
Member

alexjc commented May 22, 2015

Closing this since there are lots of secondary files that we don't want merged. Continuing discussion in #35; also see recent commit [8a9701a].

alexjc closed this May 22, 2015
