
PES learning rule on decoders #202

Closed · wants to merge 241 commits into from

Conversation

tbekolay (Member)

This is a first proof of concept of decoder learning in new Nengo. It just uses the PES rule to learn better decoders, but the way it's implemented is how all the other learning rules will have to be implemented, so I wanted to get feedback on the syntax before I make this work on weights and implement hPES, etc.

First off, here it is learning a communication channel super fast. It starts off with decoders that compute f(x) = -1, but learns the communication channel after that.
[animation: learning]
Obviously a trivial example, but it's the same math as before so whatev's.

Check out learn_communicationchannel.py and see what you think of the syntax. It's just a new kwarg to the connect call (and therefore to the Connection classes).
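
For concreteness, here's a minimal sketch of the proposed syntax, assuming the current model.make_ensemble / model.connect API. This is not copied from learn_communicationchannel.py, and the ensemble parameters are made up:

    import numpy as np
    import nengo

    model = nengo.Model('Learn communication channel')

    # Made-up sizes; the real script may differ
    model.make_ensemble('Pre', nengo.LIF(100), dimensions=1)
    model.make_ensemble('Post', nengo.LIF(100), dimensions=1)
    error = model.make_ensemble('Error', nengo.LIF(100), dimensions=1)

    # Start from decoders that compute f(x) = -1; the PES rule then
    # learns the communication channel from the error signal
    model.connect('Pre', 'Post',
                  function=lambda x: -1 * np.ones(x.shape),
                  learning_rule=nengo.PES(error))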

Things I would appreciate feedback on:

  1. Does it make sense to be a kwarg to connect? I'm thinking that the way we pass neuron types to ensembles should be the same way we pass learning rules to connect, but perhaps the semantics of that don't fit as nicely as I think they do.
  2. Right now the PES class gets connection information by sneakily storing the connection it's part of. This was just easier, but I can definitely see why this could be confusing. Neurons, for example, don't keep a handle to the ensemble they're part of, and instead have functions that the ensemble calls. Perhaps learning rules should follow this? Or perhaps neuron types should get a handle to their ensemble? This made creating build_pes very easy, because build_connect doesn't know anything about learning rules. But we have to enforce that learning rules build after all connections. Perhaps the same could happen with neural nonlinearities, which would simplify the build_ensemble method?
  3. This may only be true of the decoder PES rule, but I think I could do it using a DotInc or something rather than making SimPES. Is that true? It might involve reshaping some signals, though; is it worth it? (See the sketch below.)
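
(For reference on item 3: the PES decoder update is just an outer-product increment, so in principle it maps onto a reshape plus a DotInc-style Y += A·X. A standalone numpy sketch of the equivalence, with made-up sizes and names:)

    import numpy as np

    n, d = 100, 1                    # neurons, dimensions (made up)
    kappa = 1e-4                     # learning rate
    activities = np.random.rand(n)   # filtered presynaptic activities, shape (n,)
    error = np.random.rand(d)        # decoded error signal, shape (d,)

    # The SimPES-style update: Delta d = kappa * outer(error, activities)
    update_outer = kappa * np.outer(error, activities)

    # The same update as one matrix product, i.e. DotInc(A, X, Y): Y += A.dot(X),
    # after reshaping the signals to (d, 1) and (1, n)
    update_dotinc = kappa * error.reshape(d, 1).dot(activities.reshape(1, n))

    assert np.allclose(update_outer, update_dotinc)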

tbekolay and others added 30 commits November 7, 2013 18:23

- …with OpenCL objects. Also renamed James's opencl stuff.
- Running nosetests in the project should now complete successfully, pass >= three tests, and skip the rest of them.
- Moved Sim* classes to simulator.py, but kept math in nonlinear.py.
- Mainly this is work in the Ensemble-creation logic.
- New make_input using Direct mode makes the decoded signal show up a timestep later than before. This is tested directly in test_simulator, so I removed the assert from test_old_api.
- Recent refactoring means that the old seed is interpreted differently, resulting in a slightly less accurate fast probe. The other probes are as accurate as before.
- Initial commit of neuron connection class.
- Making tests into unit tests makes it easier to import them in nengo_ocl, swap the `Simulator` class attribute, and re-run all the tests with a different simulator. I'd like to do more tests this way, rather than having them be just `test_foo` functions at the module level of test files.
- It doesn't yet assert the correctness of the output signal, but it builds the right graph and provides an option to show the converged signals.
- This turned out to be relatively easy, since the simulator was effectively allocating signal-like buffers for the input, output, and bias terms of nonlinearities anyway. This change also had the unintended but nice consequence of removing the need for separate neuron connections. These are now just encoders whose signal is the `output_signal` of some nonlinearity.
- More to come; this should probably be included in the simulator_objects constructors, to help with debugging in general.
- Filling in support for rate mode; trying to track down a bug in the handling of the LIF bias.
- There is a bug in the current handling of the neuron bias; trying to find it.
- Double-storage of bias_signal.value and bias caused incorrect simulation.
- It is now an error to make a filter or transform whose output signal is a constant. model.filter() and model.transform() check this condition; it might be more correct to move this check to the constructors of the respective objects.
- Encoder, Decoder, Transform, and Filter coefficients have shapes, can potentially change over time (plasticity / adaptation / learning), and are indexed the same way as other signals in the OpenCL codebase. This change standardizes the handling of numbers within the simulator by making all these constants into signals.
@tcstewar (Contributor)

Given the just-recently-completed discussion on changing the syntax of the API, what are you thinking of for this syntax? Will it be:

with model:
   # Create a modulated connection between the 'pre' and 'post' ensembles
   nengo.Connection(pre, post, function=lambda x: -1 * np.ones(x.shape),
              learning_rule=nengo.PES(error))

or will it be:

with model:
   # Create a modulated connection between the 'pre' and 'post' ensembles
   nengo.LearningConnection(pre, post, function=lambda x: -1 * np.ones(x.shape),
              learning_rule=nengo.PES(error))

or even:

with model:
   # Create a modulated connection between the 'pre' and 'post' ensembles
   nengo.PESConnection(pre, post, error, function=lambda x: -1 * np.ones(x.shape))

I think I'd lean towards the middle option.

I also don't quite understand the function argument. Is that going to be something that people have to specify all the time? Or is that just you initializing it to something weird for this one particular case?

How do people specify an initial weight matrix, or an initial random distribution of weights?

(Basically, I'm trying to think of the common use cases people want when they add in a learning rule)

# Create ensembles
model.make_ensemble('Pre', nengo.LIF(N * D), dimensions=D)
model.make_ensemble('Post', nengo.LIF(N * D), dimensions=D)
error = model.make_ensemble('Error', nengo.LIF(N * D), dimensions=D)

Collaborator

Mixing objects and strings... my eyes, my eyes!!!

@tbekolay (Member Author)

I was just initializing it to something weird, because otherwise it already does a communication channel. This is just the decoder level learning; in the weight learning case, it will be however we specify weight matrices for non-learning neuron-to-neuron connections (hence @hunse's question about making sure those work).

Right now, I'm proposing the top option. I did consider the two possibilities you presented, and in fact, originally used something like the middle one (but not quite). The reason I went with this one is to A) make the use of neurons and learning rules essentially the same, where neurons apply to ensembles and learning rules apply to connections, and B) reduce duplication of connection code, which should be unchanged except for the additional learning op that modifies the decoder / weight signals.

A few more possibilities:

with model:
   conn = nengo.Connection(pre, post, function=lambda x: -1 * np.ones(x.shape))
   pes = nengo.PES(conn, error)

or

with model:
   conn = nengo.Connection(pre, post, function=lambda x: -1 * np.ones(x.shape))
   pes = nengo.LearningRule(conn, nengo.PES(error))

The advantage of these two is that applying multiple learning rules to the same connection is much cleaner. This is one place where the analogy "neuron type is to ensemble as learning rule is to connection" breaks down, because an ensemble can only have one neuron type, but a connection can have multiple learning rules. We could handle this by accepting a list for learning_rule, but that's a bit messy (see the sketch below).
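
(For comparison, the list version would look something like this; BCM here is just a hypothetical second rule for illustration:)

with model:
   conn = nengo.Connection(pre, post, function=lambda x: -1 * np.ones(x.shape),
              learning_rule=[nengo.PES(error), nengo.BCM()])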

@jaberg (Contributor) commented Nov 13, 2013

Aren't there a significant variety of proposed and possibly-worth-implementing learning mechanisms? What's the purpose or sense of a "learning rule" in general?

I think the suggestion with nengo.PES(conn, error) is best because it's short, non-committal regarding what nengo provides, and gets just as much info from the user as anything else.

@tcstewar (Contributor)

I like what @jaberg is suggesting here. It may be very premature to talk about a generic LearningRule class, when we've only got one example of it and we know that different learning rules require very different backend implementations.

I also really like the idea of initially making a connection exactly how you would normally, then adding a learning rule to it. That could work very well in teaching situations.

with model:
   conn = nengo.Connection(pre, post, function=lambda x: -1 * np.ones(x.shape))
   pes = nengo.PES(conn, error)

This is also the first time I've seen a use for keeping the Connection object around for later.

@tbekolay (Member Author)

The nengo.LearningRule is to make it clear what PES is. Looking at this script alone doesn't tell you what nengo.PES does, but with nengo.LearningRule you should get it without having to turn to documentation. But I don't really have any strong opinions on any of these options; at this point, any of them is easy to implement.

@tcstewar (Contributor)

Ooh, let me update my suggestion a bit:

with model:
   conn = nengo.Connection(pre, post, function=lambda x: -1 * np.ones(x.shape))
   nengo.PESLearning(conn, error)

@hunse (Collaborator) commented Nov 13, 2013

@tcstewar : just what I was thinking 👍

@tcstewar (Contributor)

(calling it PESLearning I think helps make it semi self-documenting)

@tbekolay (Member Author)

Sorry, I didn't mean to introduce a generic LearningRule class. Consider this the first option instead.

with model:
   conn = nengo.Connection(pre, post, function=lambda x: -1 * np.ones(x.shape))
   model.learn(conn, nengo.PES(error))

or

conn = model.connect(pre, post, function=lambda x: -1 * np.ones(x.shape))
model.learn(conn, nengo.PES(error))

@hunse (Collaborator) commented Nov 13, 2013

@tbekolay : we had dreamed at one point about being able to write a learning rule as a Python function in a script, so that the user can easily play around with different learning rules. Do you think this will be possible with the way you have things set up? Could it be done through a CustomLearningRule class? Or is there some better way to do it? Obviously this wouldn't be the type of thing that would be supported by all backends, but could we at least get it to work in the reference simulator, and maybe OCL, to make it easy to prototype learning rules? Should I add another issue for this?

@tbekolay (Member Author)

Nah, this is a WIP so we can talk about learning rules in general here. The problem with that is the need to have a common interface for all learning rules, which is like, impossible. In Java, I split it into learning rules without error signals (unsupervised) and those with error signals (supervised) which seemed like a general split, and you could essentially ask people to provide the step function, given what the step function uses in PES. So, it's possible, but I think there are 4 cases: unsupervised decoder (encoder?) learning, unsupervised weight learning, supervised decoder learning, supervised weight learning. Is it useful to expose all those cases, and have PES / HPES be instances of those learning rule classes? I don't know if I'm convinced that that's useful...
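
To make that split concrete, here's a hypothetical sketch of the supervised-decoder case, where the user supplies the step function. None of these class names exist in Nengo, and the sign convention depends on whether the error is defined as (target - actual) or (actual - target):

    import numpy as np

    class SupervisedDecoderRule(object):
        # Hypothetical base class: the simulator would call step each timestep
        def __init__(self, learning_rate=1e-4):
            self.learning_rate = learning_rate

        def step(self, activities, error, decoders, dt):
            raise NotImplementedError

    class PES(SupervisedDecoderRule):
        def step(self, activities, error, decoders, dt):
            # Delta d = kappa * dt * outer(error, activities);
            # flip the sign if error is (actual - target)
            decoders += self.learning_rate * dt * np.outer(error, activities)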

@jaberg (Contributor) commented Dec 12, 2013

Hey guys, I'd like to revive this issue. I wrote a learning rule for learning a classifier. I wrote it to do some comparisons with less-neural machine learning algorithms, but I wonder if it might be interesting as a learning rule for the BG?

Anyway, I think I'd like to add it to this PR so that if others agree it would be useful in Nengo, then we can coordinate the two of them so they match.

First item of business though: rebasing this on master. Has anyone (@tbekolay) done this? I'll start with that.

@tbekolay (Member Author)

I haven't rebased to master yet, so go nuts! Should it be part of the same PR though? Perhaps we could make it another PR but base that one on the PES branch? Or is that too hard to coordinate?

@jaberg (Contributor) commented Dec 12, 2013

@tbekolay Sounds good. I'll make the new rule a PR against the PES branch.

@studywolf (Collaborator)

What's the status with learning now? I saw the PR in #232; any movement on this since then?

@tbekolay (Member Author)

I haven't worked on it, but @e2crawfo has, and has it working with Nengo OCL. #232 introduced some things in addition to the learning that make it incompatible with the current codebase. I suspect that learning rules are going to need something similar to what's being discussed in #285, so perhaps I'll give this a rewrite in that style.

@youssefzaky (Contributor)

I've been using the PES rule from the cleanup_learning branch; it seems to work fine.

@studywolf (Collaborator)

Gahh, that branch isn't up to date with the current syntax though.

@arvoelke (Contributor)

See #303.

@tbekolay added this to the 2.0.0 release milestone Apr 23, 2014
@tbekolay (Member Author)

Now superseded by other learning PRs, so closing.

@tbekolay closed this May 24, 2014
@tbekolay deleted the pes branch May 24, 2014 14:24