Learning unmodulated #642

hunse · 2015-02-04T19:25:23Z

This makes it so that we connect directly to LearningRules instead of making modulatory connections, as discussed in #632. It fixes that problem.

The second commit switches the sign of the PES error, so that it is actually an error (i.e. the actual value minus the target value). This means that the PES learning rule moves in the opposite direction of the error, as one would expect (e.g., if the output is too large, make it smaller). This makes a lot more sense to me, and hopefully to everyone else as well. Now seemed like a good time to do it, because the changes from the first commit will require that everyone re-write their learning code anyway.

EDIT: this also addresses #366.

drasmuss · 2015-02-16T19:23:15Z

I keep going back and forth on the error sign switch. It makes for a pretty sharp discontinuity with previous models. Also, the PES rule is essentially just a renaming of the delta rule, and the delta rule is usually written the way the PES rule is now (i.e. target - actual).

But you're right that in that case, referring to the feedback signal as an "error" is kind of odd. What we probably should have done is named it "feedback" instead of "error" or something. I don't know whether it's better to change our nomenclature to match the implementation, or change the implementation to match the nomenclature.

celiasmith · 2015-02-16T20:18:24Z

Every rule is essentially just a renaming of the delta rule ;)

Daniel Rasmussen wrote:

I keep going back and forth on the error sign switch. It makes for a
pretty sharp discontinuity with previous models. Also, the PES rule is
essentially just a renaming of the delta rule, and the delta rule is
usually written the way the PES rule is now (i.e. |target - actual|).

But you're right that in that case, referring to the feedback signal
as an "error" is kind of odd. What we probably should have done is
named it "feedback" instead of "error" or something. I don't know
whether it's better to change our nomenclature to match the
implementation, or change the implementation to match the nomenclature.

—
Reply to this email directly or view it on GitHub
#642 (comment).

drasmuss · 2015-02-16T20:45:17Z

In that it's all just gradient descent, yeah, although some are more similar than others 😉. And in general with gradient descent the convention is to use the error signal as in this PR, which is the main argument for the swap I think. It's probably more likely that people used to things like backprop would be confused by the swapped error in the current implementation, as @hunse suggests.

hunse · 2015-05-19T14:47:36Z

Do people have comments on this, or is it ready to go?

tbekolay · 2015-05-22T16:00:15Z

I'll review this shortly. I'm considering the error signal sign flip uncontested!

tbekolay · 2015-05-23T20:54:46Z

nengo/utils/simulator.py

@@ -83,10 +83,6 @@ def validate_ops(sets, ups, incs):
    for node in sets:
        assert len(sets[node]) == 1, (node, sets[node])

-    # -- assert that only one op updates any particular view
-    for node in ups:
-        assert len(ups[node]) == 1, (node, ups[node])


Hm, is this OK? Why did we have this restriction in the first place?

I couldn't think of a good reason to have it. The original reason may have just been because we always used updates for filters, in which case the signal in the filter being updated should only ever be updated once.

When you get into multiple updates per step, there is a possibility that the order of updates matters. I was going to say that that's not a problem if the updates are just increments, but that's not true. Any learning rule that depends on the magnitude of the connection weights will act differently if it's computed before or after another rule modifying the same weights. Maybe this is something we need to consider? We should at least be able to make this deterministic within a model, so I don't think it's too big of an issue.

Yeah, I think this is something we need to consider... I'm sure the difference (in this case) is pretty small, but it could be large in some cases. I'm not sure how we'd order ops that both update the same signal... So yeah, I'd be keen to keep this assertion in and figure out another way to do these updates. I think the PreserveValue op from before was pretty good.

tbekolay · 2015-05-23T20:58:58Z

This was a bit old, so I gave it a rebase. Looking good!

Only two things I think before merge. First is the all-important changelog entry. I also had a thought about the modulator parameter... rather than completely remove all mentions of it here, how about for 2.1.x, if someone tries to set modulator to True, we raise a ValueError explaining the new API for learning rules, or at least pointing them to this PR (or a doc page on the learning API or something)? Definitely we will have some people upgrade Nengo without reading the changelog, so I think it'll be kinder to tell them about this change when we know it's happening (setting modulatory=True).

hunse · 2015-05-24T21:04:41Z

Adding that error sounds like a good idea.

arvoelke · 2015-05-28T21:27:18Z

Updated the changelog and made it so that you get

Parameter is no longer supported; please refer to pull-request #642: modulatory connections removed

This introduced a new AbandonedParam parameter. My only concern is that this should not encourage people to abandon parameters liberally and destroy backwards compatibility.

arvoelke · 2015-05-28T21:48:59Z

Should #643 be implemented here too, in order to contain these rather significant changes to PES to a single PR? (Note: The work is done, I just need to know where to attach)

neworderofjamie · 2015-05-29T16:06:08Z

Hello Nengoers, I'm trying to restore my SpiNNaker version of PES to it's former glory and the learning rate seems to have been scaled somehow. In the old learn_communications_channel example it was 1 and now it's 1e-6 - How has the calculation of the PES update changed?

THANKYOU!

drasmuss · 2015-05-29T16:26:52Z

It was probably this change #527, where things were changed to scale with dt. I know we had also talked about changing the learning rate to scale with the number of neurons in the pre-synaptic population (#643), but I'm not sure if that actually happened yet?

neworderofjamie · 2015-05-29T16:40:36Z

The implementation I was working on at last years summer school multiplied the learning rate by dt (in seconds) - Is that the scaling in question or something more cunning involved?

drasmuss · 2015-05-29T18:17:39Z

It's a little bit more complicated, in that what's actually happening is that the output activities are being scaled by 1/dt. So the a(x) in the PES equations are all larger than they were before, so the learning rate needs to be scaled down accordingly (@hunse made the change so he can double check that I'm not saying lies here).

arvoelke · 2015-05-29T18:23:02Z

@neworderofjamie That sounds right, although the correct way to do it may depend on whether your spikes are scaled by dt (you can see how this was done in #527 as Dan said). Note that the above commit (dc22f16) also divides the effective learning rate by n_neurons in the pre population, and this PR also flips the sign of the error signal. So the final rule is

where kappa is the learning rate (invariant under dt), n is number of neurons in pre, E is the error signal, and a is the filtered pre-synaptic activity.

This PR just needs a final review now and then it should be good to go.

hunse · 2015-05-29T19:26:24Z

Can we change AbandonedParam to DeprecatedParam?

Also, did we want to add scaling by number of pre neurons in this PR? Might as well change it now, and revise the learning rates again. EDIT: oh sorry, just saw the commit for that.

hunse · 2015-05-29T19:27:58Z

CHANGES.rst

+- ``LearningRuleType`` objects no longer take modulatory connections.
+  Instead, a connection should be made directly from the error population
+  to the learning rule. Also, the error's sign has been flipped for
+  the PES learning rule to be ``actual - target``.


"learning rule" -> "learning rule error"

arvoelke · 2015-05-29T19:56:06Z

Sorry about throwing more onto the pile, but realized that the pre-activity should be filtered. It was just a one line change in the builder, and a new parameter for PES. Note that the parameter order for PES is different from BCM, otherwise all the examples (and possible models that people have already built), need to change their learning rate to a key-word argument.

Addressed @hunse comments. We agree'd that ObsoleteParam is best, since deprecated implies it still works.

tbekolay · 2015-06-13T22:41:02Z

This makes sense to me; we don't use any neural info from the post population in decoder PES, so why not. I wonder if we should revisit the modifies attribute in the learning rule types? Right now it's either 'Neurons' 'Ensemble' or both. Maybe it should be 'transform' and 'decoders', since that's what the learning rule actually modifies.

arvoelke · 2015-06-13T22:42:54Z

+1 to revisiting modifies. Voja modifies encoders, and so my solution in that PR was to have a post_modifies, but makes more sense to say what signals are being modified.

tbekolay · 2015-06-14T16:10:57Z

OK, 'Neurons' -> 'transform' and 'Ensemble' -> 'decoders' in 44768d4. I think neither of the last two commits need changelog entries? Only people implementing learning rules would care...

hunse · 2015-06-15T21:14:01Z

Added a fixup for neuron to ensemble learning connections. This should all be cleaned up in the monolithic-transform branch (it'll need rebasing, though).

hunse · 2015-06-17T16:25:11Z

Added a test for neuron to ensemble PES learning. I think this should be good to go, @tbekolay.

tbekolay · 2015-06-17T16:26:16Z

Great; I'll look this over this afternoon!

tbekolay · 2015-06-17T19:52:23Z

The new tests LGTM! @arvoelke mentioned that he wanted to give this a look over too. I'll probably merge tomorrow afternoon unless someone forcibly stops me.

Connect the error directly into a learning rule (as discussed in #632), instead of making a modulatory connection. This is cleaner and clearer. Learning rules are now built by their parent connections, and since a connection to a learning rule must added to a network after the connection containing the learning rule, the learning rule will always be built when the builder for the error connection into said learning rule is called. This fixes #632 (slicing `post` in learned connections). The test `test_learning_rules.py:test_pes_decoders_multidimensional` has been updated to test this feature.

The PES rule previously moved in the direction of what it called the "error" signal, meaning that the error signal had to be the target value for the learning minus the actual value. This is the opposite of what people typically refer to as error. This commit changes the sign of this error so that it is actually an error, and the PES rule moves in the opposite direction of the error.

This operator is a bit hacky, but after some experimenting, this is essentially the cleanest way to ensure that all learning operations are applied correctly. Increments and reads happen *after* a signal is set. By setting the transform with PreserveValue, we allow increments to be done by learning rules. The only other alternative is to update it, which happens *last*. It's not obvious why one learning rule would be applied before another, so by using PreserveValue, we do the delta computations on the previous timestep, and then increment the transform on the current timestep's 'increments' phase. The transform isn't updated, but that's okay because it's set.

BCM and Oja calculate their delta, then apply it to the transform. PES skipped this step and just incremented the transform (or decoders) directly. By refactoring PES to calculate the delta and then apply it, we get two big benefits: 1. The order in which learning rules are applied no longer matters; the delta calculations are all done before the transform changes, since delta calcs are updates and the transform changes are incs. 2. The code for defining a delta signal and applying it to the transform can be refactored into the `LearningRule` builder, rather than being duplicated in each `LearningRuleType` builder. This should make learning rules easier to implement in the future. As another minor benefit, it's now possible to probe the 'delta' of each learning rule. This makes it easy to see exactly what each each learning rule is doing, which is especially useful when you're using multiple learning rules at the same time. Also made a few other minor enhancements: better docstrings for `SimOja` and `SimBCM`, and a note about ordering of connections in the `Network` builder.

Previously, we used the type of the pre or post object as a clue to what the learning rule modifies. This makes it more explicit by saying what signal the learning rule actually modifies. This will make it easier to implement learning rules that modify other quantities, like for example `post.encoders`.

All PES tests now run through a single helper function, making the code more concise. I also added a test for neuron to ensemble PES learning, as well as a test for PES learning when using a weight solver.

tbekolay · 2015-06-19T14:25:01Z

OK, this branch is now rebased to master, and the history's been squashed and whatnot. I think this is ready to merge; will do so in an hour or so unless anyone stops me!

mundya · 2015-06-19T17:00:34Z

Awesome!

hunse added the needs review label Feb 4, 2015

hunse mentioned this pull request Feb 5, 2015

Should we normalize learning rate by # neurons? #643

Closed

hunse force-pushed the learning-unmodulate branch from 17a8729 to c1ae217 Compare February 5, 2015 21:12

hunse force-pushed the learning-unmodulate branch from c1ae217 to 9169d9f Compare February 18, 2015 21:52

hunse force-pushed the master branch from e4e7977 to e89fb03 Compare February 18, 2015 22:04

hunse force-pushed the learning-unmodulate branch 2 times, most recently from b31d72a to 9822751 Compare February 18, 2015 22:19

tbekolay added this to the 2.1.0 release milestone Mar 3, 2015

tbekolay self-assigned this May 22, 2015

tbekolay force-pushed the learning-unmodulate branch from 9822751 to 327245f Compare May 23, 2015 20:52

tbekolay reviewed May 23, 2015
View reviewed changes

hunse reviewed May 29, 2015
View reviewed changes

arvoelke mentioned this pull request Jun 2, 2015

Vector Oja unsupervised encoder learning rule #727

Merged

tbekolay force-pushed the learning-unmodulate branch from dceab9d to 44768d4 Compare June 14, 2015 16:07

mundya mentioned this pull request Jun 16, 2015

Implement PES project-rig/nengo_spinnaker#29

Closed

2 tasks

hunse mentioned this pull request Jun 17, 2015

Learning rules on recurrent connections from ens.neurons cause graph cycle #744

Closed

hunse and others added 4 commits June 19, 2015 10:09

tbekolay force-pushed the learning-unmodulate branch from 075eb45 to ed8a566 Compare June 19, 2015 14:19

arvoelke and others added 6 commits June 19, 2015 10:19

Disallow setting Connection.modulatory

766d745

Scale PES by n_neurons

3f16974

Filter pre activity in PES

f0b451b

Nodes allowed to be post on learned connection

3252403

Combined PES tests, added test for neuron -> ens

ed8a566

All PES tests now run through a single helper function, making the code more concise. I also added a test for neuron to ensemble PES learning, as well as a test for PES learning when using a weight solver.

tbekolay merged commit ed8a566 into master Jun 19, 2015

tbekolay deleted the learning-unmodulate branch June 19, 2015 16:24

s72sue mentioned this pull request Jul 7, 2015

Assigning BCM learning rule to connections without weight matrix gives misleading error #732

Closed

hunse mentioned this pull request Sep 8, 2015

Connect to a connection instead of modulatory=True #366

Closed

tbekolay added reviewed and removed needs review labels Aug 8, 2016

tbekolay removed their assignment Oct 16, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Learning unmodulated #642

Learning unmodulated #642

hunse commented Feb 4, 2015

drasmuss commented Feb 16, 2015

celiasmith commented Feb 16, 2015

drasmuss commented Feb 16, 2015

hunse commented May 19, 2015

tbekolay commented May 22, 2015

tbekolay May 23, 2015

hunse May 24, 2015

tbekolay Jun 10, 2015

tbekolay commented May 23, 2015

hunse commented May 24, 2015

arvoelke commented May 28, 2015

arvoelke commented May 28, 2015

neworderofjamie commented May 29, 2015

drasmuss commented May 29, 2015

neworderofjamie commented May 29, 2015

drasmuss commented May 29, 2015

arvoelke commented May 29, 2015

hunse commented May 29, 2015

hunse May 29, 2015

arvoelke commented May 29, 2015

tbekolay commented Jun 13, 2015

arvoelke commented Jun 13, 2015

tbekolay commented Jun 14, 2015

hunse commented Jun 15, 2015

hunse commented Jun 17, 2015

tbekolay commented Jun 17, 2015

tbekolay commented Jun 17, 2015

tbekolay commented Jun 19, 2015

mundya commented Jun 19, 2015

Learning unmodulated #642

Learning unmodulated #642

Conversation

hunse commented Feb 4, 2015

drasmuss commented Feb 16, 2015

celiasmith commented Feb 16, 2015

drasmuss commented Feb 16, 2015

hunse commented May 19, 2015

tbekolay commented May 22, 2015

tbekolay May 23, 2015

Choose a reason for hiding this comment

hunse May 24, 2015

Choose a reason for hiding this comment

tbekolay Jun 10, 2015

Choose a reason for hiding this comment

tbekolay commented May 23, 2015

hunse commented May 24, 2015

arvoelke commented May 28, 2015

arvoelke commented May 28, 2015

neworderofjamie commented May 29, 2015

drasmuss commented May 29, 2015

neworderofjamie commented May 29, 2015

drasmuss commented May 29, 2015

arvoelke commented May 29, 2015

hunse commented May 29, 2015

hunse May 29, 2015

Choose a reason for hiding this comment

arvoelke commented May 29, 2015

tbekolay commented Jun 13, 2015

arvoelke commented Jun 13, 2015

tbekolay commented Jun 14, 2015

hunse commented Jun 15, 2015

hunse commented Jun 17, 2015

tbekolay commented Jun 17, 2015

tbekolay commented Jun 17, 2015

tbekolay commented Jun 19, 2015

mundya commented Jun 19, 2015