
WIP: unify Learner1D, Learner2D and LearnerND #220

Open · wants to merge 105 commits into base: main
Conversation

@jbweston (Contributor) commented on Oct 6, 2019

While writing the adaptive paper, we managed to write down a simple algorithm formulated in terms of abstract domains and subdomains.

Implementing such an algorithm in a learner should allow us to unify Learner1D, Learner2D and LearnerND.
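
As a rough sketch of that loop (the Domain and queue method names below are illustrative assumptions, not this PR's API):

import heapq
import itertools

def run(domain, f, loss, n_points):
    # Max-priority queue of subdomains keyed by loss; heapq is a min-heap,
    # hence the negated keys, and the counter breaks ties between equal losses.
    tiebreak = itertools.count()
    queue = [(-loss(sub), next(tiebreak), sub) for sub in domain.subdomains()]
    heapq.heapify(queue)
    for _ in range(n_points):
        _, _, sub = heapq.heappop(queue)   # subdomain with the largest loss
        x = sub.choose_point()             # e.g. a midpoint or circumcenter
        y = f(x)                           # evaluate the learned function
        # Splitting at x replaces 'sub' by smaller subdomains; requeue them.
        for new_sub in domain.split_at(x):
            heapq.heappush(queue, (-loss(new_sub), next(tiebreak), new_sub))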

@jbweston (Contributor, Author) commented on Oct 6, 2019

The following are not yet supported:

  • 2D and 3D domains
  • BalancingLearner (requires removing points from the interior of domains)

@akhmerov (Contributor) commented on Oct 6, 2019

Also, compared to the existing implementation, global rescaling is missing (e.g. computing the y-scale of the values and normalizing the data by it).

@jbweston (Contributor, Author) commented on Oct 6, 2019

> Also, compared to the existing implementation, global rescaling is missing (e.g. computing the y-scale of the values and normalizing the data by it).

Should this perhaps be something that the loss function does itself? For example, the isosurface loss needs the unmodified data. I could imagine extending the loss function signature to include the x and y scales.
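
For instance, something along these lines (a purely hypothetical signature; subdomain.vertices is an invented accessor):

# Hypothetical extended signature: the loss receives the raw data plus the
# global scales and decides for itself whether to normalize.
def default_loss(subdomain, data, x_scale, y_scale):
    ys = [data[x] / y_scale for x in subdomain.vertices()]
    return max(ys) - min(ys)   # spread of the normalized values

def isosurface_loss(subdomain, data, x_scale, y_scale, level=0.0):
    # Needs the unmodified data, so it simply ignores the scales.
    ys = [data[x] for x in subdomain.vertices()]
    return 1.0 if min(ys) <= level <= max(ys) else 0.0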

@akhmerov (Contributor) commented on Oct 6, 2019

> I could imagine extending the loss function signature to include the x and y scales.

Indeed, but then the learner needs two extra hooks: one at each step to update the global metrics, and another to trigger loss recomputation for all subdomains once the global metrics change enough that the old losses become obsolete.
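
Schematically (the hook and attribute names here are invented):

# Hook 1 updates the global metrics on every new value; hook 2 invalidates
# and recomputes all cached losses once the metrics have drifted too far.
def tell(self, x, y):
    self.data[x] = y
    old_scale = self.y_scale
    self.y_scale = max(self.y_scale, abs(y))
    if old_scale > 0 and self.y_scale / old_scale > 1.1:
        for sub in self.domain.subdomains():
            self.queue.update(sub, priority=self.loss(sub))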

@jbweston (Contributor, Author) commented on Oct 6, 2019

> Indeed, but then the learner needs two extra hooks: one at each step to update the global metrics, and another to trigger loss recomputation for all subdomains once the global metrics change enough that the old losses become obsolete.

I added this now.

@jbweston (Contributor, Author) commented on Oct 8, 2019

TODO

  • don't evaluate boundary points in __init__
  • revisit the loss function signature
  • add tests

@basnijholt (Member) commented

All other learners implement pending_points, which is a set.

Would that change anything? Now I see you set self.data[x] = None.
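
For comparison, a simplified sketch of the pending_points convention used by the other learners:

class Learner:
    def __init__(self):
        self.data = {}
        self.pending_points = set()    # instead of data[x] = None sentinels

    def tell_pending(self, x):
        self.pending_points.add(x)

    def tell(self, x, y):
        self.pending_points.discard(x)
        self.data[x] = y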

@jbweston (Contributor, Author) commented on Oct 13, 2019

Now I believe the only three things left to do are:

  • decide on the loss function signature (at the moment the loss function gets all the data, etc.)
  • implement scaling within the loss functions
  • decide whether to overwrite the existing LearnerND with this one (Bas says yes)

@jbweston (Contributor, Author) commented on Oct 13, 2019

There's also the question of how to review this PR; it's a lot of code.

It may also be that this implementation is inferior to the current LearnerND and we don't want to merge it at all. For example, the old LearnerND is 1180 lines, whereas this new implementation is 1350 (including the priority queue, domain definitions, etc.).

Commit: This case is very common when inserting one point and then splitting at the same point. This change gives a ~25% speedup to the new LearnerND.
vectors = np.subtract(coords[1:], coords[0])
if np.linalg.matrix_rank(vectors) < dim:
    raise ValueError(
        "Initial simplex has zero volumes "
Contributor commented:

It's not necessarily a single simplex at this point; this should read "Hull has a zero volume", right?

Contributor Author replied:

Yes. This error message was already wrong before I touched it. Maybe I should update it.

__all__ = ["Domain", "Interval", "ConvexHull"]


class Domain(metaclass=abc.ABCMeta):
Contributor commented:

I think it would be useful to add a class docstring with a description of the important properties of this object.
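
Something along these lines, say (the listed properties are only a guess at what matters most here):

import abc

class Domain(metaclass=abc.ABCMeta):
    """A region over which a function is learned, divided into subdomains.

    Sketch of a possible docstring.  Properties worth stating explicitly:

    * the subdomains cover the domain and their interiors do not overlap;
    * every point added to the domain is a vertex of at least one subdomain,
      and a point on a shared boundary belongs to all adjacent subdomains;
    * splitting a subdomain reports which subdomains were added and which
      were removed, so that callers can keep a priority queue in sync.
    """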

# in 'sub_intervals' in a SortedList.
self.bounds = (a, b)
self.sub_intervals = dict()
self.points = SortedList([a, b])
Contributor commented:

It would in principle be sufficient to store only the boundaries of the interval and always query for pairs of neighboring points. On the other hand, the current implementation is quite OK.
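
That is, something like the following boundaries-only sketch:

from sortedcontainers import SortedList

a, b = 0.0, 1.0                # example bounds
points = SortedList([a, b])

def containing_interval(x):
    # Assumes a <= x < b; the subinterval containing x is recovered on
    # demand from its pair of neighboring points (O(log n) bisection).
    i = points.bisect_right(x)
    return points[i - 1], points[i]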

Commit: Before this fix, when telling more than one point, it was possible that we would attempt to remove subdomains from the queue that were only produced "temporarily" when adding points.

Commit: These are not catching an interesting class of bugs for now. We should reintroduce these when we come up with a firmer learner API.

Commit: This does not test very degenerate domains, but for the moment we just want to test that everything works for domains defined over hulls with different numbers of points. Adaptive will typically not be used on very degenerate domains.

Commit: We now use numpy.random for generating random points, which produces much less degenerate examples than using Hypothesis' "floats" strategy. At this point we don't want to test super awkward cases.
@akhmerov (Contributor) commented
Re: loss function format

In ND we can pass the following data:

  1. The original simplex coordinates
  2. An array of the following tuples:
  • Coordinates of the point being replaced
  • Index of the simplex in which a point is replaced
  • Number of the point being replaced

In 1D we probably should adopt the naturally ordered format.
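
In code, the ND payload could look roughly like this (the field names are invented for illustration):

from typing import List, NamedTuple
import numpy as np

class VertexUpdate(NamedTuple):
    point: np.ndarray       # coordinates of the point being replaced
    simplex_index: int      # index of the simplex in which it is replaced
    vertex_number: int      # which vertex of that simplex is replaced

def loss(simplex: np.ndarray, updates: List[VertexUpdate]) -> float:
    """'simplex' holds the original simplex coordinates, shape (dim + 1, dim)."""
    ...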

@akhmerov mentioned this pull request on Dec 21, 2019