DM-13065: SipApproximation #303

TallJimbo · 2018-01-02T23:13:44Z

Best to start with the docs for the afw/geom/SipApproximation class and the afw/math/polynomials namespace (in afw/math/polynomials.h) to get the big picture.

r-owen

I'm going to postpone the rest of the review until I can see DM-13156 and DM-13157 in JIRA (which is down at the moment).

r-owen · 2018-01-08T16:57:34Z

include/lsst/afw/math/polynomials/BinomialMatrix.h

+
+    // Because binomial coefficients only depend on two ints, and we compute
+    // them via a recurrence relation, we actually store all of the ones we
+    // have every calculated in a static matrix.  We expand that matrix


every -> ever

r-owen · 2018-01-08T17:06:48Z

include/lsst/afw/math/polynomials/Basis1d.h

+ *  A basis concept for 1-d series expansions.
+ *
+ *  @note This class is only present in the documentation, as it represents an
+ *        abstract concept.


I have never seen this done before. Clearly you could make this a real class by using template parameters. I assume you are trying to avoid inheritance, as per your documentation in polynomials.h ("the classes... are not polymorphic (no virtual functions)"). A few more words of explanation here would be appreciated.

Concepts are very much a part of C++ - they're not in the language yet, but they've been a part of how people think about the language since the inception of the STL (see, e.g. http://en.cppreference.com/w/cpp/concept). Sometimes people try to implement those with a templated base class (i.e. CRTP, as in the Eigen class hierarchies), and I tried that a bit here, but I ultimately rejected that approach because I felt the associated boilerplate obfuscated things more than it helped.

But this way of documenting a Concept by telling Doxygen that it's a class is something new; it seemed the best way to provide a separate documentation page with the right sections that had more-or-less the right relationship to the concrete classes that implement the Concept. I'm pretty happy with the result, but documentation is best judged by someone other than the author and I'm open to other ways of documenting this. Given how focused it is on C++, it's a bit strange that Doxygen doesn't include something native for Concepts.

A few words in the documentation would be helpful -- at least a link to the cppreference page you cite. I had no idea the word "concept" was formal in this note.

Good idea. Will do. I'll also capitalize Concept in the docs, which I should have done before.
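For readers who have not met the term, here is a minimal sketch of what "modeling a Concept" means in practice: a class that supplies the required members without inheriting from anything, and a template that works with any such class. The names `MonomialBasis1d`, `size`, `fill`, and `sumExpansion` are hypothetical and chosen for illustration; they are not necessarily the interface defined in this PR.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical type that models a Basis1d-like Concept: it provides size()
// and fill(), but has no base class and no virtual functions.
class MonomialBasis1d {
public:
    explicit MonomialBasis1d(std::size_t order) : _order(order) {}

    // Number of basis functions (order + 1).
    std::size_t size() const { return _order + 1; }

    // Write the basis function values 1, x, x^2, ... into `out`.
    void fill(double x, std::vector<double> & out) const {
        out.assign(size(), 1.0);
        for (std::size_t n = 1; n < out.size(); ++n) {
            out[n] = out[n - 1] * x;
        }
    }

private:
    std::size_t _order;
};

// Generic code written against the Concept rather than a base class:
// any type with compatible size() and fill() members can be passed in.
template <typename Basis>
double sumExpansion(Basis const & basis, std::vector<double> const & coefficients, double x) {
    std::vector<double> values;
    basis.fill(x, values);
    double result = 0.0;
    for (std::size_t n = 0; n < values.size() && n < coefficients.size(); ++n) {
        result += coefficients[n] * values[n];
    }
    return result;
}
```

The requirement on `Basis` is checked only when the template is instantiated, which is why the Concept itself exists only in the documentation.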

r-owen · 2018-01-08T17:17:05Z

include/lsst/afw/math/polynomials.h

+ *   - PackedIndexIterator and PackedIndexRange, which handle the flattening
+ *     of pairs of 1-d polynomials coefficients and basis functions into 1-d
+ *     arrays.
+ */


I strongly suspect you should put this code (including useful functions based on it) in a new package, rather than afw, since it seems potentially generally useful and has a minimum of dependencies. I'm thinking along the lines of your RFC for using sphgeom and I'd like to avoid a change like that in the future.

Working on this was part of the inspiration for that RFC, but because some of this code does depend on afw.geom it can't be moved out of afw yet, and I'd rather not split it up based on that dependency problem; I'd rather address the dependency problem by moving both the geometry primitives and this code to another package. But since that's a lot more work, I can't make a good case that it's what LSST should spend its time on, and hence the RFC.

r-owen

More comments. Still working my way through this.

r-owen · 2018-01-10T18:26:48Z

include/lsst/afw/math/polynomials/PackedBasis2d.h

+    /// Construct workspace for a basis with the given order.
+    explicit PackedBasisWorkspace2d(std::size_t order) : _x(order + 1), _y(order + 1) {}
+
+    /// Return the maximum order this workspace can support.


I know this is picky, but I suggest "can support" -> "supports". I first misunderstood this as the maximum order you can request for this class, partly because I think I recall reading about other classes in this ticket that are vaguely similar but whose size can be increased if needed.

Also, if the workspace may contain data for a smaller order than the maximum, is it reasonable to call this getMaxOrder in hopes of perhaps eventually having getOrder return the current order?

r-owen · 2018-01-10T18:30:06Z

include/lsst/afw/math/polynomials/PackedBasis2d.h

+
+
+/**
+ *  A workspace object that can be used to void extra memory allocations in


void -> avoid

r-owen · 2018-01-10T18:39:06Z

include/lsst/afw/math/polynomials/PackedBasis2d.h

+ *
+ *  If @f$B_n(x)@f$ are the basis functions for the nested Basis1d, the basis
+ *  functions of a PackedBasis2d with order @f$N@f$ are @f$B_m(x)B_n(y)@f$ for all combinations
+ *  with @f$m + n \le N@f$.


If I understand correctly, this gives a linear way of going through x and y coefficients. I know of two standard orders for doing this: x0y0 x1y0 x0y1 x2y0 x1y1 x0y2.... and x0y0 x0y1 x1y0 x0y2 x1y1 x2y0.... I believe we have different bits of code in our stack that use each of the two orders. The point is: please document which order you are using here, and anywhere else this ambiguity comes up. I hope whatever it is will match the existing FunctionLibrary.h code and will become our de-facto standard.

This is documented in the classes in PackedIndex.h; I've added a reference to that documentation here. The convention is the latter of the two examples you gave. I'm not sure which one FunctionLibrary uses, but this is the one used by lsst.shapelet and (internally) by ChebyshevBoundedField.
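As an illustration of that convention (x0y0 x0y1 x1y0 x0y2 x1y1 x2y0 ...), the flattened position of the x^m y^n term can be computed as sketched below. The helper names `packedOffset`, `packedIndex`, and `packedSize` are hypothetical; the authoritative definitions are the ones documented in PackedIndex.h.

```cpp
#include <cstddef>

// Number of packed terms whose total order (m + n) is strictly less than
// `order`: 1 + 2 + ... + order.
constexpr std::size_t packedOffset(std::size_t order) {
    return order * (order + 1) / 2;
}

// Flattened position of the x^m y^n term. Terms are grouped by total order
// m + n, and within a group the power of x increases.
constexpr std::size_t packedIndex(std::size_t m, std::size_t n) {
    return packedOffset(m + n) + m;
}

// Total number of terms with m + n <= order.
constexpr std::size_t packedSize(std::size_t order) {
    return packedOffset(order + 1);
}
```

With this mapping, the other common convention (x1y0 before x0y1) would correspond to adding `n` instead of `m` in `packedIndex`.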

r-owen · 2018-01-10T18:57:03Z

include/lsst/afw/math/polynomials/PackedIndex.h

+    }
+
+    /// Move to the next element in the packed array and return a copy of the iterator before the move.
+    PackedIndexIterator operator++(int) noexcept {


Given that pre-increment is pretty much guaranteed to be faster (especially since post-increment calls pre-increment), is it necessary to define post-increment? I'm not complaining about this code (which is nice and simple); it is more of a C++ style question.

It is necessary to meet the formal requirements of being a conformant STL InputIterator.
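For reference, a minimal sketch of the usual pattern, in which post-increment is written in terms of pre-increment. `CountingIterator` is a hypothetical stand-in used only for illustration, not the PackedIndexIterator from this PR.

```cpp
#include <cstddef>
#include <iterator>

// Hypothetical minimal input iterator that counts upward from a start value.
class CountingIterator {
public:
    using iterator_category = std::input_iterator_tag;
    using value_type = std::size_t;
    using difference_type = std::ptrdiff_t;
    using pointer = std::size_t const *;
    using reference = std::size_t const &;

    explicit CountingIterator(std::size_t value = 0) noexcept : _value(value) {}

    reference operator*() const noexcept { return _value; }
    pointer operator->() const noexcept { return &_value; }

    // Pre-increment: advance and return a reference to this iterator.
    CountingIterator & operator++() noexcept {
        ++_value;
        return *this;
    }

    // Post-increment: copy, advance via pre-increment, return the copy.
    // The input iterator requirements include expressions such as *it++,
    // so this overload has to exist even if callers prefer ++it.
    CountingIterator operator++(int) noexcept {
        CountingIterator previous(*this);
        ++(*this);
        return previous;
    }

    bool operator==(CountingIterator const & other) const noexcept { return _value == other._value; }
    bool operator!=(CountingIterator const & other) const noexcept { return !(*this == other); }

private:
    std::size_t _value;
};
```

For an iterator this small the extra copy is cheap, so the cost of post-increment is mostly a matter of style rather than performance.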

r-owen · 2018-01-10T19:01:00Z

include/lsst/afw/math/polynomials/PolynomialBasis2d.h

+
+/**
+ *  Construct a ScaledChebyshev1Basis2d that remaps the given box to [-1, 1]x[-1, 1] before
+ *  evaluating Chebyshev polynomials.


These aren't Chebyshev polynomials, right? I suggest looking for all instances of Chebyshev in this file and the 1d file.

r-owen · 2018-01-10T19:04:53Z

include/lsst/afw/math/polynomials/PolynomialFunction1d.h

+ *  This operation is not numerically stable at high order, but when fitting
+ *  standard polynomials, it is still much more stable to first fit in a scaled
+ *  basis and then (if necessary) use `simplify` to compute the coefficients
+ *  of an equivalent unscaled polynomial.


I encourage you to try to find a less ambiguous name, such as unscaled. (Note that Transform supports simplify for a very different purpose.)

It is a pity it is necessary, but I imagine it was needed in order to compute the final SIP coefficients.
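To make the operation under discussion concrete, here is a minimal sketch of converting a 1-d polynomial in a scaled variable u = s*x + t into the coefficients of an equivalent polynomial in x. The function name `expandScaled` and its signature are hypothetical, and the PR's implementation may be organized differently (for example by using stored binomial coefficients); the sketch builds the same coefficients through the Pascal-triangle recurrence.

```cpp
#include <cstddef>
#include <vector>

// Given coefficients a[n] of p(u) = sum_n a[n] * u^n with u = s*x + t,
// return coefficients c[k] such that p = sum_k c[k] * x^k.
std::vector<double> expandScaled(std::vector<double> const & a, double s, double t) {
    std::size_t const size = a.size();
    std::vector<double> c(size, 0.0);
    if (size == 0) {
        return c;
    }
    // power[k] holds the coefficient of x^k in (s*x + t)^n for the current n.
    std::vector<double> power(size, 0.0);
    power[0] = 1.0;  // (s*x + t)^0 == 1
    for (std::size_t n = 0; n < size; ++n) {
        // Accumulate a[n] * (s*x + t)^n into the result.
        for (std::size_t k = 0; k <= n; ++k) {
            c[k] += a[n] * power[k];
        }
        // Multiply power by (s*x + t), working from high order to low so
        // that each old value is read before it is overwritten.
        for (std::size_t k = size - 1; k > 0; --k) {
            power[k] = t * power[k] + s * power[k - 1];
        }
        power[0] *= t;
    }
    return c;
}
```

Each output coefficient is a sum of terms that can differ widely in magnitude and sign, which is where the high-order stability concern in the quoted documentation comes from.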

TallJimbo · 2018-07-23T20:12:08Z

@r-owen, I think this is ready for another look.

I actually took some time to look more closely into why unscaled involved a loss of precision, including adding a "safe sum" algorithm that should avoid any loss of precision beyond the truncation imposed by the actual result type (you still can't compute more digits than fit in a double, of course). The results were a bit surprising: the loss of precision in unscaled really isn't that bad, even without the safe algorithm. The safe algorithm is slightly better, so I've kept it around (especially as I'm much more confident in its worst-case performance), but the bottom line is that all of the loss of precision in the high-level test in afw happens in the fit of the unscaled polynomials (which we can't use the safe sum on, because it's a matrix inversion, not a simple sum), rather than the conversion to unscaled polynomials. The 50*DEFAULT_RTOL seen in the tests is less scary than it looks - note that this is still an accuracy of ~1E-15. As a result, I don't actually see a problem with renaming unscaled to simplified, and that's what I've done here.
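As a rough illustration of the kind of technique a "safe sum" refers to, here is a sketch of Neumaier's compensated summation. This is a stand-in chosen for illustration and is not necessarily the algorithm implemented in this PR; the name `compensatedSum` is hypothetical.

```cpp
#include <cmath>
#include <vector>

// Neumaier's variant of compensated summation: alongside the running sum,
// keep a separate correction term that collects the low-order bits lost
// each time two numbers of very different magnitude are added.
double compensatedSum(std::vector<double> const & values) {
    double sum = 0.0;
    double correction = 0.0;
    for (double v : values) {
        double const t = sum + v;
        if (std::abs(sum) >= std::abs(v)) {
            // Low-order bits of v were lost in t; recover them.
            correction += (sum - t) + v;
        } else {
            // Low-order bits of sum were lost in t; recover them.
            correction += (v - t) + sum;
        }
        sum = t;
    }
    return sum + correction;
}
```

For the input {1.0, 1e100, 1.0, -1e100}, a plain left-to-right sum returns 0.0, while this function returns 2.0. The sketch assumes strict IEEE arithmetic; aggressive floating-point optimizations such as -ffast-math can remove the compensation.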

Also, note that there is an afw PR for this, with mostly obsolete comments dating back to when the polynomial was in afw. Please take a look at that, too.

r-owen

I have some suggestions to improve interoperability but overall this looks very nice.

Do you anticipate any syntactic sugar for making a WCS from one of these (e.g. a new function)?

r-owen · 2018-07-24T21:11:43Z