Use sample standard deviation for AMx. #105

wmwv · 2019-04-04T16:00:58Z

Use 1/(n-1) instead of 1/n to define standdard deviation.
Multiplies np.std, which is 1/n by n/n-1.

Passes Jenkins.

yalsayyad · 2019-04-04T17:45:20Z

python/lsst/validate/drp/calcsrd/amx.py

+            n = len(finiteEntries)
+            if n > 1:
+                rawRmsDist = np.std(np.array(distances)[finiteEntries])
+                rmsDist = n/(n-1) * rawRmsDist  # Correct biased 1/n np.std()


I think it should be sqrt(n/n-1).

You could also just add a ddof=1

rmsDist = np.std(np.array(distances)[finiteEntries], ddof=1)

https://docs.scipy.org/doc/numpy/reference/generated/numpy.std.html

I think it should be sqrt(n/n-1).

Doh! I shouldn't code while sick. Thank you.

You could also just add a ddof=1

Much better!

I.e., use sqrt(1/(n-1)) normalization Was previously using sqrt(1/n) Add 'ddof=1' to np.std keywords to get sample std deviation.

wmwv · 2019-04-10T05:52:23Z

OK. After being a bit embarrassed by not getting a one-line change right, I lightly re-factor to make at least this part testable with a doctest.

Take another look. I think this is now ready.

wmwv · 2019-04-18T17:29:35Z

@yalsayyad Could you take another look at this?

yalsayyad · 2019-05-29T19:48:42Z

python/lsst/validate/drp/calcsrd/amx.py

-                visit[obj1], ra[obj1], dec[obj1],
-                visit[obj2], ra[obj2], dec[obj2])
-            if not distances:
+            rmsDist = calcRmsDistanceForOneObject(visit, ra, dec, obj1, obj2)


Are you sure this runs? You're calling it with a different call signature than its definition.

yalsayyad · 2019-05-29T19:56:17Z

python/lsst/validate/drp/calcsrd/amx.py

+
+    Should return None
+    >>> visit1, ra1, dec1 = [1], [10.12344], [0, 0]
+    >>> visit2, ra2, dec2 = [1], [20.00000], [0, 0]


Interesting that ra and dec don't have to be the same length.

As you likely suspect, this was a mistake. I didn't mean to test this.

I'm not surprised that it works because matchVisitComputeDistance iterates over visits and indexes ra and dec by that index, so if dec is longer then it never accesses those extra entries.

yalsayyad · 2019-05-29T20:01:40Z

python/lsst/validate/drp/calcsrd/amx.py

+        # ddof=1 to get sample standard deviation (e.g., 1/(n-1))
+        return np.std(np.array(distances)[finiteEntries], ddof=1)
+
+    return None


return None is implicit, but if you like it being obvious, it's fine.

wmwv · 2019-06-04T12:53:01Z

@yalsayyad OK, I've pulled back from the refactor+testing rabbit hole and made this a simple one-line (plus two comment line changes) request.

yalsayyad

Looks good!

wmwv requested a review from yalsayyad April 4, 2019 16:00

yalsayyad requested changes Apr 4, 2019

View reviewed changes

wmwv force-pushed the tickets/DM-18751 branch from ebcb0e1 to 023e6be Compare April 5, 2019 12:45

Use sample standard deviation for AMx.

3d3e284

I.e., use sqrt(1/(n-1)) normalization Was previously using sqrt(1/n) Add 'ddof=1' to np.std keywords to get sample std deviation.

wmwv force-pushed the tickets/DM-18751 branch from 023e6be to 3d3e284 Compare April 9, 2019 17:23

yalsayyad requested changes May 29, 2019

View reviewed changes

wmwv force-pushed the tickets/DM-18751 branch from 9daee27 to 3d3e284 Compare June 4, 2019 12:51

yalsayyad approved these changes Jun 4, 2019

View reviewed changes

wmwv merged commit 3c94890 into master Aug 21, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use sample standard deviation for AMx. #105

Use sample standard deviation for AMx. #105

wmwv commented Apr 4, 2019

yalsayyad Apr 4, 2019

wmwv Apr 5, 2019 •

edited

wmwv commented Apr 10, 2019

wmwv commented Apr 18, 2019

yalsayyad May 29, 2019

yalsayyad May 29, 2019

wmwv May 30, 2019

yalsayyad May 29, 2019

wmwv commented Jun 4, 2019

yalsayyad left a comment

Use sample standard deviation for AMx. #105

Use sample standard deviation for AMx. #105

Conversation

wmwv commented Apr 4, 2019

yalsayyad Apr 4, 2019

Choose a reason for hiding this comment

wmwv Apr 5, 2019 • edited

Choose a reason for hiding this comment

wmwv commented Apr 10, 2019

wmwv commented Apr 18, 2019

yalsayyad May 29, 2019

Choose a reason for hiding this comment

yalsayyad May 29, 2019

Choose a reason for hiding this comment

wmwv May 30, 2019

Choose a reason for hiding this comment

yalsayyad May 29, 2019

Choose a reason for hiding this comment

wmwv commented Jun 4, 2019

yalsayyad left a comment

Choose a reason for hiding this comment

wmwv Apr 5, 2019 •

edited