Add flagging functions for power spectra #157

shaunakmodak · 2018-07-13T22:38:00Z

Functions for generating greedy flagging masks, consolidating LST binned NSamples & Flag data into numpy arrays, and plotting visualizations of the flagging.

philbull

Review of in-progress PR.

philbull · 2018-07-14T06:30:39Z

hera_pspec/flags.py

+    greedy : bool
+        greedy flagging is used if true (default is False)
+
+    axis : int


This would be better off as a string, and maybe with a name change, e.g. first='row'.

philbull · 2018-07-14T06:31:22Z

hera_pspec/flags.py

+        integer array with number of samples available for each frequency channel at a given LST angle
+
+    flags : numpy.ndarray
+        binary array with 1 representing flagged, 0 representing unflagged


Is this the same as the output of UVData.get_flags(), or does that need to be modified in some way before passing it to this function?

philbull · 2018-07-14T06:32:17Z

hera_pspec/flags.py

+
+mask_generator(nsamples, flags, n_threshold, greedy=False, axis, greedy_threshold, retain_flags=True):
+    """
+    Generates a greedy flags mask from input flags and nsamples arrays


I think it's worth briefly explaining how the algorithm works here.

philbull · 2018-07-14T06:32:31Z

hera_pspec/flags.py

+    n_threshold : int
+        minimum number of samples needed for a point to remain unflagged
+
+    greedy : bool


What if it's False?

philbull · 2018-07-14T06:33:25Z

hera_pspec/flags.py

@@ -0,0 +1,83 @@
+import numpy as np
+
+mask_generator(nsamples, flags, n_threshold, greedy=False, axis, greedy_threshold, retain_flags=True):


Maybe a more descriptive function name, e.g. construct_factorizable_mask?

philbull · 2018-07-14T06:33:59Z

hera_pspec/flags.py

+        if greedy=True, the threshold used to flag rows or columns if axis=1 or 0, respectively
+
+    retain_flags : bool
+        LST-Bin Flags are left flagged even if thresholds are not met (default is True)


The data going into this are not necessarily LST binned.

philbull · 2018-07-14T06:34:52Z

hera_pspec/flags.py

+
+    # comparing the number of samples to the threshold 
+
+    for i in range(shape[0]):


Can this nested for loop perhaps be replaced by a couple of strategic calls to np.where? We can discuss this.

nkern · 2018-07-15T10:12:38Z

Have either of you seen the development in PSpecData.broadcast_dset_flags? Seems like there is some possible overlap in what these two are trying to accomplish. Perhaps some of the code in broadcast_dset_flags should be put in flags.py and broadcast_dset_flags should then call routines in flags.py...

coveralls · 2018-09-25T22:32:27Z

Coverage decreased (-0.3%) to 96.205% when pulling eedee0f on flag_module into b129d47 on master.

philbull · 2019-07-10T09:08:52Z

I've moved most of the code from this PR into hera_stats, where it will be very useful for testing different ways of flagging the data. The relevant PR is here: HERA-Team/hera_stats#22

This code could have lived in hera_pspec too, but I think it's worth trying to limit the number of peripheral features in pspec and keep it more or less focused on its core function (it's complicated enough). hera_stats is supposed to be a bigger tent, covering many different aspects of the analysis, so I think it's appropriate to move this there.

added function to do greedy flagging

02db5c8

ghost assigned shaunakmodak Jul 13, 2018

ghost added the in progress label Jul 13, 2018

shaunakmodak requested a review from philbull July 13, 2018 22:38

philbull requested changes Jul 14, 2018

View reviewed changes

Shaunak Modak and others added 7 commits August 3, 2018 14:06

additional flagging functions and tests

dcd400b

travis and plot minor modifications

424c366

Merge branch 'master' into flag_module

d815ec0

fixed some errors to pass more tests

e0fab29

flags & pspecdata changes now pass all relevant tests

e176c5a

Merge branch 'master' into flag_module

ce9d731

Make tests pass

eedee0f

ghost assigned plaplant Sep 25, 2018

Merge branch 'master' into flag_module

7f41ac2

ghost assigned philbull Mar 6, 2019

philbull closed this Jul 10, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add flagging functions for power spectra #157

Add flagging functions for power spectra #157

shaunakmodak commented Jul 13, 2018

philbull left a comment

philbull Jul 14, 2018

philbull Jul 14, 2018

philbull Jul 14, 2018

philbull Jul 14, 2018

philbull Jul 14, 2018

philbull Jul 14, 2018

philbull Jul 14, 2018

nkern commented Jul 15, 2018

coveralls commented Sep 25, 2018

philbull commented Jul 10, 2019

		@@ -0,0 +1,83 @@
		import numpy as np

		mask_generator(nsamples, flags, n_threshold, greedy=False, axis, greedy_threshold, retain_flags=True):


		# comparing the number of samples to the threshold

		for i in range(shape[0]):

Add flagging functions for power spectra #157

Add flagging functions for power spectra #157

Conversation

shaunakmodak commented Jul 13, 2018

philbull left a comment

Choose a reason for hiding this comment

philbull Jul 14, 2018

Choose a reason for hiding this comment

philbull Jul 14, 2018

Choose a reason for hiding this comment

philbull Jul 14, 2018

Choose a reason for hiding this comment

philbull Jul 14, 2018

Choose a reason for hiding this comment

philbull Jul 14, 2018

Choose a reason for hiding this comment

philbull Jul 14, 2018

Choose a reason for hiding this comment

philbull Jul 14, 2018

Choose a reason for hiding this comment

nkern commented Jul 15, 2018

coveralls commented Sep 25, 2018

philbull commented Jul 10, 2019