Hermitian pairs of visibilities for imaging and fitting #100

iancze · 2022-10-05T13:16:29Z

It would be worthwhile reviewing/auditing how we treat raw visibilities and their Hermitian pairs through the fitting and imaging processes.

We know that Hermitian pairs must be included for any routine that uses an iFFT to synthesize an image, otherwise the image will not turn out to be real.

However, it's less clear whether they should be included when we are using an image to forward model the visibilities and evaluating a likelihood on those, because there is the potential for "double counting" measurements. I don't think it makes a technical difference (since all measurements would be double counted) and would just be a scale factor adjustment relative to other regularizer settings. However, it would be good to come to a consensus on this and document it thoroughly.

iancze · 2022-12-25T01:18:03Z

The chi_square and log_likelihood routines should not accept Hermitian pairs, this would be equivalent to double-counting the data. This doesn't change the minimum of the loss function surface, but does change the parameter uncertainties that would be inferred.

It doesn't matter whether this happens in the "normalized log likelihood" functions, since the normalization over data points makes the calculation the same as whether it has Hermitian pairs or not. These normalized loss functions should not be used for inference, anyway.

jeffjennings · 2023-02-09T20:33:20Z

Note to potentially come back to: in #132 , the RandomCellSplitGridded class takes in a griddedDataset, which contains Hermitian pairs, but the selection/splitting of data in RandomCellSplitGridded doesn't maintain that Hermitian property.

iancze · 2023-02-18T00:21:41Z

I've come to the conclusion that it would be best that any routine that is evaluating a likelihood function (or a loss function) should not be involved with Hermitian pairs, whether they are "loose" or whether they are gridded.

If Hermitian pairs were to be included in the likelihood function calculation, then, in order for the calculation to be accurate, the simple \chi^2 sum of data - model would need to be replaced with a multi-variate Gaussian likelihood that included covariances between data points. There would be a perfect covariance between each point and its Hermitian pair.

The one case where Hermitian pairs do need to be included is in any routine that is creating an image using the inverse FFT. This applies to all of the routines involved in making dirty images. It does not apply to any forward-modeling routine involved in RML, like FourierCube or the NuFFT.

iancze · 2023-02-21T01:32:37Z

Merging #156 has gone a long way towards resolving this issue, but I think a little bit of work still remains regarding documentation. I think the loose-visibilities.md tutorial should be updated to be more specific about the difference in gridding operations performed by the DirtyImager vs. DataAverager, and how only the DataAverager is the correct one for inference. This tutorial should also have a discussion about the perfect covariance matrix between points and their Hermitian pairs, as described above.

iancze · 2024-01-04T14:57:17Z

#248 makes significant progress on this issue by consolidating docs in various places.

I discovered/remembered that mpol.datasets.DartBoard and its tests have numerous references to treating Hermitian pairs. This may have made sense when datasets.GriddedDataset contained Hermitian pairs, but after #156, we made the decision that gridding.DataAverager and datasets.GriddedDataset will not contain Hermitian pairs (they should only be present internally to gridding.DirtyImager).

The tests still seem to pass but there's a good change that Dartboard.build_grid_mask_from_cells contains incorrect logic.

mask = np.zeros_like(self.cartesian_qs, dtype="bool")

# uses about a Gb..., and this only 256x256
for cell_index in cell_index_list:
    qi, pi = cell_index
    q_min, q_max = self.q_edges[qi : qi + 2]
    p0_min, p0_max = self.phi_edges[pi : pi + 2]
    # also include Hermitian values
    p1_min, p1_max = self.phi_edges[pi : pi + 2] - np.pi

    # whether or not the q and phi values of the coordinate array
    # fit in the q cell and *either of* the regular or Hermitian phi cell
    ind = (
        (self.cartesian_qs >= q_min)
        & (self.cartesian_qs < q_max)
        & (
            ((self.cartesian_phis > p0_min) & (self.cartesian_phis <= p0_max))
            | ((self.cartesian_phis > p1_min) & (self.cartesian_phis <= p1_max))
        )
    )

    mask[ind] = True

return mask

And we should update the documentation and internal comments to be clear about Hermitian pairs in these routines.

iancze added the documentation Improvements or additions to documentation label Oct 5, 2022

jeffjennings mentioned this issue Feb 9, 2023

Cross-val splitter #132

Merged

This was referenced Dec 25, 2023

Library Reorganization for v0.2.1 (v0.3.0) release #242

Closed

Library Reorganization for v0.3.0 release #243

Merged

Fix loss function bugs and inconsistencies #247

Merged

[WIP] v0.3 tracking branch #248

Merged

iancze closed this as completed in #248 Apr 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hermitian pairs of visibilities for imaging and fitting #100

Hermitian pairs of visibilities for imaging and fitting #100

iancze commented Oct 5, 2022 •

edited

Loading

iancze commented Dec 25, 2022 •

edited

Loading

jeffjennings commented Feb 9, 2023

iancze commented Feb 18, 2023

iancze commented Feb 21, 2023

iancze commented Jan 4, 2024

Hermitian pairs of visibilities for imaging and fitting #100

Hermitian pairs of visibilities for imaging and fitting #100

Comments

iancze commented Oct 5, 2022 • edited Loading

iancze commented Dec 25, 2022 • edited Loading

jeffjennings commented Feb 9, 2023

iancze commented Feb 18, 2023

iancze commented Feb 21, 2023

iancze commented Jan 4, 2024

iancze commented Oct 5, 2022 •

edited

Loading

iancze commented Dec 25, 2022 •

edited

Loading