-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hermitian pairs of visibilities for imaging and fitting #100
Comments
The chi_square and log_likelihood routines should not accept Hermitian pairs, this would be equivalent to double-counting the data. This doesn't change the minimum of the loss function surface, but does change the parameter uncertainties that would be inferred. It doesn't matter whether this happens in the "normalized log likelihood" functions, since the normalization over data points makes the calculation the same as whether it has Hermitian pairs or not. These normalized loss functions should not be used for inference, anyway. |
Note to potentially come back to: in #132 , the |
I've come to the conclusion that it would be best that any routine that is evaluating a likelihood function (or a loss function) should not be involved with Hermitian pairs, whether they are "loose" or whether they are gridded. If Hermitian pairs were to be included in the likelihood function calculation, then, in order for the calculation to be accurate, the simple \chi^2 sum of data - model would need to be replaced with a multi-variate Gaussian likelihood that included covariances between data points. There would be a perfect covariance between each point and its Hermitian pair. The one case where Hermitian pairs do need to be included is in any routine that is creating an image using the inverse FFT. This applies to all of the routines involved in making dirty images. It does not apply to any forward-modeling routine involved in RML, like FourierCube or the NuFFT. |
Merging #156 has gone a long way towards resolving this issue, but I think a little bit of work still remains regarding documentation. I think the loose-visibilities.md tutorial should be updated to be more specific about the difference in gridding operations performed by the DirtyImager vs. DataAverager, and how only the DataAverager is the correct one for inference. This tutorial should also have a discussion about the perfect covariance matrix between points and their Hermitian pairs, as described above. |
#248 makes significant progress on this issue by consolidating docs in various places. I discovered/remembered that The tests still seem to pass but there's a good change that
And we should update the documentation and internal comments to be clear about Hermitian pairs in these routines. |
It would be worthwhile reviewing/auditing how we treat raw visibilities and their Hermitian pairs through the fitting and imaging processes.
We know that Hermitian pairs must be included for any routine that uses an iFFT to synthesize an image, otherwise the image will not turn out to be real.
However, it's less clear whether they should be included when we are using an image to forward model the visibilities and evaluating a likelihood on those, because there is the potential for "double counting" measurements. I don't think it makes a technical difference (since all measurements would be double counted) and would just be a scale factor adjustment relative to other regularizer settings. However, it would be good to come to a consensus on this and document it thoroughly.
The text was updated successfully, but these errors were encountered: