
Lazy covariance #179

Merged
merged 38 commits into from
Aug 6, 2019
Conversation

aewallwi
Collaborator

@aewallwi aewallwi commented Nov 9, 2018

Added some new weighting modes.

@ghost ghost assigned aewallwi Nov 9, 2018
@ghost ghost added the in progress label Nov 9, 2018
@coveralls

coveralls commented Nov 9, 2018

Coverage Status

Coverage increased (+0.1%) to 96.817% when pulling 6a10cdb on lazy_covariance into 58d860b on master.

(Resolved inline review comments on hera_pspec/pspecdata.py and hera_pspec/utils.py)
Member

@nkern nkern left a comment

Can you also update the docstring of PSpecData.R() to note that the K matrix can be I, iC, or sinc_downweight, depending on the value of self.data_weighting?

(Resolved inline review comments on hera_pspec/pspecdata.py and hera_pspec/utils.py)
@nkern
Member

nkern commented May 14, 2019

It seems like what you've done in sinc_downweight is to apply a Fourier filter, implemented as a discrete convolution whose functional form is a top-hat in the Fourier domain, i.e. a sinc function in real (frequency) space. I'm just curious whether this is something we want to have replace the K matrix in our R weighting, or keep as a separate step in our QE chain. In other words, this matrix isn't really a covariance matrix; it's a Toeplitz matrix used for a discrete convolution, so I'm not sure it's fair to put it in the slot in the QE that would normally be used for covariance matrices. Perhaps we can discuss this in person or on the next pspec telecon.
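
To make the connection concrete: a Toeplitz matrix built from a sinc kernel really does act as a discrete convolution, i.e. a top-hat filter in the delay domain. Here is a minimal standalone sketch in plain numpy; it does not call utils.sinc_downweight_mat_inv, and the channel count, widths, and kernel normalization are illustrative only.

import numpy as np
from scipy.linalg import toeplitz

# toy sketch; not utils.sinc_downweight_mat_inv
nchan, df = 32, 100e3                 # channels, channel width [Hz]
half_width = 1500e-9                  # top-hat half-width in delay [s]

# sinc kernel = inverse FT of a delay-space top-hat of the given half-width,
# sampled on the frequency grid (np.sinc(x) = sin(pi*x)/(pi*x))
nu = np.arange(nchan) * df
kernel = 2.0 * half_width * df * np.sinc(2.0 * half_width * nu)

# Toeplitz matrix <=> stationary discrete convolution with that kernel
T = toeplitz(kernel)

# a tone at 500 ns (inside the top-hat) passes nearly unchanged;
# a tone at 4000 ns (outside) is strongly suppressed
freqs = np.arange(nchan) * df
tone_in = np.exp(2j * np.pi * 500e-9 * freqs)
tone_out = np.exp(2j * np.pi * 4000e-9 * freqs)
print(np.linalg.norm(T @ tone_in) / np.linalg.norm(tone_in))    # close to 1 (edge effects aside)
print(np.linalg.norm(T @ tone_out) / np.linalg.norm(tone_out))  # much less than 1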

@nkern
Member

nkern commented May 14, 2019

@anguta Here is a neat example for you to test out.

import numpy as np
from hera_pspec import utils
from scipy import stats

# generate a bunch of white noise vectors
x = np.array([stats.norm.rvs(0, 1, 32) for i in range(100)]).T

# generate a sinc downweight matrix
weights = np.ones(32)
cmat1 = utils.sinc_downweight_mat_inv(32, 100e3, weights, filter_centers = 0., filter_widths = 1500e-9, filter_factors = 1e-9)

# apply to data
xfilt = cmat1.dot(x)

# get covariances of data, filtered data, and then dot C with sinc matrix
C = np.cov(x)
CF = np.cov(xfilt)
CF2 = cmat1.dot(C).dot(cmat1.T)

You'll find that CF and CF2 are nearly identical, which I think suggests that this is not necessarily a replacement for covariance weighting.

@aewallwi
Collaborator Author

It seems like what you've done in sinc_downweight is to apply a Fourier filter, implemented as a discrete convolution whose functional form is a top-hat in the Fourier domain, i.e. a sinc function in real (frequency) space. I'm just curious whether this is something we want to have replace the K matrix in our R weighting, or keep as a separate step in our QE chain. In other words, this matrix isn't really a covariance matrix; it's a Toeplitz matrix used for a discrete convolution, so I'm not sure it's fair to put it in the slot in the QE that would normally be used for covariance matrices. Perhaps we can discuss this in person or on the next pspec telecon.

I think that the main reason to include this matrix in our power-spectrum stage is that its action is most easily incorporated into the calculation of power-spectrum covariances and error bars if it's added as the K matrix. That said, we could propagate it into C either empirically or through direct calculation (though for the direct calculation we'd need to develop some extra machinery outside of pspec).

We should discuss this on the next telecon.
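
For context, the error-bar point follows from standard quadratic-estimator bookkeeping (schematic only; the exact normalization conventions in pspecdata.py may differ): with $\hat q_\alpha = \tfrac{1}{2}\, x_1^\dagger R^\dagger Q_\alpha R\, x_2$, the band-power covariance goes as $\mathrm{Cov}(\hat q_\alpha, \hat q_\beta) \propto \mathrm{tr}\!\left[(R C R^\dagger)\, Q_\alpha\, (R C R^\dagger)\, Q_\beta\right]$, so whatever matrix sits in the R slot is carried into the error bars automatically, whereas a filter applied outside the QE would have to be folded into C by hand.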

@aewallwi
Collaborator Author

Can you also update the doc-string of PSpecData.R() to note that the K matrix can be either I, iC or sinc_downweight based on the value of self.data_weighting?

I've added a note that sinc_downweight can be used as an option.

@aewallwi
Collaborator Author

aewallwi commented May 21, 2019

@anguta Here is a neat example for you to test out.

import numpy as np
from hera_pspec import utils
from scipy import stats

# generate a bunch of white noise vectors
x = np.array([stats.norm.rvs(0, 1, 32) for i in range(100)]).T

# generate a sinc downweight matrix
weights = np.ones(32)
cmat1 = utils.sinc_downweight_mat_inv(32, 100e3, weights, filter_centers = 0., filter_widths = 1500e-9, filter_factors = 1e-9)

# apply to data
xfilt = cmat1.dot(x)

# get covariances of data, filtered data, and then dot C with sinc matrix
C = np.cov(x)
CF = np.cov(xfilt)
CF2 = cmat1.dot(C).dot(cmat1.T)

You'll find that CF and CF2 are nearly identical, which I think suggests that this is not necessarily a replacement for covariance weighting.

Wouldn't you want to filter this by the inverse covariance?

I do think that this weighting is very similar to inverse-covariance weighting.

One simple way to think of inverse-covariance filtering is that you transform into the eigenbasis of the signal (which, in an idealized case of infinite bandwidth and spectrally flat foregrounds, would be delay space), divide the foreground modes (inside the wedge) by their variance, and then transform back. What you've done is suppress the sinusoids in the wedge by a factor of 1/Var, which is essentially what the inverse sinc weighting does (although the suppression factor is set by filter_factor, which could be very different from the actual ratio between the foreground and thermal-noise variances).
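
A toy numerical version of that eigenbasis picture (illustrative numbers only, not hera_pspec code): build a covariance whose first few smooth modes carry a large "foreground" variance on top of unit-variance noise, and check that C^-1 rescales exactly those modes by 1/variance while leaving the noise-dominated modes alone.

import numpy as np

# toy numbers; not hera_pspec code
nchan = 32
noise_var, fg_var = 1.0, 1e4

# orthonormal DCT-II basis: low-k columns are smooth, playing the role of
# low-delay "wedge" modes; high-k columns stand in for noise-dominated modes
n = np.arange(nchan)
basis = np.sqrt(2.0 / nchan) * np.cos(np.pi * np.outer(n + 0.5, n) / nchan)
basis[:, 0] /= np.sqrt(2.0)

variances = np.full(nchan, noise_var)
variances[:3] = fg_var                      # three strong foreground eigenmodes

C = basis @ np.diag(variances) @ basis.T    # covariance with that eigenstructure
R = np.linalg.inv(C)                        # inverse-covariance weighting

# R scales eigenmode k by 1/variances[k]: ~1e-4 for the foreground modes, ~1 for noise modes
for k in (0, 1, 10):
    v = basis[:, k]
    print(k, np.linalg.norm(R @ v) / np.linalg.norm(v))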

@aewallwi aewallwi requested a review from philbull May 29, 2019 02:19
(Resolved inline review comment on hera_pspec/pspecdata.py)
@nkern
Member

nkern commented May 29, 2019

@anguta Could you also link your lazy covariance PDF or Overleaf URL here for posterity? Thanks.

Collaborator

@philbull philbull left a comment

Looks good. Just a few questions about things I'm confused about, and a query about how r_params is specified.

(Resolved inline review comments on hera_pspec/pspecdata.py)
@aewallwi
Collaborator Author

aewallwi commented Jun 11, 2019

@anguta Could you also link your lazy covariance PDF or Overleaf URL here for posterity? Thanks.

I'm working on a tutorial notebook that also describes some of the underlying math and why the filter is effective. I'm going to add it to uvtools in a new pull request there (which will let you use lazy covariance weighting as a general cleaning method for data inspection).

@nkern
Member

nkern commented Jun 11, 2019

@anguta OK, sounds good. Do we want to hold off on this PR so that you can just call the uvtools capabilities instead of duplicating code here? I think that's probably the best approach.

Also, we still need to figure out how to propagate some amount of r_params metadata to the file history without overloading it; perhaps we could make it optional.

@aewallwi
Collaborator Author

aewallwi commented Jun 12, 2019

@anguta OK, sounds good. Do we want to hold off on this PR so that you can just call the uvtools capabilities instead of duplicating code here? I think that's probably the best approach.

Also, we still need to figure out how to propagate some amount of r_params metadata to the file history without overloading it; perhaps we could make it optional.

Yeah, let's wait until the uvtools code is merged in so I can reference it from hera_pspec.

For the history issue, maybe we can use one baseline per redundant group for now, as you suggested.

cheers,

-Aaron

@aewallwi
Collaborator Author

I've added linear cleaning to uvtools and removed the duplicate code here.

Collaborator

@philbull philbull left a comment

Looks good, just a couple of things to clean up.

(Resolved inline review comments on hera_pspec/pspecdata.py and hera_pspec/tests/test_pspecdata.py)
@philbull
Collaborator

@anguta Also: the new notebook examples/PS_estimation_example_inverse_weights.ipynb seems to be a duplicate of the existing examples/PS_estimation_example.ipynb, but with a couple of small changes. I don't think it should duplicate the whole thing, as this will make it hard for users to see what the differences are! So, perhaps it should either focus on showing a specific example of how to use the new weighting scheme, or fully replace the old PS estimation example. (I have a preference for the former; it would be nice to show a simple comparison, e.g. using Nick's example code!)

Collaborator

@philbull philbull left a comment

Apart from a few minor typos in the example notebook, this is ready to go.

@aewallwi
Collaborator Author

aewallwi commented Jul 8, 2019

I've also added a new r_params attribute for UVPSpec objects that stores the r_params as a string (in a compressed format that avoids storing many copies of the same dictionary), along with a decompression method to retrieve the original dictionary.
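
For reference, here is a minimal sketch of the kind of compression described above; the helper names, parameter keys, and string layout are illustrative, not necessarily what the uvpspec implementation actually does. The idea is to de-duplicate the per-key parameter dictionaries, store one JSON copy of each unique set, and map keys onto indices.

import json

# hypothetical helpers, for illustration only (not the uvpspec implementation)
def compress_r_params(r_params):
    """Encode {key: param_dict} as one string without repeating identical dicts."""
    unique, index, mapping = [], {}, {}
    for key, params in r_params.items():
        blob = json.dumps(params, sort_keys=True)
        if blob not in index:              # first time this parameter set is seen
            index[blob] = len(unique)
            unique.append(params)
        mapping[str(key)] = index[blob]    # key -> index of its shared parameter set
    return json.dumps({"unique": unique, "mapping": mapping})

def decompress_r_params(blob):
    """Invert compress_r_params, returning {key: param_dict}."""
    packed = json.loads(blob)
    return {key: packed["unique"][idx] for key, idx in packed["mapping"].items()}

# e.g. 100 baseline keys sharing one filter configuration store that dict only once
r_params = {"bl%d" % i: {"filter_centers": [0.], "filter_widths": [1500e-9],
                         "filter_factors": [1e-9]}
            for i in range(100)}
assert decompress_r_params(compress_r_params(r_params)) == r_params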

Collaborator

@philbull philbull left a comment

Looks good, but a few loose ends to tie up:

  • a couple of docstrings could be a little clearer/more detailed
  • it's not clear how r_params is treated when UVPSpec objects are added together
  • I think the get_r_params test should be a bit more real-worldy

(Resolved inline review comments on hera_pspec/uvpspec.py and hera_pspec/tests/test_uvpspec.py)
@aewallwi
Collaborator Author

@anguta Here is a neat example for you to test out.

import numpy as np
from hera_pspec import utils
from scipy import stats

# generate a bunch of white noise vectors
x = np.array([stats.norm.rvs(0, 1, 32) for i in range(100)]).T

# generate a sinc downweight matrix
weights = np.ones(32)
cmat1 = utils.sinc_downweight_mat_inv(32, 100e3, weights, filter_centers = 0., filter_widths = 1500e-9, filter_factors = 1e-9)

# apply to data
xfilt = cmat1.dot(x)

# get covariances of data, filtered data, and then dot C with sinc matrix
C = np.cov(x)
CF = np.cov(xfilt)
CF2 = cmat1.dot(C).dot(cmat1.T)

You'll find that CF and CF2 are nearly identical, which I think suggests that this is not necessarily a replacement for covariance weighting.

I'm a little bit confused by this example: the filtering matrix is supposed to be the inverse of sinc_downweight_mat_inv. At any rate, I'd naively expect CF2 and CF to be the same, since the first is the covariance of the data after applying a matrix multiplication (basically a change of basis), and the second applies that same transformation matrix to the covariance in the original basis, which is equivalent to calculating the covariance in the new basis.
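
The identity at work here, written out as a standalone check (with a random matrix and random data rather than the sinc matrix and noise draws above): np.cov centers each row, so the sample covariance transforms under any fixed linear operator exactly as the true covariance does, which is why CF and CF2 agree by construction.

import numpy as np

# standalone check, independent of the sinc matrix above
rng = np.random.default_rng(0)
x = rng.normal(size=(32, 100))   # any data matrix (rows = variables)
R = rng.normal(size=(32, 32))    # any fixed linear operator

# centered(R @ x) == R @ centered(x), hence cov(R @ x) == R @ cov(x) @ R.T
assert np.allclose(np.cov(R @ x), R @ np.cov(x) @ R.T)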

Member

@nkern nkern left a comment

looks good, thanks Aaron!

@aewallwi
Collaborator Author

Looks good, but a few loose ends to tie up:

* a couple of docstrings could be a little clearer/more detailed

* it's not clear how `r_params` is treated when UVPSpec objects are added together

* I think the `get_r_params` test should be a bit more real-worldy

I've addressed all of the issues mentioned here.

cheers,

-Aaron

@aewallwi aewallwi closed this Jul 24, 2019
@aewallwi aewallwi reopened this Jul 24, 2019
Collaborator

@philbull philbull left a comment

Looks good. Just minor typos.

(Resolved inline review comments on hera_pspec/uvpspec.py and hera_pspec/uvpspec_utils.py)
@philbull philbull merged commit 7708f92 into master Aug 6, 2019
@philbull philbull deleted the lazy_covariance branch August 6, 2019 07:33