Add auto spectra #640
Conversation
Codecov Report: Base 98.26% // Head 98.36% // Increases project coverage by +0.09%.
Additional details and impacted files:
@@ Coverage Diff @@
## main #640 +/- ##
==========================================
+ Coverage 98.26% 98.36% +0.09%
==========================================
Files 34 34
Lines 5080 5131 +51
==========================================
+ Hits 4992 5047 +55
+ Misses 88 84 -4
@david-deboer do you have ideas around testing that it doesn't error with sqlite? 2 uncovered lines here associated with the support for sqlite.
hera_mc/autocorrelations.py
Antenna number. Part of primary_key.
antenna_feed_pol : String Column
    Feed polarization, either 'e' or 'n'. Part of primary_key.
spectrum : Float Column
An assumption is made here about the channel/frequency mapping. I'm fine with this if you are.
I'm not sure what you're getting at. That the frequencies aren't in their own column? That would be pretty redundant because they'd be the same for all rows.
I mostly agree with bryna here, but we have done 8k and 16k channel autos before (as a test, if I remember correctly, but we still did it). Overall, though, it seems like a bit of wasted space to save it all the time. Even then, we don't spit out the frequencies in redis anywhere; I'm pretty sure we'd always have to reconstruct them using a linspace and an average (which is what we'd ask a user to do anyway). We could always write down the algorithm somewhere convenient for anyone who is interested.
We could add a column for frequency resolution if that helped. I don't actually know where to find that in redis...
I added something to the table definition to describe how to get the frequencies. The basic answer is that the bandwidth (0 -> 250 MHz) is fixed, so the frequencies can be computed from that and from the number of entries in the spectrum.
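The reconstruction described above (and the linspace-and-average recipe mentioned earlier in the thread) can be sketched as follows. The fixed 0 → 250 MHz bandwidth comes from the discussion here; the assumption that frequencies sit at channel centers is mine, not something the table definition pins down:

```python
import numpy as np


def spectrum_frequencies(n_channels, bandwidth_hz=250e6):
    """Reconstruct channel frequencies for an auto spectrum.

    Assumes the band runs 0 -> bandwidth_hz and that each frequency is
    the center of its channel (the center convention is an assumption).
    """
    # n_channels + 1 edges spanning the band, then average adjacent edges
    edges = np.linspace(0.0, bandwidth_hz, n_channels + 1)
    return 0.5 * (edges[:-1] + edges[1:])


freqs = spectrum_frequencies(6144)
```

Because the bandwidth is fixed, the number of entries in the stored spectrum is the only input needed.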
I can't comment on the finer points of the sql array definition, but it looks reasonable to me. We continue here the practice of leaving channel mapping to the wits of the end consumer. But in the interest of time I suggest we test now and debate later.
The only real question I have is whether we want to specify the precision of the `Float` column. Currently we know these autos will always be a float32, and the default `Float` in sqlalchemy does not seem to have a specified precision. From a little bit of research, it seems like this will default to double in psql:
https://stackoverflow.com/questions/62938757/how-to-force-sqalchemy-float-type-to-real-in-postgres
Assuming this stackoverflow's assertion about casting to double is correct, we're not really gaining any information by upcasting to 64-bit, just using double the space. I think I would argue for the type of this `Array` to be `Real`.
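The space argument can be illustrated with nothing but the stdlib: a float32 already carries all the precision the correlator produces, so widening it to a postgres double just doubles the bytes per sample without recovering any information. A small sketch (the sample value is made up):

```python
import struct

value = 3.14159265  # a made-up auto-spectrum amplitude

# REAL / float32: 4 bytes per sample
as_real = struct.pack("<f", value)
# DOUBLE PRECISION / float64: 8 bytes per sample
as_double = struct.pack("<d", value)

# Round-tripping through float32 keeps ~7 significant digits; upcasting
# the float32 result to float64 afterwards adds no information back.
roundtrip = struct.unpack("<f", as_real)[0]
```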
Isn't the only way to hit those two lines to have a non-postgresql database (e.g. a test sqlite version in data/test_data) that gets used as the database in the tests?
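For the sqlite coverage question, a minimal sketch of the kind of test that would exercise a sqlite-backed path, using the stdlib `sqlite3` module with an in-memory database (the table and column names here are illustrative, not the actual hera_mc schema):

```python
import sqlite3

# In-memory sqlite database standing in for the test database
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE auto_spectrum ("
    "  antenna_number INTEGER,"
    "  antenna_feed_pol TEXT,"
    "  spectrum TEXT)"  # sqlite has no array type, so spectra go in as text
)
# A spectrum serialized to a comma-separated string for the text column
spectrum = ",".join(str(v) for v in [1.0, 2.5, 3.25])
conn.execute(
    "INSERT INTO auto_spectrum VALUES (?, ?, ?)", (10, "e", spectrum)
)
row = conn.execute("SELECT spectrum FROM auto_spectrum").fetchone()
values = [float(v) for v in row[0].split(",")]
conn.close()
```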
Force-pushed from 67cc95e to eba1eaf
Force-pushed from 9ca27b9 to cd4c463
I have a few extra questions now
+ more debugging cleanup
Thanks for the extra work on this. Looks good from my end. I don't mind a little hand waving on the sqlite coverage.
Description
Add a new table (`hera_auto_spectrum`) with the full autocorrelation spectra (not just the medians, which are currently recorded in the `hera_autos` table).
This new table uses an Array type column in postgres and a character type column in SQLite. In order to make sure this works properly, I added testing related to spinning up SQLite databases, which we never did before in our tests. In the process I found and fixed some bugs and made some significant changes to `cm_gen_sqlite.py`.
I also changed the CI to use mamba instead of conda because I was annoyed with how slow it was.
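The Array-in-postgres / text-in-SQLite split implies some serialization on the SQLite side. A hedged sketch of round-tripping a spectrum through a string (the helper names and the comma-separated format are made up for illustration, not taken from `cm_gen_sqlite.py`):

```python
import numpy as np


def spectrum_to_text(spectrum):
    """Serialize a float spectrum for a character-type sqlite column."""
    return ",".join(repr(float(v)) for v in spectrum)


def text_to_spectrum(text):
    """Recover the spectrum from its text representation as float32."""
    return np.array([float(v) for v in text.split(",")], dtype=np.float32)


spec = np.array([0.5, 1.25, 2.0], dtype=np.float32)
recovered = text_to_spectrum(spectrum_to_text(spec))
```

Whatever format is actually used, a round-trip test like this is the property the new SQLite testing needs to hold.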
Motivation and Context
I think this closes an issue somewhere, but I can't find it.
Types of changes
Checklist:
Schema change:
changes when this is merged.