DM-22277: Gen3 object tables #490
Some minor comments below, otherwise looks good (assuming it works with ci_hsc_gen3).
catalogs[band] = {}
catalogs[band]['meas'] = measDict[band]['meas']
catalogs[band]['forced_src'] = forcedSourceDict[band]['forced_src']
catalogs[band]['ref'] = inputs['inputCatalogRef']
Would this read a bit better with:
catalogs[band] = {'meas': measDict[band]['meas'],
                  'forced_src': forcedSourceDict[band]['forced_src'],
                  'ref': inputs['inputCatalogRef']}
dataId = butlerQC.quantum.dataId
parq = self.run(catalogs=catalogs, tract=dataId['tract'], patch=dataId['patch'])
outputs = pipeBase.Struct(outputCatalog=parq.toDataFrame())
I don't know if there's a big overhead to converting to and from parquet and dataFrames, but I think it's odd that the last thing the run method does is convert the dataframe to parquet, and the first thing this code does is convert it back to a dataframe. I know that's what the gen2 code expects, but in terms of planning for gen2 removal, I think that run should return a dataframe and runDataRef could then parquet-ify the table?
I agree. ParquetTable will disappear along with Gen2.
Was avoiding API changes on this one, but if the reviewer advocates for API changes, well then I guess I have to :)
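A minimal sketch of the split proposed above, assuming pandas is available. The names ParquetWrapper, WriteObjectTableSketch, and runGen2 are hypothetical stand-ins, not the actual pipe_tasks classes, and the column labelling is only an illustration of what run might do. The point is that run returns a plain DataFrame and only the Gen2 entry point wraps it:

```python
import pandas as pd


class ParquetWrapper:
    """Hypothetical stand-in for the Gen2 ParquetTable wrapper."""

    def __init__(self, dataFrame):
        self._df = dataFrame

    def toDataFrame(self):
        return self._df


class WriteObjectTableSketch:
    """Hypothetical task showing the proposed division of labour."""

    def run(self, catalogs, tract, patch):
        # catalogs maps band -> dataset name -> DataFrame (an assumption
        # made for this sketch; the real inputs are source catalogs).
        frames = []
        for band, tables in catalogs.items():
            for dataset, table in tables.items():
                labeled = table.assign(tractId=tract, patchId=patch)
                # Label every column with (dataset, band, column) so the
                # merged table keeps the inputs distinguishable.
                labeled.columns = pd.MultiIndex.from_tuples(
                    [(dataset, band, name) for name in labeled.columns],
                    names=("dataset", "band", "column"),
                )
                frames.append(labeled)
        # Return a plain DataFrame; no parquet round trip here.
        return pd.concat(frames, axis=1)

    def runGen2(self, catalogs, tract, patch):
        # Only the Gen2-style entry point wraps the result, so removing
        # Gen2 later means deleting this method and the wrapper.
        return ParquetWrapper(self.run(catalogs, tract, patch))
```

With this layout the Gen3 caller can pass the result of run straight into its output struct, without calling toDataFrame().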
dimensions=("tract", "patch", "band", "skymap"),
storageClass="SourceCatalog",
name="{coaddName}Coadd_meas",
multiple=True
I guess there's no advantage to these being deferLoad=True because we need to hold them all in memory at the same time anyway?
That's right. We're vertically concatenating them.
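To illustrate why deferring the load gains nothing here, a rough sketch assuming pandas. LazyHandle is a hypothetical stand-in for a deferred-load handle, not the butler's actual class. Concatenation needs every table at once, so each handle has to be resolved before the call anyway:

```python
import pandas as pd


class LazyHandle:
    """Hypothetical deferred handle: nothing is read until get() is called."""

    def __init__(self, loader):
        self._loader = loader

    def get(self):
        return self._loader()


def concatenate(handles):
    # pd.concat needs all input tables together, so every handle is
    # resolved first; deferring the load only postpones it.
    tables = [handle.get() for handle in handles]
    return pd.concat(tables, axis=0)
```

Deferred loading pays off when inputs can be processed one at a time and released, which is not the case for a single concatenated output.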