Feature domain for phenomenological and ontological classes #141

bfhealy · 2022-10-28T22:16:24Z

Currently, DNN training on our phenomenological classes uses 40 features generated from the ZTF light curves. The ontological classifiers are trained on all of those features along with 34 more. These additional features consist of AllWISE, Gaia and Pan-STARRS magnitudes along with the ra, dec, ccd and quadrant of the source.

How should we proceed with these feature domains going forward? I don't think the coordinates and silicon position of the source should be part of the training (especially for the ontological branch where they're used now), since those features should not inform the intrinsic nature of the source.

Also, the inclusion of additional features for the ontological training means that the distinction between the phenomenological eclipsing and ontological binary star classes may be more complex than mentioned in #133. It would be helpful to know how the human classifiers treated these two classes during their labeling.


AshishMahabal · 2022-10-28T22:23:39Z

I agree that ra, dec, ccd and quadrant should not be part of the training. I am surprised they were. Were they explicitly used?

bfhealy · 2022-10-28T22:30:19Z

Yes. In config.yaml, under features:, they are explicitly listed under the ontological: header. They were commented out under an older-looking header in the list (ontological_d13:), but under the header that the training currently uses, they are uncommented.

AshishMahabal · 2022-10-28T22:36:31Z

They should definitely not be used. In xgboost I have not used them, and based on the old header you mention, I am certain that Dima wouldn't have used them either. We were pulling parameters like quad because at one point we were looking at bogus objects as a function of quadrant to understand the types of boguses.

bfhealy · 2022-10-28T22:38:40Z

That makes sense; I can see how those features would be useful for identifying bogus sources. I'll comment out ra, dec, ccd and quad in the ontological feature list.
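As a rough sketch of what excluding these features amounts to at training time, the snippet below filters positional entries out of a feature list before it reaches the model. The helper `drop_positional` and the non-positional feature names are illustrative assumptions, not taken from config.yaml or the training code; only ra, dec, ccd and quad come from this thread.

```python
# Positional / detector features named in this thread. They describe where the
# source landed on the sky and on the camera, not what the source is.
POSITIONAL_FEATURES = frozenset({"ra", "dec", "ccd", "quad"})


def drop_positional(features, excluded=POSITIONAL_FEATURES):
    """Return the feature names in their original order, minus the excluded ones.

    Hypothetical helper for illustration; the project itself handles this by
    commenting the entries out of the feature list in config.yaml.
    """
    return [name for name in features if name not in excluded]


# Illustrative feature list (the non-positional names are made up for the example).
ontological_features = ["period", "amplitude", "ra", "dec", "ccd", "quad", "gaia_parallax"]
training_features = drop_positional(ontological_features)
```

Filtering in code rather than in the config is only one option; commenting the entries out, as proposed above, keeps the config as the single record of what each classifier was trained on.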

bfhealy · 2022-11-17T00:04:22Z

This issue is also specifically relevant to the AGN class, for which the Gaia parallax should not be applicable.

bfhealy added the question label (Further information is requested) Oct 28, 2022

bfhealy mentioned this issue Oct 28, 2022

EPIC Scope catalogue #54

Open

48 tasks

bfhealy mentioned this issue Oct 28, 2022

'Binary star' vs 'eclipsing' labels #133

Closed

bfhealy linked a pull request Oct 31, 2022 that will close this issue

Remove ra/dec/field/ccd/quad from training #142

Closed

bfhealy closed this as completed Aug 4, 2023
