Extract metadata from FITS files for searching and filtering #5

Open
zonca opened this issue Feb 2, 2023 · 2 comments

Comments

@zonca
Member

zonca commented Feb 2, 2023

> Thanks! I would like to extract metadata from the FITS headers and store it in the portal for querying and filtering.

Where is the best location for the metadata?

I suggest creating JSON files alongside the FITS and DAT files. For example, datareleases/npipe6v20/lowres/npipe6v20_030_map_ns0016.fits would have a companion file datareleases/npipe6v20/lowres/npipe6v20_030_map_ns0016.fits.json.
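A minimal sketch of how such a per-file JSON could be generated, using only the Python standard library. It reads just the primary header and handles only simple card types. The function names (`parse_value`, `read_primary_header`, `write_sidecar`) are hypothetical, and a full FITS library such as astropy.io.fits would likely be the more robust choice, especially if the keywords of interest live in extension headers.

```python
import json
from pathlib import Path

CARD_LEN = 80      # every FITS header card is 80 ASCII characters
BLOCK_LEN = 2880   # headers are padded to a multiple of 2880 bytes


def parse_value(raw):
    """Convert the value field of a header card to a Python value (simplified)."""
    raw = raw.strip()
    if raw.startswith("'"):
        # String value: ends at the first quote that is not doubled ('').
        i, chars = 1, []
        while i < len(raw):
            if raw[i] == "'":
                if raw[i + 1:i + 2] == "'":
                    chars.append("'")
                    i += 2
                    continue
                break
            chars.append(raw[i])
            i += 1
        return "".join(chars).rstrip()
    raw = raw.split("/", 1)[0].strip()  # drop the trailing comment
    if raw in ("T", "F"):
        return raw == "T"
    for cast in (int, float):
        try:
            return cast(raw)
        except ValueError:
            pass
    return raw or None  # unparsed values are kept as strings


def read_primary_header(path):
    """Return the keyword/value cards of the primary header as a dict."""
    header = {}
    with open(path, "rb") as f:
        while True:
            block = f.read(BLOCK_LEN)
            if len(block) < BLOCK_LEN:
                raise ValueError("no END card found")
            for start in range(0, BLOCK_LEN, CARD_LEN):
                card = block[start:start + CARD_LEN].decode("ascii")
                keyword = card[:8].strip()
                if keyword == "END":
                    return header
                if card[8:10] == "= ":  # skips COMMENT, HISTORY, blank cards
                    header[keyword] = parse_value(card[10:])


def write_sidecar(fits_path):
    """Write <name>.fits.json next to <name>.fits and return its path."""
    fits_path = Path(fits_path)
    sidecar = fits_path.with_name(fits_path.name + ".json")
    sidecar.write_text(json.dumps(read_primary_header(fits_path), indent=2))
    return sidecar
```

Calling `write_sidecar` on each FITS file in a directory would produce the `.fits.json` files described above. Which header keywords are actually worth keeping is still an open question.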

These would go in the Globus collection with the data files. We can then aggregate some of the metadata from the individual files into the dataset metadata (e.g., datareleases/npipe6v20/lowres/metadata.json), along with the core elements, like title, creation date, and publisher. This metadata can be combined to update the dataset descriptions and the table of files, and to populate a searchable JSON catalog to use with lunr.js or similar.
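The aggregation step could look roughly like the following sketch. The `core` fields, the selected header keys, the layout of `metadata.json`, and the function name are assumptions for illustration, not an agreed schema.

```python
import json
from pathlib import Path


def build_dataset_metadata(dataset_dir, core, keys=("TELESCOP", "NSIDE")):
    """Combine per-file *.fits.json files into one dataset-level metadata.json.

    `core` holds the dataset-wide elements (title, creation date, publisher);
    `keys` is a hypothetical selection of header keywords to expose per file.
    """
    dataset_dir = Path(dataset_dir)
    files = []
    for sidecar in sorted(dataset_dir.glob("*.fits.json")):
        header = json.loads(sidecar.read_text())
        # Recover the data file name by dropping the trailing ".json".
        entry = {"file": sidecar.name[:-len(".json")]}
        entry.update({k: header[k] for k in keys if k in header})
        files.append(entry)
    metadata = dict(core, files=files)
    (dataset_dir / "metadata.json").write_text(json.dumps(metadata, indent=2))
    return metadata
```

The `files` list is a flat array of records, so it could feed both the table of files and a client-side search index without further reshaping.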

Originally posted by @rpwagner in #4 (comment)

@rpwagner
Collaborator

rpwagner commented Feb 3, 2023

@zonca do you want to have a short call next week to talk about dataset organization? Right now, there are a lot of files in each directory and I don't know enough about the data to suggest how to break them up. I'm interested in what a reasonable breakdown would be, so that users can find the datasets and files they need for their analysis or other work.

@zonca
Member Author

zonca commented Feb 23, 2023

We plan to have another dataset ready soon to be added to the portal.
I think it will be easier to design an overall organization for the data once we have two datasets, so let's wait for that.
