Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement OODT Metadata Extractors for pycoal data products #6

Open
lewismc opened this issue Feb 27, 2018 · 12 comments
Open

Implement OODT Metadata Extractors for pycoal data products #6

lewismc opened this issue Feb 27, 2018 · 12 comments
Labels
enhancement filemgr OODT File Manager Service help wanted metadata metadata modeling
Milestone

Comments

@lewismc
Copy link
Member

lewismc commented Feb 27, 2018

Now that we have basic file management set up, we should look at implementing OODT Metadata Extractors for all products consumed by (e.g. AVIRIS-C/NG imagery and accompanying .hdr files, the spectral libraries and hydrology datasets) pycoal as well as generated by pycoal e.g. mineral, mining and environment classifications. This will product a much richer metadata model enabling us to improve the cataloguing functionality.

@bdegley4789 can you please list all of the resources pycoal consumes and products it generates?

@thomkenn can you please start working on this when you get a chance?

@bdegley4789
Copy link
Member

Sorry for the late reply. I missed the email for this notification

@bdegley4789
Copy link
Member

bdegley4789 commented Mar 2, 2018

Pycoal Consumes
Input header File = ang20150420t182050_corr_v1e_img.hdr
Input image = ang20150420t182050_corr_v1e_img
Can be found here: ftp://avng.jpl.nasa.gov/AVNG_2015_data_distribution/L2/ang20150420t182050_rfl_v1e/

Spectral Library header file = s06av95a_envi.hdr
Can be found here: ftp://ftpext.cr.usgs.gov/pub/cr/co/denver/speclab/pub/spectral.library/splib06.library/Convolved.libraries/s06av95a_envi.hdr

vector file = Shape/NHDFlowline.shp
Can be found here: ftp://rockyftp.cr.usgs.gov/vdelivery/Datasets/Staged/Hydrography/NHD/State/HighResolution/Shape/NHD_H_New_Mexico_Shape.zip

Pycoal Generates
rgb file = ang20150420t182050_corr_v1e_img_rgb.hdr & ang20150420t182050_corr_v1e_img_rgb.img

classified file = ang20150420t182050_corr_v1e_img_class.hdr & ang20150420t182050_corr_v1e_img_class.img

mining file = ang20150420t182050_corr_v1e_img_class_mining.hdr & ang20150420t182050_corr_v1e_img_class_mining.img

environmental correlation file = ang20150420t182050_corr_v1e_img_class_mining_NHDFlowline_correlation.hdr & ang20150420t182050_corr_v1e_img_class_mining_NHDFlowline_correlation.img

ang20150420t182050_corr_v1e_img_class_mining_NHDFlowline_proximity.hdr &
ang20150420t182050_corr_v1e_img_class_mining_NHDFlowline_proximity.img

ang20150420t182050_corr_v1e_img_class_mining_NHDFlowline_correlation.hdr &
ang20150420t182050_corr_v1e_img_class_mining_NHDFlowline_correlation.img

All of these generated currently staged here https://drive.google.com/drive/folders/1YVhdLxvrZE3eC97OEXathLMJRgWt8haO

@lewismc
Copy link
Member Author

lewismc commented Mar 5, 2018

Hi @thomkenn did @bdegley4789 pass on the instruction to begin working on this? I want to make sure we are making progress on COAL-SDS, this means some tanglible progress from week to week.
Each issue left in this repository should take no longer than 1 week (12 working hours) to complete as all of the relevant documentation and resources e,g, community mailing lists are available. Please keep me in the loop with progress, thank you.

@thomkenn
Copy link
Contributor

thomkenn commented Mar 6, 2018

Yes, he has brought me up to speed, and we are fully committed to making real week to week progress!

@lewismc
Copy link
Member Author

lewismc commented Mar 6, 2018

OK doke, if you could try to get a pull request completed for this coming week it would be great. Also, please make sure to augment the meeting notes ahead of time with any progress. Thank you I appreciated it.

@lewismc
Copy link
Member Author

lewismc commented Mar 7, 2018

@thomkenn I should also say, if there is anything you are stuck on, PLEASE let me know. I can put aside time for an hour or so to resolve any issues. The idea is for the work to go on between meeting such that we have a quick tag up on Thursdays. Please keep me up-to-speed with what is going on such that I can help keep things on track. Thank you.

@lewismc
Copy link
Member Author

lewismc commented Mar 13, 2018

How are things coming along @thomkenn ?

@lewismc
Copy link
Member Author

lewismc commented Mar 13, 2018

PING @thomkenn

@thomkenn
Copy link
Contributor

thomkenn commented Mar 15, 2018

gah, sorry, didnt see your ping, these keep going to my spam folder for some reason. i have a possible commit i want to discuss in the meeting, as well as an error i saw while testing.

@lewismc
Copy link
Member Author

lewismc commented Mar 15, 2018 via email

@lewismc
Copy link
Member Author

lewismc commented Mar 16, 2018

@thomkenn dont work on this I will do it. Skip over to capstone-coal/pycoal#105 and work with @bdegley4789 thanks.

@lewismc lewismc added filemgr OODT File Manager Service metadata metadata modeling labels Sep 14, 2018
@lewismc
Copy link
Member Author

lewismc commented Sep 28, 2018

We need to add the following commands to the crawler_launcher tool execution
This is documented at the following
https://cwiki.apache.org/confluence/display/OODT/Using%2BTikaCmdLineMetExtractor

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement filemgr OODT File Manager Service help wanted metadata metadata modeling
Projects
None yet
Development

No branches or pull requests

3 participants