Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Processing: as new data files get generated, upload them to object storage #67

Open
Tracked by #62
franTarkenton opened this issue May 9, 2023 · 2 comments
Open
Tracked by #62
Assignees
Labels
enhancement New feature or request

Comments

@franTarkenton
Copy link
Member

related to #65

The download step is now complete for both Viirs and Modis data. The next step is to rework how the data processing script step works.

This ticket will:

  1. modify the processing of the data so that immediately after new intermediate data is generated it will get copied up to object storage.
  2. The script will always check what data is in object storage first, and pull it down before generating locally.
@franTarkenton franTarkenton added the enhancement New feature or request label May 9, 2023
@franTarkenton franTarkenton self-assigned this May 9, 2023
@franTarkenton
Copy link
Member Author

Made some progress on this with the download step, and for the processing / plotting steps for the modis runs. After the amount of time invested and the amount of work remaining made the decision to pivot out of this task and focus more on getting the pipeline to run based on missing data.

Work completed in this ticket includes:

  • creation of a class to help manage the inputs / output paths of various data sets that are generated by the process
  • detailed analysis of the code in an attempt to understand what data files get created by all the different steps that make up the daily pipeline.
  • refactor to have various steps to check to see if they have already run based on the data that exists in object storage.
  • removing auto delete of previously generated data.
  • more documentation

@franTarkenton
Copy link
Member Author

this ticket is not complete, but am closing, will come back to it if required

@franTarkenton franTarkenton added zen_reorg temp label to help with migration to ghp and removed zen_reorg temp label to help with migration to ghp labels Dec 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: Review / QA
Development

No branches or pull requests

1 participant