-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Download, process, and covariate calculation package- next steps #5
Comments
Download Functions
Processing/Covariate Calculation Functions
Discussion
|
Thanks Mitchell
|
Whether the additional data sources are included in the first or a later version, I think our immediate priority is cleaning up what we have. It will be easier to add additional download, process, and calculation functions once we have standard function names and more robust unit/integration tests. |
The code naming consistency can be based on the output of each function. The The The |
Mitchell To Do (January 17, 2023) -
|
|
We mainly use circular buffer polygons, which are generated inside each |
Discussion 2024/01/2018 @sigmafelix @Spatiotemporal-Exposures-and-Toxicology
|
For Point 1, MODIS-related auxiliary functions are separated from the main functions then placed in R/calculate_covariates_support.R. The file and function names will be changed in the next PR. I guess For Point 4 (helper function), I think a spatial data class conversion function will be necessary. Functions written earlier (mostly by me) usually accept For Point 5, I think unifying common argument names across |
@sigmafelix
The "cleaning" does not have to include manipulation/calculation/alteration of the data values in any way (ignore contradiction with I think generating cleaned data in a two step workflow from Lots of rambling thoughts so I will refine and post again with an example. |
@sigmafelix |
Performance checks for spatial subsetting with
|
TODO
cf. Argument lists by functions in # calc_* function arguments
fun path sites id_col product name_covariates radius subdataset fun_summary nthreads package_list_add
1 calc_ecoregion TRUE TRUE TRUE NA NA NA NA NA NA NA
2 calc_koppen_geiger TRUE TRUE TRUE NA NA NA NA NA NA NA
3 calc_modis TRUE TRUE TRUE TRUE TRUE TRUE TRUE TRUE TRUE TRUE
4 calc_nei TRUE TRUE TRUE NA NA NA NA NA NA NA
5 calc_nlcd_ratio TRUE TRUE NA NA NA TRUE NA NA NA NA
6 calc_temporal_dummies NA TRUE TRUE NA NA NA NA NA NA NA
7 calc_tri TRUE TRUE TRUE NA NA TRUE NA NA NA NA
export_list_add year county_shp sites_epsg domain_year
1 NA NA NA NA NA
2 NA NA NA NA NA
3 TRUE NA NA NA NA
4 NA TRUE TRUE TRUE NA
5 NA TRUE NA NA NA
6 NA NA NA NA TRUE
7 NA NA NA TRUE TRUE
# modis_* function arguments
fun path product nsds fun_agg paths date_in regex_sds foo get_var resolution custom_sel subdataset crs_ref
1 modis_aggregate_sds TRUE TRUE TRUE TRUE NA NA NA NA NA NA NA NA NA
2 modis_get_vrt NA TRUE NA NA TRUE TRUE TRUE TRUE NA NA NA NA NA
3 modis_mosaic_mod06 NA NA NA NA TRUE TRUE NA NA TRUE TRUE NA NA NA
4 modis_prefilter_sds NA TRUE NA NA NA NA NA NA NA NA TRUE NA NA
5 modis_preprocess_vnp46 NA NA NA NA TRUE TRUE NA NA NA NA NA TRUE TRUE
6 modis_warp_stars TRUE NA NA NA NA NA NA NA NA NA NA NA NA
7 modis_worker NA TRUE NA NA NA NA NA NA NA NA NA NA NA
cellsize threshold crs_out raster date sites_in name_extracted fun_summary_raster id_col radius
1 NA NA NA NA NA NA NA NA NA NA
2 NA NA NA NA NA NA NA NA NA NA
3 NA NA NA NA NA NA NA NA NA NA
4 NA NA NA NA NA NA NA NA NA NA
5 NA NA NA NA NA NA NA NA NA NA
6 TRUE TRUE TRUE NA NA NA NA NA NA NA
7 NA NA NA TRUE TRUE TRUE TRUE TRUE TRUE TRUE |
@mitchellmanware |
@sigmafelix It's always nice to have as much internal consistency - could potentially even be made into a special use-case advanced linter (e.g. arugment_name_linter) |
I agree these general names are more broadly applicable. I think there could be more variability in the ie for the GMTED processing function:
The |
We can also apply the |
@mitchellmanware The separation was for distinguishing a directory of compressed raw data from a directory of the unzipped data (what we actually use to process and calculate). @Spatiotemporal-Exposures-and-Toxicology Yes, a custom linter will help a lot! I will work on adding one. |
Right, thanks for the reminder. That would allow for succinct folder arguments across functions which is good, but seems low priority for now. I will revisit next week. |
Great, thank you Insang. I am very close to being done with my function migration and will create pull request once finished. |
@mitchellmanware closing - reopen if needed. 🤙 |
Updates, additions, and next steps for suite of functions designed to download, process, and calculation covariates from geospatial environmental data sources.
@sigmafelix @Spatiotemporal-Exposures-and-Toxicology
The text was updated successfully, but these errors were encountered: