Weather DB Schema Plans (long term storage v. API call) #24

tnigon · 2020-05-11T14:10:22Z

To Do

Create a schematic for DB schema/structure for weather data. This should include all relevant DB tables in addition to any weather data stored in memory (e.g., from a quick API call; maybe for forecast data).

For each DB table or in-memory dataframe, list the column headings that describe the nature of the data. Make it obvious which tables can be joined together.
For each DB table or in-memory dataframe, define the temporal resolution that must be stored (15 min, hourly, daily, etc.).
For now, let's assume we will want to organize/query weather data by location (not by customer, field name, etc.). Not sure what this looks like, but try to capture this concept in the schematic.
What data we actually use for our prediction models is somewhat irrelevant to this issue. As we build more models/customer use cases, we will naturally rely on different weather products, which we can't anticipate now. Set up to scale easily for basice weather columns (e.g., those of the EPIC .Dly files) - T_min (ºC), T_max (ºC), relative humidity (%), precipitation (mm), wind speed (m/s @ 2m above ground level), and solar radiation (MJ/m^2).

Notes

Retrieving/storing data is certainly separate from compiling (I would call it populating an X matrix). This card is dealing with data storage, not building an X matrix. I suggest always having a default weather product to use (this will be whatever product gives us the highest likelihood of the existence of data - probably some hourly or daily historical API). Then for each "observed" data product (this could be a weather station/point), we store data in addition to the "default". Only at the point of populating the X matrix do we choose which weather data to use. A simple if/then would be if weather station data exists, overwrite API data, else use API data.

tnigon added the documentation Improvements or additions to documentation label May 11, 2020

tnigon assigned tnigon and bohm0072 May 11, 2020

tnigon added this to To do in research_tools project via automation May 11, 2020

tnigon mentioned this issue May 11, 2020

Weather data - use 3rd party API to populate DB table for every point #23

Open

tnigon moved this from To do to In progress in research_tools project May 11, 2020

tnigon unassigned tnigon and bohm0072 Oct 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Weather DB Schema Plans (long term storage v. API call) #24

Weather DB Schema Plans (long term storage v. API call) #24

tnigon commented May 11, 2020 •

edited

Loading

Weather DB Schema Plans (long term storage v. API call) #24

Weather DB Schema Plans (long term storage v. API call) #24

Comments

tnigon commented May 11, 2020 • edited Loading

To Do

Notes

tnigon commented May 11, 2020 •

edited

Loading