This repository has been archived by the owner on Sep 11, 2023. It is now read-only.
Add map to each ML training example showing location of installed PV systems #184
Labels
data
New data source or feature; or modification of existing data source
enhancement
New feature or request
good first issue
Good for newcomers
Detailed Description
At the moment, for each ML example, we ask our ML models to directly predict total solar PV power generation for an entire region of the country (specifically: a region that's electrically connected to a grid supply point (GSP). A GSP is basically a huge electricity substation that is the boundary between the transmission system and the distribution system).
We get the estimated total PV generation for each GSP region from Sheffield Solar's excellent PV Live Regional API. These estimates go back to 2014.
Behind the scenes, Sheffield Solar maintain a map of the locations of installed solar PV systems. This map changes over time. So, for example, Sheffield Solar's estimates of total PV power generation for each GSP for 2016 were created using their map of what PV was installed in 2016.
If we can feed the PV map (for each GSP, and for each timestep) into our ML models, then our ML models will know which patches of the satellite image to focus on.
It's not the end-of-the-world if we can't use this map. With luck, our models may implicitly learn the location of the PV systems for each GSP, and learn how that map changes over time. But it's almost certainly better to explicitly provide this map as an input to the ML model, to give the ML model less to learn for itself :)
This issue is related to #182
Related issues
This issue is about getting the capacity map into
nowcasting_dataset
.Let's discuss how to encode the map for our ML models in openclimatefix/nowcasting_dataloader#24
The text was updated successfully, but these errors were encountered: