Skip to content
This repository has been archived by the owner on Sep 11, 2023. It is now read-only.

Add map to each ML training example showing location of installed PV systems #184

Open
JackKelly opened this issue Sep 30, 2021 · 1 comment
Labels
data New data source or feature; or modification of existing data source enhancement New feature or request good first issue Good for newcomers

Comments

@JackKelly
Copy link
Member

JackKelly commented Sep 30, 2021

Detailed Description

At the moment, for each ML example, we ask our ML models to directly predict total solar PV power generation for an entire region of the country (specifically: a region that's electrically connected to a grid supply point (GSP). A GSP is basically a huge electricity substation that is the boundary between the transmission system and the distribution system).

We get the estimated total PV generation for each GSP region from Sheffield Solar's excellent PV Live Regional API. These estimates go back to 2014.

Behind the scenes, Sheffield Solar maintain a map of the locations of installed solar PV systems. This map changes over time. So, for example, Sheffield Solar's estimates of total PV power generation for each GSP for 2016 were created using their map of what PV was installed in 2016.

If we can feed the PV map (for each GSP, and for each timestep) into our ML models, then our ML models will know which patches of the satellite image to focus on.

It's not the end-of-the-world if we can't use this map. With luck, our models may implicitly learn the location of the PV systems for each GSP, and learn how that map changes over time. But it's almost certainly better to explicitly provide this map as an input to the ML model, to give the ML model less to learn for itself :)

This issue is related to #182

Related issues

This issue is about getting the capacity map into nowcasting_dataset.

Let's discuss how to encode the map for our ML models in openclimatefix/nowcasting_dataloader#24

@JackKelly JackKelly added enhancement New feature or request data New data source or feature; or modification of existing data source labels Sep 30, 2021
@JackKelly JackKelly added this to To do in National Grid Nowcasting: WP1 via automation Sep 30, 2021
@JackKelly JackKelly changed the title Add map showing location of installed PV Add map to each ML training example showing location of installed PV systems Oct 1, 2021
@JackKelly
Copy link
Member Author

JackKelly commented Oct 12, 2021

Update from Jamie at Sheffield Solar:

@JamieTaylor-TUOS has very kindly shared PV installed capacity, aggregated by LLSOA, per month as a CSV file (see the email Jamie sent on 7th Oct 2021).

We'll also need to use the LSOA boundary shape file.

Jamie says:

LSOAs vary in size. The biggest ones are in Scotland. The largest is ~1200 km2 !! The median size is 0.418 km2 though, so in general they're pretty small.

So, I think we're good to go! Thanks again, @JamieTaylor-TUOS!

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
data New data source or feature; or modification of existing data source enhancement New feature or request good first issue Good for newcomers
Projects
No open projects
Status: Todo
Development

No branches or pull requests

1 participant