-
-
Notifications
You must be signed in to change notification settings - Fork 116
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create notebook exploring bias in load growth projections #3910
Comments
Hey @nilaykumar just comment in here if you run into anything strange or have questions about the data or electricity system background (it looks like I can't assign the issue to you until you've engaged with it though) |
What are the formal definitions (according to form 714) of summer and winter? I looked through the form documentation but couldn't find an answer (maybe I missed it!). The EIA's glossary defines summer as May through October and winter as November through April. I'll stick with this for the moment, but let me know if you're familiar with the precise definitions. Edit: and would April 2025, for instance, still count as the winter of 2024? Edit#2: Aha, I should have checked the data dictionary:
|
I suspect that the column descriptions in the data dictionary ultimately came from the EIA column definitions even though they're in a FERC table. Unfortunately the FERC-714 instructions are totally vague on the summer/winter definition, which could mean that every respondent is applying their own criteria and it's not standardized. This EIA post from 2020 give a little insight into how peaks vary by region, month, and hour. The "summer peaking" pattern is a single daily peak late in the afternoon for AC, while the "winter peaking" pattern is two (smaller) daily peaks, in the morning and evening for heating. And when the load curve shifts between these two patterns is different in different regions. E.g. the US Southwest has a "summer" style pattern in all of April, July, and October, and only has the winter pattern in January, but the Northeast has a winter pattern in all of October, January, and April, and only looks like "summer" in July. So maybe it's not unreasonable that different respondents in different climatic regions can choose different cutoffs? For the purposes of this visualization / analysis probably it doesn't matter too much -- if we can just make the cutoff dates for summer/winter a parameter that goes into it we can tweak it later if need be. And looking at all those regional curves, the "winter" peaking demand pattern is always highest in January while the "summer" peaking pattern is always highest in July, so windows that exclude the shoulder seasons are probably fine. It's probably simpler initially to just do a global peak rather than calling out the summer and winter peaks separately. It looks like the RMI analysis didn't differentiate. Are you actually seeing winter peaks that happen in December for some respondents? |
Thanks for the detailed explanation! This question about varying summer/winter designations by geo is an interesting one, but agreed -- it makes sense to stick with a simple global peak for now. I am actually seeing peaks throughout the year, but I might be wrangling the data incorrectly. My notebook is here and I believe it should be visible (I'm new to Kaggle, so let me know if there's something missing). I've got a simple histogram of the 10-year-forecast-vs-realized over-forecast percentage at the bottom. Hopefully that looks reasonable. |
Yes, I can access the notebook! I'm suspicious of the relatively flat distribution of months in which peak demand occurred. I would expect it to be primarily centered around a summer peak in July or August, with a smaller set of planning areas (if any) peaking in ~January.
One thing that might be happening is that the planning areas reporting FERC-714 vary wildly in size, and the smaller ones probably have a much more variable pattern of demand. You might try just looking at planning areas above a certain total demand threshold? The Also the number of peak values being reported in It might be good to spot check a couple of big regional respondents and make sure they look reasonable. E.g. the California ISO and ERCOT should both have a clear summer peak. Maybe aggregate to the max value per day and plot those curves to see what the seasonal patterns look like for various planning areas. |
I think the histogram looks generally like what I would expect. More or less centered around 0, but with a right-skew. Given the wide range of total demand in the different planning areas, it might be more informative to do a histogram that's weighted either by peak demand or total demand. |
Nice catch, I had a lot of duplicates there from demand numbers that were either identically zero or hitting the yearly peak quite often (e.g. during the summer). Dropping duplicates appropriately seems to give more reasonable results (though there are still a decent number of peaks in December, for example):
I've started to sketch out some plots similar to the RMI plot. I'm not familiar enough yet with the data to have much confidence in them yet (the peak-weighted curve is all over the place), but getting there! |
Okay, that distribution looks much more like I would expect -- almost all the peaks are in clear summer or winter months and not the shoulder seasons. |
Annnnd now Georgia Power has an even more bonkers load projection that would triple its overall generating capacity by 2030, almost entirely driven by datacenter loads. Just deranged fantasy. The IRP will be 🍿🔥 |
Hey @nilaykumar I'm going to be offline for December, but @cmgosnell and @arengel would both have context on this analysis if you need to check in with someone. |
Overview
Regulated utilities have a habit of overestimating load growth, in order to justify expanding their rate base. @arengel at RMI did a little exploration of this in 2017 in The Billion Dollar Costs of Forecasting Electricity Demand and it has become relevant once again with the rush to build gas plants and delay coal plant retirements in order to serve "hyperscale" data centers and AI training. To what extent are utilities simply taking advantage of the hype around this narrative to justify "emergency" build out of new fossil infrastructure? Data reported by planning areas in the FERC-714 can provide some context, and would also provide a nice example analysis notebook for our PUDL Examples repo.
Outline
out_ferc714__hourly_planning_area_demand
(only available as Parquet) -- you'll need to look at what the definition of winter and summer peak demand are.Questions
Background Reading
The text was updated successfully, but these errors were encountered: