Determine whether a system has a fixed orientation or is equipped with a tracker #49

wfvining · 2020-05-29T21:58:10Z

This adds a function to the pvanalytics.system module for identifying the orientation (Fixed or Tracking) of a PV system from power or irradiance data. This is a fairly early draft, but I wanted to get it out there for early feedback/suggestions.

This is derived from the PVFleets QA Analysis project.

(incomplete) To Do list

- Finish implementing the testing plan (see comments in tests/test_system.py).
- Decide whether/how to pass goodness-of-fit parameters.
- Add to API documentation.
- Test winter and summer months separately. This is necessary to catch problems during the winter that are masked when the entire series is considered as one, because the maximum power/irradiance envelope is higher in the summer months.

Trying to take just the core functionality from PVFleets QA. These are the main differences:

1. Data is not separated into winter and summer months. This means that if there are time shifts in the data they could mess up the fit. By removing the separation we get the core functionality (curve fitting) and can leave it to the user to pass only winter/summer and combine the orientation inferred for each into a meaningful result (i.e. PVFleets basically decides the orientation is inconclusive if the winter and summer don't match).
2. A Boolean mask for day/night is passed as a parameter.
3. Clipping is passed as a Boolean mask, and the percent of clipping is calculated (rather than passing in clip_percent directly).
4. The responsibilities of the function have been made narrower, reducing the number of values returned and hopefully making future maintenance simpler.

It is possible (probable?) that I went too far in removing responsibilities and that some additional return values are needed. When this comes up we should look carefully at whether to add them here or add new functions with separate responsibilities.

Going forward I need to make a testing plan for this and write some tests. I also need to decide whether returning None is a good idea, or if I should add a third instance to the Orientation Enum (e.g. INDETERMINANT). Finally, I would like to do a bit of simplification in the _orientation_from_fit function. It would be nice if this could be made more general/flexible (i.e. pass in the various thresholds); it will take a bit of exploring to figure out the best way forward for this.

This change is a first step towards letting the user pass in a data structure with their own minimum r^2 values. Not quite there, but this makes it so that when we do pass in these parameters we don't have to change the Tracking/Fixed/Unknown logic.

Calculating clearsky separately for every test that needs it slows the tests down. Here I change the scope of the summer_clearsky fixture so it will be computed only once. I also change the timestamp spacing from 10 minutes to 1 hour, substantially speeding up the tests. Not a huge performance difference right now, but as more tests are added small performance gains should add up.

The minimum r-squared for curve fits changes depending on the amount of clipping in power data passed in. Originally the different minimums were hard coded, this commit is a first attempt to let users pass in their own minimums. It needs substantially more documentation.
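One way to picture user-supplied minimums is a lookup keyed by the amount of clipping. The sketch below is only an illustration under stated assumptions: the names `EXAMPLE_MIN_R2`, `minimum_r2`, and `fit_is_good` are hypothetical, the numbers are placeholders rather than the values used in this pull request, and the half-open ranges are an arbitrary choice.

```python
# Hypothetical thresholds keyed by a half-open range [lower, upper) of the
# clipping fraction. The numbers are placeholders, not library defaults.
EXAMPLE_MIN_R2 = {
    (0.0, 0.1): 0.9,
    (0.1, 0.5): 0.8,
}


def minimum_r2(clip_fraction, thresholds):
    """Return the minimum r-squared for clip_fraction, or None if no range matches."""
    for (lower, upper), min_r2 in thresholds.items():
        if lower <= clip_fraction < upper:
            return min_r2
    return None


def fit_is_good(r2, clip_fraction, thresholds):
    """True when r2 meets the minimum that applies to this amount of clipping."""
    min_r2 = minimum_r2(clip_fraction, thresholds)
    return min_r2 is not None and r2 >= min_r2
```

Keeping the lookup separate from the comparison means the classification logic does not need to change when a user passes a different table of minimums.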

Instead of passing clipping percentage and maximum clipping percent down through three layers of internal functions we perform the clipping percent test and get the fit bounds at the top level. This makes the code somewhat more efficient by short-circuiting the resampling/grouping steps if there is too much clipping. The number of parameters that are threaded through the internal functions is also reduced by two, which should improve maintainability and readability.
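The restructuring described in this commit can be sketched roughly as follows. This is not the pvanalytics implementation: `classify`, `clipping_fraction`, and `fit_and_classify` are hypothetical names, the enum values are illustrative, and treating a clipping fraction strictly greater than `clip_max` as "too much" is an assumption. The sketch uses a fraction rather than a percent, as adopted later in this pull request.

```python
import enum


class Orientation(enum.Enum):
    FIXED = 1
    TRACKING = 2
    UNKNOWN = 3


def clipping_fraction(clipping):
    """Fraction of measurements flagged as clipped (0.0 for empty input)."""
    clipping = list(clipping)
    if not clipping:
        return 0.0
    return sum(bool(flag) for flag in clipping) / len(clipping)


def classify(values, clipping, clip_max, fit_and_classify):
    """Check clipping once at the top level, then delegate.

    `fit_and_classify` stands in for the internal resampling, grouping,
    and curve-fitting steps; it never sees the clipping arguments.
    """
    if clipping_fraction(clipping) > clip_max:
        # Short-circuit: skip the expensive steps entirely.
        return Orientation.UNKNOWN
    return fit_and_classify(values)
```

Because the check happens before any delegation, the inner helpers no longer need the two clipping-related parameters.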

cwhanse

Mostly editorial, one loose end about the seasonal splitting. Go ahead and merge when these comments are closed.

pvanalytics/system.py

pvanalytics/tests/test_system.py

pvanalytics/system.py

Change the semantics of the seasonal_split param. Now expects a dictionary with keys 'winter' and 'summer'. The behavior of `infer_tracking_envelope()` is also updated so if either season has no data, curve fitting proceeds only on the season that has data (previously the entire series was used).
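A rough sketch of the behavior described in this commit, using plain `(timestamp, value)` pairs instead of a pandas Series. The function name `seasons_with_data`, the month lists in `EXAMPLE_SPLIT`, and the use of `None` to disable the split are assumptions made for illustration; they are not taken from the pvanalytics source.

```python
from datetime import datetime

# Hypothetical month lists; the library's actual defaults are not shown here.
EXAMPLE_SPLIT = {"winter": (11, 12, 1, 2), "summer": (5, 6, 7, 8)}


def seasons_with_data(samples, seasonal_split):
    """Group (timestamp, value) pairs by season, dropping empty seasons.

    With seasonal_split=None the whole series is returned as one group.
    """
    samples = list(samples)
    if seasonal_split is None:
        return {"all": samples}
    groups = {}
    for season, months in seasonal_split.items():
        selected = [(t, v) for t, v in samples if t.month in months]
        if selected:  # a season without data is simply left out
            groups[season] = selected
    return groups
```

Curve fitting would then run once per returned group, so a series covering only one season is fitted on that season alone rather than on the entire series.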

wfvining · 2020-08-31T17:40:43Z

@cwhanse I implemented the optional seasonal split with the dictionary parameter discussed above (along with new tests). Would you mind taking a quick look to see if you think it is reasonable? I didn't want to put a dictionary as the default value directly (mutability concerns), so I used a descriptive keyword.

I also added a warning for possible user error when neither season has data (seems more polite than a ValueError, or silently returning UNKNOWN).

pvanalytics/system.py

cwhanse · 2020-08-31T21:13:45Z

Looks good. You decide about that last comment.

pvanalytics/system.py

wfvining mentioned this pull request Jun 16, 2020

(single-axis) trackers #54

Closed

wfvining force-pushed the system-orientation branch from 53f23f9 to f7d1d79 Compare June 19, 2020 14:28

wfvining added 17 commits June 26, 2020 13:16

Started porting orientation check from pvfleets QA

133c741

- Use an enum for the return value, rather than a string.
- Calculation of envelopes that will be used for curve fitting.

Can pass irradiance or power to system.orientation()

eab6105

Change name of first parameter and update description of several functions and their parameters.

Add test for GHI orientation.

22ed3c8

The orientation of a GHI sensor is FIXED.

Pass correct values to linregress

fc80b1f

Previously the polynomial object was being passed; linregress needs the y values.

number Orientation enum manually

f6b188a

enum.auto() is not supported on Python 3.5. We add the @enum.unique decorator to enforce uniqueness of instances since the values are now assigned manually.
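A minimal sketch of the pattern this commit describes, assuming only the member names FIXED and TRACKING that appear elsewhere in this pull request. The numeric values and the helper `duplicate_value_is_rejected` are illustrative and not taken from the pvanalytics source.

```python
import enum


@enum.unique
class Orientation(enum.Enum):
    """Illustrative enum; the numeric values are assumptions."""
    FIXED = 1
    TRACKING = 2


def duplicate_value_is_rejected():
    """Return True if @enum.unique rejects a repeated value."""
    try:
        @enum.unique
        class Broken(enum.Enum):
            FIXED = 1
            TRACKING = 1  # would be an alias of FIXED, which @enum.unique forbids
    except ValueError:
        return True
    return False
```

Without the decorator, a repeated value silently creates an alias for the first member, which is easy to do by accident once values are numbered by hand.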

Clean up comments and docstrings for pvanalytics.system tests

021937e

Add UNKNOWN instance to Orientation enum

f6cc84c

Rather than abusing/overloading None, we should explicitly signal that the orientation could not be determined from the data.

Add orientation test based on simulated power data

6043dbd

Uses pvlib to simulate power from a fixed PV system and verifies that system.orientation() returns Orientation.FIXED.

Test that a simulated single axis tracker is Orientation.TRACKING

9bae1d4

Remove stray print statement

5f81f94

Make maximum clipping percentage a parameter

8b21600

data with too much clipping has UNKNOWN orientation

cb0bbb3

Use simplified solis model for generating clearsky irradiance

7a882c4

simplified_solis does not require tables.

wfvining force-pushed the system-orientation branch from f7d1d79 to 7a882c4 Compare June 26, 2020 19:16

wfvining added 10 commits June 26, 2020 14:39

Refactor to use shared utility functions

a79c57e

Use functions in 'util._fit' and 'util._group' rather than locally defined functions.

refactor to improve readability

69001ae

Defines _is_tracking/_is_fixed predicates, which shorten lines and make the code clearer.

Test that a constant signal has "UNKNOWN" orientation

08163e1

Test orientation on tracking system that is broken part of the time

556d984

If the profile of the median does not match the profile of the 99.5% quantile (because 100 days have garbage data) then the orientation is UNKNOWN.
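To illustrate why garbage data can make the median profile diverge from the upper envelope, here is a small sketch using only the standard library. The function names, the nearest-rank quantile definition, and the grouping by a time-of-day key are assumptions made for this example; the actual grouping and fitting code in pvanalytics is not shown here.

```python
import math
import statistics


def upper_quantile(values, q=0.995):
    """Nearest-rank quantile of a non-empty sequence."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q * len(ordered)))
    return ordered[rank - 1]


def daily_profiles(by_time_of_day):
    """Map each time-of-day key to (median, upper envelope)."""
    return {
        key: (statistics.median(values), upper_quantile(values))
        for key, values in by_time_of_day.items()
    }
```

When more than half of the values at a given time of day are bad, the median follows the bad data while the upper quantile still follows the good days, so the two profiles stop sharing a shape.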

Suppress warning caused by test data with slope equal to zero

209fc48

Zero slope (in the constant data) leads to warnings from the curve fitting functions due to division by zero.

Test that a fixed system with perturbed data has UNKNOWN orientation

1486684

100 days of garbage data messes up the median and causes the fit to fail when 'fit_median' is True.

Passing series with wrong type in constant value test

25adf42

Revise test plan

b346653

- Don't need to test POA separately.
- Require timezone-localized series as input.

Add test for custom r-squared thresholds

5b374de

Basically we can classify fixed as tracking by messing with the thresholds.

Added system.orientation to API docs

92d9a46

Substantially expanded the documentation in the docstring

wfvining added 6 commits July 24, 2020 13:31

Shorten lines and fix whitespace in is_tracking_envelope docstring

840a8a0

Merge branch 'master' into system-orientation

f2548d4

Update _infer_tracking() function to pass x and y params to _fit functions.

Use clipping fraction instead of percent

8b04c22

Use the ratio of clipped measurements to the total number of measurements directly rather than converting to a percent. With a fraction between 0 and 1 the fit params dictionary is a little easier to understand.

Add seasonal_split parameter

335492d

Users can pass in a tuple with the summer months and winter months specified in lists. This gives users in different regions the ability to customize the performance of the algorithm to match their particular seasons.

Clarify documentation of clip_max parameter

9ca0efa

Make it clear that if there is too much clipping neither TRACKING nor FIXED can be determined.

wfvining requested a review from cwhanse July 27, 2020 14:39

Merge branch 'master' into system-orientation

0588bff

cwhanse approved these changes Aug 25, 2020

wfvining and others added 6 commits August 28, 2020 08:19

Group related parameters

35e02b7

Reorder the params for system.is_tracking_envelope() so that parameters relating to the envelope fit are together and parameters relating to the median fit are together.

Specify default seasonal_split in parameter description

4b923c9

Shorten line to satisfy linter

49c169d

Update documentation

b86faf4

Substantial improvements to documentation from code review.

Fix test descriptions

90e62a5

Co-authored-by: Cliff Hansen <cwhanse@sandia.gov>

Explicitly set the default seasonal_split

fba925b

We can use any type for the "list" of months, as long as it is iterable. Changed to a tuple to avoid problems inherent in using mutable objects as default values.
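The problem with mutable defaults that this commit avoids can be shown with a short, generic example. The function names below are hypothetical and unrelated to the pvanalytics API; they only demonstrate the Python behavior.

```python
def add_month_shared(month, months=[]):
    """Pitfall: the default list is created once and reused across calls."""
    months.append(month)
    return months


def add_month_safe(month, months=()):
    """A tuple default cannot be mutated, so each call starts clean."""
    return tuple(months) + (month,)
```

Calling `add_month_shared(1)` and then `add_month_shared(2)` returns `[1]` and then `[1, 2]`, because both calls share the same default list. The tuple version returns `(1,)` and `(2,)`.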

cwhanse reviewed Aug 28, 2020

pvanalytics/system.py (outdated)

wfvining added 6 commits August 28, 2020 15:37

Test for new seasonal_split semantics

442abe7

Add coverage for various corner cases and for disabling the seasonal split altogether.

Perturb a greater amount of the winter data in tests

3485c78

The `winter_perturbed` fixture did not perturb enough data to dramatically change the median. Extends the months that are perturbed into Spring so that if the entire year is used for curve fitting the median fit will fail.

pass clip_min as a fraction not a percent

77cb133

Issue a warning when there is no data for both winter and summer

6ff9dd6

We just return UNKNOWN in this case; however, it is possible that the user made a mistake. Providing a warning seems like the polite thing to do.
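A hedged sketch of warning instead of raising, using the standard `warnings` module. The function name `check_seasons` and the message text are made up for this example; the wording of the actual warning in pvanalytics is not reproduced here.

```python
import warnings


def check_seasons(winter, summer):
    """Return True if at least one season has data; warn otherwise."""
    if len(winter) == 0 and len(summer) == 0:
        warnings.warn(
            "No data in either the winter or the summer months; "
            "check the series and the seasonal split.",
            UserWarning,
        )
        return False
    return True
```

A caller that receives `False` can still return UNKNOWN, so the result is unchanged for existing code while the user gets a hint that the input may be wrong.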

Reword documentation

9c09982

Highlight that seasonal grouping is optional and improve description of `seasonal_split` parameter.

cwhanse reviewed Aug 31, 2020

pvanalytics/system.py (outdated)

Better warning message

57d72ad

wfvining commented Aug 31, 2020

pvanalytics/system.py (outdated)

Shorten line

3bed642

wfvining merged commit fb0091d into master Sep 1, 2020

wfvining deleted the system-orientation branch September 1, 2020 13:39
