Eqw 2 #94

Merged

Rlamboll merged 13 commits into master from EQW_2

Apr 30, 2020

Collaborator

Rlamboll commented Apr 30, 2020

Pull request

Please confirm that this pull request has done the following:

Tests added
Documentation added (where applicable)
Example added (either to an existing notebook or as a new notebook, where applicable)
Description in CHANGELOG.rst added

Adding to CHANGELOG.rst

Please add a single line in the changelog notes similar to one of the following:

- (`#XX <https://github.com/znicholls/silicone/pull/XX>`_) Added feature which does something
- (`#XX <https://github.com/znicholls/silicone/pull/XX>`_) Fixed bug identified in (`#YY <https://github.com/znicholls/silicone/issues/YY>`_)

Lamboll added 9 commits

April 29, 2020 00:35


          Added sketch of new function, tests do not run

3797e08


          Corrected the function, started reworking tests

55eda00


          Added major bugfix and improved tests (incomplete)

7d46556


          Fixed the tests to run

9e74cc1


          Added function to cruncher comparison

618f525


          Added an extra test for complete coverage, removed an error message

8511fc3


          ran black

067cd52


          Updated changelog

f0fb725


          Minor change to changelog

5d9a140

Rlamboll mentioned this pull request

Equal quant walk #88

Closed

4 tasks

Rlamboll requested a review from znicholls

April 30, 2020 09:25

znicholls approved these changes

Collaborator

znicholls left a comment

lgtm, I'm not sure I fully understand the tests but the code change is sufficiently simple that I'll leave it up to you how thoroughly you want to test it.

Assuming notebooks etc. come next?

src/silicone/database_crunchers/equal_quantile_walk.py

+                  Database cruncher which uses the 'equal quantile walk' technique.
+                  This cruncher assumes that the amount of effort going into reducing one emission set
+                  is equal to that for another emission, therefore the lead and follow data should be

Collaborator

znicholls Apr 30, 2020

Suggested change

      
-                  is equal to that for another emission, therefore the lead and follow data should be
+                  is equal to that for another emission, therefore the lead and follow data should come from

src/silicone/database_crunchers/equal_quantile_walk.py Outdated

+                  This cruncher assumes that the amount of effort going into reducing one emission set
+                  is equal to that for another emission, therefore the lead and follow data should be
+                  the same quantile of all pathways in the infiller database.
+                  It calculates what quantile the lead infillee data is in the lead infiller database,

Collaborator

znicholls Apr 30, 2020

Suggested change

      
-                  It calculates what quantile the lead infillee data is in the lead infiller database,
+                  It calculates the quantile of the lead infillee data in the lead infiller database,

src/silicone/database_crunchers/equal_quantile_walk.py Outdated

+                      lead_vals = lead_vals.sort_values()
+                      quant_of_lead_vals = np.arange(len(lead_vals)) / (len(lead_vals) - 1)
+                      if any(quant_of_lead_vals > 1) or any(quant_of_lead_vals < 0):
+                          raise NotImplementedError("Impossible quantiles!")

Collaborator

znicholls Apr 30, 2020

Suggested change

      
-                          raise NotImplementedError("Impossible quantiles!")
+                          raise ValueError("Impossible quantiles!")

Also very amusing +1

src/silicone/database_crunchers/equal_quantile_walk.py

+                      input_quantiles = scipy.interpolate.interp1d(
+                          lead_vals, quant_of_lead_vals, bounds_error=False, fill_value=(0, 1)
+                      )(lead_input)
+                      return np.nanquantile(follow_vals, input_quantiles)

Collaborator

znicholls Apr 30, 2020

Suggested change

      
-                      return np.nanquantile(follow_vals, input_quantiles)
+                      return np.nanquantile(follow_vals, input_quantiles, interpolation="linear")

It's already what happens but just makes clear?
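For readers following the hunk above, here is a minimal, self-contained sketch of the quantile mapping it performs. It is not the code in the PR: the function name and sample values are hypothetical, and `np.interp` is swapped in for `scipy.interpolate.interp1d(..., bounds_error=False, fill_value=(0, 1))` because it clamps out-of-range inputs to the end values by default, which gives the same 0 and 1 limits. It assumes at least two lead values.

```python
import numpy as np


def find_same_quantile(follow_vals, lead_vals, lead_input):
    """Map each lead input to the follow value at the same quantile.

    Hypothetical stand-in for the method under review; assumes
    len(lead_vals) >= 2.
    """
    lead_sorted = np.sort(np.asarray(lead_vals, dtype=float))
    # Evenly spaced quantiles: 0 for the smallest lead value, 1 for the largest.
    quantiles = np.arange(len(lead_sorted)) / (len(lead_sorted) - 1)
    # Inputs below/above the database range are clamped to quantile 0/1.
    input_quantiles = np.interp(lead_input, lead_sorted, quantiles)
    # Linear interpolation between follow values is NumPy's default here.
    return np.nanquantile(follow_vals, input_quantiles)


lead_db = [0.0, 10.0, 20.0]
follow_db = [100.0, 200.0, 300.0]
result = find_same_quantile(follow_db, lead_db, [-5.0, 5.0, 20.0])
print(result)  # [100. 150. 300.]
```

The input of -5 lies below the lead database and is pinned to quantile 0, 5 sits at quantile 0.25, and 20 is the maximum at quantile 1.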

src/silicone/database_crunchers/equal_quantile_walk.py Outdated

+                      return self._db.filter(variable=variable_follower)
+                  def _find_same_quantile(self, follow_vals, lead_vals, lead_input):
+                      if len(lead_vals) == 1:

Collaborator

znicholls Apr 30, 2020

Suggested change

      
-                      if len(lead_vals) == 1:
+                      if len(follow_vals) == 1:

Would this be clearer?

Collaborator Author

Rlamboll Apr 30, 2020

It's to avoid a singularity later. We could also short-circuit the calculation in the case of a single follow value, but that would be for computational reasons, not essential ones. I guess it's better to take the mean afterwards in case the lengths of the two are different.
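The singularity mentioned here is presumably the division by `len(lead_vals) - 1` in the quantile calculation: with a single lead value NumPy evaluates 0/0 and yields `nan`. The sketch below shows one way such a guard could look. The function name is hypothetical, and returning the mean of the follow values is an assumption based on the reply above, not a confirmed detail of the merged code.

```python
import numpy as np


def find_same_quantile_guarded(follow_vals, lead_vals, lead_input):
    """Hypothetical guarded variant: handle a one-entry lead database."""
    lead_input = np.atleast_1d(np.asarray(lead_input, dtype=float))
    if len(lead_vals) == 1:
        # Quantiles are undefined with one lead value (0 / 0 below), so
        # fall back to the mean of the follow values (assumed behaviour).
        return np.full(lead_input.shape, np.nanmean(follow_vals))
    lead_sorted = np.sort(np.asarray(lead_vals, dtype=float))
    quantiles = np.arange(len(lead_sorted)) / (len(lead_sorted) - 1)
    input_quantiles = np.interp(lead_input, lead_sorted, quantiles)
    return np.nanquantile(follow_vals, input_quantiles)


# One lead value but two follow values: every input gets the follow mean.
single = find_same_quantile_guarded([1.0, 3.0], [5.0], [0.0, 5.0, 9.0])
print(single)  # [2. 2. 2.]
```

Checking the lead length rather than the follow length keeps the guard tied to the quantity that actually appears in the denominator.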

tests/integration/crunchers/test_cruncher_equal_quantile_walk.py Outdated

+                      infilled = res(simple_df)
+                      # We compare the results with the expected results: for T1, we are below the
+                      # lower limit on the first, in the middle on the second. At later times we are

Collaborator

znicholls Apr 30, 2020

Suggested change

      
-                      # lower limit on the first, in the middle on the second. At later times we are
+                      # lower limit on the first, in the middle on the second scenario. At later times we are

? Is there a way to make it slightly clearer in the test rather than having the hard-coded 50 and 100?

Collaborator Author

Rlamboll Apr 30, 2020

It's derived better now

tests/integration/crunchers/test_cruncher_equal_quantile_walk.py Outdated

+                  def test_with_one_value_in_infiller_db(self, test_db, caplog):
+                      # The calculation is different with only one entry in the infiller db. We
+                      # expect a warning and the only value to be returned in all cases.

Collaborator

znicholls Apr 30, 2020

Suggested change

      
-                      # expect a warning and the only value to be returned in all cases.
+                      #

No warning at the moment ?

Collaborator Author

Rlamboll Apr 30, 2020

True. Probably not going to add one now, they are annoying

tests/integration/crunchers/test_cruncher_equal_quantile_walk.py Outdated

+                      )
+                      # Repeat with reducing the minimum value. This works differently because the
+                      # minimum point is doubled. By default the cruncher selects the higher

Collaborator

znicholls Apr 30, 2020

Suggested change

      
-                      # minimum point is doubled. By default the cruncher selects the higher
+                      # minimum point is doubled. This modification causes the cruncher to pick the lower value.

Lamboll added 4 commits

April 30, 2020 13:46


          Added changes in the documentation details

d080d15


          Added a cover for the case of missing values

3d20700


          Improved commetns

6286d3e


          Changed explanatory text in tests

7691e7f

Rlamboll merged commit 0562e27 into master

Rlamboll deleted the EQW_2 branch

April 30, 2020 14:25