Develop spec lean #161

rburghol · 2024-05-06T21:29:34Z

@PaulDudaRESPEC @austinorr
This PR contains the changes to the HSP2 function to include special actions, classic ACTIONS. The following is included:

HSP2/SPECL.py the base to enable special actions support.
HSP2/state.py: improved data model to hold and pass information amongst dynamic operations.
om*.py additions: entirely new code to support SPECL.py and dynamic operations
- om_simtimer.py: timer setup, maintains running date/time info in state_ix
- om_model_object.py: base class for parsing and formatting special operations into runtime
- om_model_linkage.py: managing links among operations to insure dependency ordering and data sharing
- om.py: runtime execution code for special operations
tests/test10/HSP2results/test10spec.uci: a simple example of special actions added to the standard test10.uci example.
SEDTRN.py: special actions enabled into SEDTRN
HYDR.py: special actions enabled into HYDR
main.py: calls initialization for specl

This has been run installed via pip in python 3.10.8 successfully

rburghol · 2024-05-06T21:36:35Z

By the way @austinorr - I can see the use of matrix that you mentioned and that works really nicely for eliminating duplicative code in tests over mucho python versions - awesome!

austinorr · 2024-05-07T00:30:43Z

@rburghol Thanks for making this diff nice and clean. Do you want to add an HSPFresults directory with the results you got when you run it in hspf? This should probably be in a separate directory than the test10 on, maybe add a new one called test10spec? Then we can check that the code in this PR yields the same results.

rburghol · 2024-05-07T11:48:27Z

Good thinking @austinorr -- test10specl has been separated into it's own folder with HSP2results and HSPFresults folders. I will do a PR to add test coverage into the develop branch after this gets merged so that I am not updating the test files from this branch unless you think it's a must to do in one... tho we're now back in a 30+ file commit.

PaulDudaRESPEC · 2024-05-07T13:44:50Z

These changes to main.py would overwrite some of what Tim just added in PR #159 (for instance 'Union' in the arg list). Was that an accident?

austinorr · 2024-05-07T13:48:47Z

I will do a PR to add test coverage into the develop branch after this gets merged

Let’s take a look at all the new pytest code that got put together to help us automate the tests. I think once we go through it we’ll be able to remove the HSP2results directory (we only need the HSPFresults for the new test pattern) and the notebooks too, unless they’re important for documentation.

In your local Python env, try the following:
$pip install -e .[dev]
$pytest tests -v

you’ll find that there’s a test harness in the tests directory that we can add the test10spec to, with some minor revisions, and it’ll run it and do a full comparison.

rburghol · 2024-05-07T13:51:47Z

@PaulDudaRESPEC Definitely not code that was intentionally modified in develop–specact. Git must not be able to handle this, considering this branch has been live for so long before being merged.

austinorr · 2024-05-07T13:58:39Z

I also hope we can do the chore of fixing the “*” imports before we merge this. One of the things we’ve discussed a lot is getting this project formatted and linted and I don’t think we should burden that lift with having to fixing all of these imports. Let’s fix them here instead.

rburghol · 2024-05-07T14:13:01Z

@PaulDudaRESPEC I have migrated the changes to main.py that @timcera merged in #159

rburghol · 2024-05-07T14:23:37Z

@austinorr I am going to make these import changes.

…EDTRN

rburghol · 2024-05-07T16:05:49Z

OK @austinorr -- the import style has been switched for all files in this PR.
I am curious going forward what your preference is in regard to things like import pandas date_range, to then reference the function as date_range, versus, import pandas as np, and then using the as np.date_range?

austinorr · 2024-05-07T16:27:30Z

@rburghol Thanks, looking great! Common practice is to use the pandas or numpy namespace whole with import pandas as pd then reference things with pd.date_range, as you said. However, I don't actually have a strong opinion, as long as things are explicitly imported, then it's just personal preference and linters will pass. My preference is for less clutter up top, but that just means lots of little pd.<everythings> throughout the rest of the code.

I'm looking through the test directory you made for this, and I wonder what would be the best way to test the special actions code? I see the test10specl.uci, but it's never been run by hspf (there's no .hbn files referenced in that uci) and there's no differences between the test10.uci except the new one has a typo in line 381 that @PaulDudaRESPEC caught and fixed last week (SNDFG should be SDFG). Can we slim down the new test directory to just what we actually use to verify the new capabilities this PR added?

rburghol · 2024-05-07T16:51:37Z

@austinorr the files are different in that test10specl.uci has a special actions block beginning on line 994:

SPEC-ACTIONS
*** test special actions
  RCHRES 5                                RSED    4       +=  2.50E+05
  RCHRES 5                                RSED    5       +=  6.89E+05
  RCHRES 5                                RSED    6       +=  4.01E+05
END SPEC-ACTIONS

This was a super quick example that @PaulDudaRESPEC recommended in an email exchange we had, and thus I wonder what tests he might advise? Note: since I don't run HSPF on windows, it would be helpful if someone else ran it, I literally don't have a WinHSPF install handy.

In terms of slimming the test10specl dir, sure, it's possible, but I have no opinion other than to say I think a uniform directory tree for test files would facilitate automated testing. When you requested that I add an HSPFresults directory, I assumed I should follow the apparent project convention to have an HSPFresults dir and an HSP2results dir for the purpose of automated comparisons between results. I just cloned it, as I have not touched those other files in the past.

Again, I would ask @PaulDudaRESPEC what the thinking behind the various jupyter notebooks and the 2 directories was, and whether we should carry that practice forward.

austinorr · 2024-05-07T16:57:12Z

I can do it, and with your permission, i'll clean up the test directory.

rburghol · 2024-05-07T18:31:12Z

@austinorr I'd welcome it! (both the model running and the cleanup).
And for the folder structure, I'm happy to provide feedback or ideas if you want to get together on it, but since I was not involved in the first round of test and examples setup, I don't yet have an opinion, other than to suggest that we put *.h5 in .gitignore rather than explicitly ignoring test h5 files since these things can be a menace with githubs file size limits and that can be an impediment to less experienced git users producing tests (or to someone who is working on a tight timeline).

austinorr · 2024-05-07T19:15:10Z

@rburghol do you have any time to screen share and talk me through how to run the specl actions? I've gotten things all run through HSPF, new hbn files etc, but they aren't matching hsp2 results in reach 5 -- the one we added actions for.

rburghol · 2024-05-07T19:41:59Z

Thanks @austinorr for pushing this forward!

I'm tied up right now, but I may be able to find some time tomorrow, and definitely on Thursday I will have a great deal of flexibility.

@PaulDudaRESPEC Helped me set up this example, and may have some testing insight.

PaulDudaRESPEC · 2024-05-07T21:07:43Z

In that RCHRES SANDFG table, make sure the values are lined up in the 15th column -- they appear to be in the 16th.

rburghol · 2024-05-08T11:54:52Z

Thanks @PaulDudaRESPEC - when we set these up originally did you have a quick script to compare output between the two hsp2 runs to verify function?

PaulDudaRESPEC · 2024-05-08T14:24:39Z

I'll send you the UCI that I suggest you use for the test (via email).

rburghol · 2024-05-08T15:43:56Z

@austinorr I have incorporated the new test10specl.uci that @PaulDudaRESPEC sent into the HSP2 and HSPF results folders.

…ch, added domain specific specl filter in HYDR

tests for spec actions

…ated as registers that get zeroed

rburghol · 2024-05-10T15:48:14Z

@austinorr I just cleaned up a last few imports. I think this is now ready to go, unless you want to wait until we rethink how the object creation should pass parent and/or child in (#162), though I'd like to give that more thought, and ultimately think it's more about readability than functionality at this juncture.

…special case of ModelVariable, and then ModelVariable can form the base class for simple numerical data holders like UVQUAN

austinorr · 2024-05-14T16:42:51Z

@rburghol I think we're close -- I'm going to do a couple quick performance checks with the tests now that they pass.

One thing we should do is delete all the unrelated personal development files that are in here. I had done that already in one of my commits, but they got re-added.

all compre-case.json should be deleted, right?
all changes to tests/testcbp* should be reverted, right?

we don't need a wdm in test10specl/HSP2results. If a contributor makes it so they can test it with the 'convert' script, please delete it or leave it out of the commit.

we don't need the test10/HSP2results/pytest10.py file, it doesn't run any tests and it looks like it's just scratch notes for a readme. let's pull out the valuable notes and make this file a .md or .txt.

we don't need the test10/HSP2results/test10.example.py file either, this looks like scratch work and it appears nothing imports from it.

It looks like this could be avoided by using a per-file staging pattern when building a commit. VSCode and other IDEs help with this a lot, you can just click the '+' button to stage each file that has a necessary change (you can even stage selected lines!), then do git commit -m "something something new feature"

PaulDudaRESPEC · 2024-05-14T17:17:56Z

@austinorr and @rburghol , I think that test10.example.py file is not 'needed', but it is a good example of how one could add custom python code as a sophisticated special action, and therefore is worth keeping.

I think there are probably other files I've added over the years that ought to be removed. I can make a pass through before release to clean out any leftovers that shouldn't really bin there.

I continue to be holding for a nod from you both, at which point I'll merge this PR.

rburghol · 2024-05-14T17:23:48Z

@austinorr the below are all now completed and have run successfully through the tests -- a couple notes follow.

Thanks for the heads up, couple things:

all compre-case.json should be deleted, right?
- Yes. compare_case.* can and will be deleted.
All changes to tests/testcbp* should be reverted, right?
- Maybe? Relevant changes have been made in these files to explore the changes that have been made during this overhaul, so I would say no on reversion, but that directory can be slimmed down if you like. There is one file that is really oriented towards a next PR, but doesn't break anything.
- jk_tester.* can and will be deleted.
we don't need a wdm in test10specl/HSP2results
- I might be missing something here, that file is called for in the uci, and thus gets imported to the h5 during testing, so I think it is necessary - are you saying the test10spec.uci should be linked "../HSPFresults/test10.wdm"? Note, it also appears in TEST10_hsp2_compare.ipynb
...pytest10.py file, it doesn't run any tests,
- It can and will be deleted (I migrated comments to Advanced Model validation tests #143). Tho while it does (did?) get called by the test workflow to perform test of whether or not h5 file creation completed with the test step hsp2 import_uci test10uci test10.h5, I can confirm it is superfluous now since your comparisons create and execute an h5.
test10/HSP2results/test10.example.py
- To give some more detail to Paul's response above, it is actually relevant I think, though not employed in automated tests. It tests the ability to employ a python script addendum to special actions. The system looks for [uci root].py and if it finds it with a properly formatted function to modify the model, it does so. Paul renamed it to test10.example.py in order to turn that off by default, but we kept it there as an example to be run manually, or as something we could enable in our testing scripts by doing a file copy to the name of the testing h5 file which would automatically trigger its inclusion -- but it would be a simple "test if it runs without blowing up" at this point, or we could retool the example to mimic the test10spec.uci behavior, and see how that worked.
It looks like this could be avoided by using a per-file staging pattern when building a commit.
- I want to learn how to do this :)

austinorr · 2024-05-14T20:33:48Z

@PaulDudaRESPEC @rburghol sounds good, I'm for keeping things we need -- just want to make sure we don't add things accidentally.

If testcbp is ready to include in the test suite, let's make an HSPF directory for it so that we can get results to check against and add it to the suite. Currently only test10 and test10specl are in the pytest suite. Nothing in the pytest suite references HSP2results directories, those are from the tests/convert manual testing pattern. Since this tool is a drop-in replacement for HSPF, we should maybe try to migrate away from having some files for the HSPF run and other duplicated files for the HSP2 run. That's the approach the test suite takes, there's only one test case, and it runs in both HSPF and HSP2 exactly identically. If we duplicate the files and run separate tests for each, we can't say that anymore.

Looping back on performance, I ran the test10 with both the 'develop' branch and this one and timings look good.

under develop we have:

pytest -k test10- --durations 0
================ slowest durations =====================
54.69s call     tests/test_regression.py::test_case[test10-0]
28.54s call     tests/test_regression.py::test_case[test10-3]
28.32s call     tests/test_regression.py::test_case[test10-1]
28.30s call     tests/test_regression.py::test_case[test10-4]
28.18s call     tests/test_regression.py::test_case[test10-2]

under this branch we have:

pytest -k test10- --durations 0
================ slowest durations =====================
43.56s call     tests/test_regression.py::test_case[test10-0]
28.48s call     tests/test_regression.py::test_case[test10-2]
27.35s call     tests/test_regression.py::test_case[test10-4]
26.98s call     tests/test_regression.py::test_case[test10-3]
26.27s call     tests/test_regression.py::test_case[test10-1]

the first call includes compiling and caching the numba jit code, so we can ignore it.

kudos @rburghol for getting this huge feature implemented without a performance hit.

Note that because of all the disk io writing to H5 files, the numba compiled code (28 seconds avg) is not much faster than pure python code (32 seconds avg):

NUMBA_DISABLE_JIT=1 pytest -k test10- --durations 0

36.20s call     tests/test_regression.py::test_case[test10-4]
33.02s call     tests/test_regression.py::test_case[test10-0]
32.83s call     tests/test_regression.py::test_case[test10-3]
32.15s call     tests/test_regression.py::test_case[test10-1]
31.91s call     tests/test_regression.py::test_case[test10-2]

I think we should focus our attention there next.

rburghol · 2024-05-15T11:16:40Z

Thanks @austinorr for all your guidance and lift on this! Thanks @PaulDudaRESPEC !!
Stoked to try and up the performance level in the next phase.

Burgholzer added 8 commits May 6, 2024 16:24

just hsp2/om specific stuff

140cc60

just hsp2/om specific stuff

bec246c

special actions and support files state and om

ff7cf97

special actions and support files state and om

059263f

change endings for HYDR.py

5ffd306

change endings for SEDTRN.py

ff0e799

change endings for main.py

ac90d34

change endings for state.py

2ff7f56

austinorr mentioned this pull request May 7, 2024

Merge changes from develop-specact into develop #155

Closed

added test10specl.uci

c571c9d

Merge changes that were reverted from respec#159

3006ec7

Burgholzer added 4 commits May 7, 2024 10:50

load specific functions instead of blanket import

8396388

load functions for ModelObject and SimTimer

c882806

load required functions for state into om

8e61c34

load required functions for state/om/specl explicitly into HYDR and S…

28ab858

…EDTRN

corrected typo in the comment/header line in the specl UCIs

0e1258c

added new UCI test from Paul

8437dc2

Burgholzer and others added 6 commits May 8, 2024 13:37

restored reversion for SEDTRN specl execution, fixed precision mismat…

ed9f6d1

…ch, added domain specific specl filter in HYDR

tests for spec actions

3d40f9b

Merge pull request #100 from Geosyntec/spec-act

a189d50

tests for spec actions

better example in comments, no functional change

29fc5f2

Removed global ModelObject class attriutes. Fixed special actions tre…

041e037

…ated as registers that get zeroed

cleaned up a couple remaining import statements

c1af3f9

Renamed ModelConstant to ModelVariable since ModelConstant should be …

cf00586

…special case of ModelVariable, and then ModelVariable can form the base class for simple numerical data holders like UVQUAN

Burgholzer added 2 commits May 14, 2024 13:34

Deleeted superfluous files

91f1d60

Deleted superfluous files

a4b2b1d

aufdenkampe added this to the v0.12.0: Improved packaging, etc. milestone May 17, 2024

PaulDudaRESPEC merged commit 460f9f7 into respec:develop May 20, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Develop spec lean #161

Develop spec lean #161

rburghol commented May 6, 2024 •

edited

Loading

rburghol commented May 6, 2024

austinorr commented May 7, 2024

rburghol commented May 7, 2024

PaulDudaRESPEC commented May 7, 2024

austinorr commented May 7, 2024

rburghol commented May 7, 2024 •

edited

Loading

austinorr commented May 7, 2024

rburghol commented May 7, 2024

rburghol commented May 7, 2024

rburghol commented May 7, 2024

austinorr commented May 7, 2024

rburghol commented May 7, 2024 •

edited

Loading

austinorr commented May 7, 2024

rburghol commented May 7, 2024

austinorr commented May 7, 2024

rburghol commented May 7, 2024 •

edited

Loading

PaulDudaRESPEC commented May 7, 2024

rburghol commented May 8, 2024

PaulDudaRESPEC commented May 8, 2024

rburghol commented May 8, 2024

rburghol commented May 10, 2024 •

edited

Loading

austinorr commented May 14, 2024

PaulDudaRESPEC commented May 14, 2024

rburghol commented May 14, 2024 •

edited

Loading

austinorr commented May 14, 2024

rburghol commented May 15, 2024

Develop spec lean #161

Develop spec lean #161

Conversation

rburghol commented May 6, 2024 • edited Loading

rburghol commented May 6, 2024

austinorr commented May 7, 2024

rburghol commented May 7, 2024

PaulDudaRESPEC commented May 7, 2024

austinorr commented May 7, 2024

rburghol commented May 7, 2024 • edited Loading

austinorr commented May 7, 2024

rburghol commented May 7, 2024

rburghol commented May 7, 2024

rburghol commented May 7, 2024

austinorr commented May 7, 2024

rburghol commented May 7, 2024 • edited Loading

austinorr commented May 7, 2024

rburghol commented May 7, 2024

austinorr commented May 7, 2024

rburghol commented May 7, 2024 • edited Loading

PaulDudaRESPEC commented May 7, 2024

rburghol commented May 8, 2024

PaulDudaRESPEC commented May 8, 2024

rburghol commented May 8, 2024

rburghol commented May 10, 2024 • edited Loading

austinorr commented May 14, 2024

PaulDudaRESPEC commented May 14, 2024

rburghol commented May 14, 2024 • edited Loading

austinorr commented May 14, 2024

rburghol commented May 15, 2024

rburghol commented May 6, 2024 •

edited

Loading

rburghol commented May 7, 2024 •

edited

Loading

rburghol commented May 7, 2024 •

edited

Loading

rburghol commented May 7, 2024 •

edited

Loading

rburghol commented May 10, 2024 •

edited

Loading

rburghol commented May 14, 2024 •

edited

Loading