A collection of Matt and David's commits from #2316 #2491

rwest · 2023-06-23T04:28:14Z

Motivation or Problem

Pull request #2316 which adds electrochemistry, adds lithium, adds solvation to surfaces, adds charge transfer reactions, is rather large and hard to review, and also has some necessarry but unrelated things, that can be helpful elsewhere.

Description of Changes

I've pulled out a bunch (but not all) of those things into this PR so they can be reviewed and merged, then the electrochem PR can be rebased onto it. I also reordered things so the fixups commits are mostly right after the things they're fixing.

One feature that was added was the ability to forbid species and functional groups in input files. As I was trying to add examples to the input files and documentation, I found it was already there, since this feature was added in #2185 - authored around the same time in October 2021. Rather than add a second syntax for the same thing, I reverted the commits on this branch.

Testing

I read the commits as I was cherry-picking, got the test suite to pass, and also added some more unit tests.

codecov · 2023-06-23T07:43:18Z

Codecov Report

Merging #2491 (cfe970f) into main (ef83a1c) will increase coverage by 0.04%.
The diff coverage is 55.73%.

@@            Coverage Diff             @@
##             main    #2491      +/-   ##
==========================================
+ Coverage   48.50%   48.55%   +0.04%     
==========================================
  Files         110      110              
  Lines       30761    30812      +51     
  Branches     8039     8054      +15     
==========================================
+ Hits        14920    14960      +40     
- Misses      14320    14329       +9     
- Partials     1521     1523       +2

Impacted Files	Coverage Δ
rmgpy/data/solvation.py	`56.30% <0.00%> (-0.19%)`	⬇️
rmgpy/rmg/input.py	`27.27% <ø> (ø)`
rmgpy/rmg/model.py	`38.95% <0.00%> (ø)`
rmgpy/yml.py	`12.42% <0.00%> (-0.15%)`	⬇️
rmgpy/rmg/reactors.py	`20.83% <33.33%> (+0.09%)`	⬆️
rmgpy/data/kinetics/family.py	`48.80% <51.42%> (+0.75%)`	⬆️
arkane/common.py	`77.70% <100.00%> (+0.72%)`	⬆️
rmgpy/constraints.py	`78.94% <100.00%> (+1.48%)`	⬆️

... and 3 files with indirect coverage changes

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

rwest · 2023-06-23T14:38:17Z

The only test failures are expected failures of the regression tests because of the random run-to-run variations.

rmgpy/data/kinetics/family.py

rmgpy/rmg/input.py

rmgpy/data/kinetics/family.py

…esorbed species

…ut file

rwest

@mjohnson541 isn't this "make it possible to explicitly forbid molecules and groups in the input file" feature already here thanks to #2185 ?

Besides adding a new, undocumented, name for it, not sure what this commit adds? Can you use the existing mechanism in your input files and we drop this commit?

I see you authored this commit a couple of weeks before PR #2185 was merged, so you're forgiven for not using it back then :)

… the input file" This reverts commit 92fad47. also "fig adjacencyListGroup in input" which was a fixup. The functionality has been available for 2 years since ReactionMechanismGenerator#2185 so adding a new syntax for doing it would be confusing and unnecessarry.

SurfaceChargeTransfer and ArrheniusChargeTransfer are not yet defined (on this branch). Revert this commin once they are.

For now it just tests Arrhenius types, but should eventually work on others.

maximumSurfaceSites and maximumSurfaceBondOrder

Doesn't make a real difference, just easier for a human to parse the code without making errors, when the "if" blocks are in the same order as the documentation and examples.

JacksonBurns

Looks good overall, have requested some small changes for good code style and clarity.

arkane/common.py

JacksonBurns · 2023-07-14T16:57:25Z

rmgpy/data/kinetics/family.py

+    count = 0
+    for kinetics in kinetics_list:
+        count += 1
+        logA += np.log10(kinetics.A.value_si)
+        n += kinetics.n.value_si
+        Ea += kinetics.Ea.value_si
+
+    logA /= count
+    n /= count
+    Ea /= count


We should just use Python's built in average, or numpy's. i.e. np.mean([i.n.value_si for i in kinetics_list]) (and include the log10 for geometric, or use a method that just does geometric mean).

But we'd have to iterate through the list multiple times (once for each attribute) and create a bunch of lists. Is this a verbosity thing? an efficiency thing? a less-likely-to-make-mistakes thing?
I don't like playing code golf for the sake of it.

But we'd have to iterate through the list multiple times (once for each attribute) and create a bunch of lists.

You can do it very efficiently in one line like this:

from types import SimpleNamespace import numpy as np # mimic the kinetics objects kinetics_list = [SimpleNamespace(a=1, b=2, c=3) for _ in range(3)] a, b, c = np.mean([[np.log10(i.a), i.b, i.c] for i in kinetics_list], axis=1)

Is this a verbosity thing? an efficiency thing? a less-likely-to-make-mistakes thing? I don't like playing code golf for the sake of it.

This is an antipattern, which we benefit from removing for all of these reasons. Numpy will be faster, less code is easier to maintain and read, etc.

A hole in one!

I quite like it. Although, having now taught programming a few times, I am learning to appreciate the benefits of code that a novice can understand at a glance. More participation, fewer mistakes, less time spent explaining or parsing, etc. etc.

Anyway, the main reason to not do it your way here and now is that we soon need to revert this commit
2180175
and the merge conflicts would be horrible to resolve.

We can revisit optimizing this code after that happens.
(Though I think this code is seldom called)

rmgpy/rmg/model.py

rmgpy/data/kinetics/family.py

If you happen to pick the first of the list then the or or method was faster. But if you pick the last of the list (or not in the list) then the in tuple method is faster. Also changed some lists to sets for consistency, and because they're a touch faster. See pylint https://pylint.pycqa.org/en/latest/user_guide/messages/refactor/consider-using-in.html (Change requested in code review) Also switched a format to an fstring.

They describe some kinetics comments that were moved in 921d425 and removed in 22e6a13

Must be left over from the PEP8-ify project.

The only way r would be 'None' is if a symmetrical reaction (reactants == products) was found in a library, and hit this check in check_for_existing_reaction that's called by make_new_reaction. But that suggests some other problem. So I doubt this test is often needed. But Matt added it for something. I can't see any way it would be False, so now we check it's not None. (as requested in code review)

Unfortunately we need to revert the commit titled "Remove charge transfer types from average_kinetics (to be reverted)" at some point in the not so distant future, and doing this different averaging scheme now would mess that up.

rwest · 2023-07-14T23:30:15Z

Think I've addressed all your comments. I pushed back on one suggestion - though I like it - because it'll cause horrible merge conflicts soon because that method needs to be more complicated than it currently looks. But the rest I adopted.

JacksonBurns

LGTM!

rwest requested a review from mjohnson541 June 23, 2023 04:28

rwest commented Jun 23, 2023

View reviewed changes

rmgpy/data/kinetics/family.py Outdated Show resolved Hide resolved

rwest commented Jun 23, 2023

View reviewed changes

rmgpy/rmg/input.py Outdated Show resolved Hide resolved

JacksonBurns reviewed Jun 23, 2023

View reviewed changes

rmgpy/data/kinetics/family.py Outdated Show resolved Hide resolved

rwest marked this pull request as draft July 2, 2023 14:08

mjohnson541 and others added 7 commits July 2, 2023 15:08

add functionality for constant species in Reactor classes

34ed5a3

For surface species calculate solvent correction from the saturated d…

a6742e9

…esorbed species

added max_surface_bond_order constraint

49bb3c1

fix bug with max_surface_sites specification

7cee142

handle case when symbol is in all caps

5badbd1

make it possible to explicitly forbid molecules and groups in the inp…

92fad47

…ut file

fig adjacencyListGroup in input

54d9e72

rwest force-pushed the matt branch 2 times, most recently from 6727b61 to 5883156 Compare July 5, 2023 09:23

rwest commented Jul 14, 2023

View reviewed changes

rwest and others added 10 commits July 14, 2023 12:03

add comment attribute to species and reactions in rms yaml file

20adbd1

when rule making is done on one processor don't use pool

4425ac5

add function for averaging kinetics

7d527a3

check reaction exists before checking coverage_dependence

23d88f2

Replace Asserts with ValueErrors

0a6c456

Remove charge transfer types from average_kinetics (to be reverted)

2180175

SurfaceChargeTransfer and ArrheniusChargeTransfer are not yet defined (on this branch). Revert this commin once they are.

Unit test for average_kinetics method

9609275

For now it just tests Arrhenius types, but should eventually work on others.

Add documentation (examples) for recently added species constraints.

88adf99

maximumSurfaceSites and maximumSurfaceBondOrder

Check species constraints in consistent order.

f8fdf43

Doesn't make a real difference, just easier for a human to parse the code without making errors, when the "if" blocks are in the same order as the documentation and examples.

rwest marked this pull request as ready for review July 14, 2023 16:06

rwest requested a review from JacksonBurns July 14, 2023 16:09

rwest force-pushed the matt branch from 5883156 to 82239a1 Compare July 14, 2023 16:09

JacksonBurns requested changes Jul 14, 2023

View reviewed changes

rwest force-pushed the matt branch from 0b86510 to 9cb62f1 Compare July 14, 2023 23:24

rwest added 7 commits July 14, 2023 19:24

Unit test for maximumSurfaceBondOrder species constraint.

f9991fe

Remove obsolete comments.

409b39d

They describe some kinetics comments that were moved in 921d425 and removed in 22e6a13

Update attribute name in a comment.

6071531

Must be left over from the PEP8-ify project.

Comment an elegant alternative averaging scheme.

4013707

Unfortunately we need to revert the commit titled "Remove charge transfer types from average_kinetics (to be reverted)" at some point in the not so distant future, and doing this different averaging scheme now would mess that up.

Simplify the atomic symbol to number conversion.

cfe970f

rwest force-pushed the matt branch from 9cb62f1 to cfe970f Compare July 14, 2023 23:25

JacksonBurns self-requested a review July 14, 2023 23:33

JacksonBurns approved these changes Jul 14, 2023

View reviewed changes

rwest merged commit 215eb1b into ReactionMechanismGenerator:main Jul 15, 2023
4 of 6 checks passed

rwest deleted the matt branch July 15, 2023 02:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A collection of Matt and David's commits from #2316 #2491

A collection of Matt and David's commits from #2316 #2491

rwest commented Jun 23, 2023 •

edited

codecov bot commented Jun 23, 2023 •

edited

rwest commented Jun 23, 2023

rwest left a comment

JacksonBurns left a comment

JacksonBurns Jul 14, 2023

rwest Jul 14, 2023

JacksonBurns Jul 14, 2023

rwest Jul 14, 2023

rwest commented Jul 14, 2023

JacksonBurns left a comment

A collection of Matt and David's commits from #2316 #2491

A collection of Matt and David's commits from #2316 #2491

Conversation

rwest commented Jun 23, 2023 • edited

Motivation or Problem

Description of Changes

Testing

codecov bot commented Jun 23, 2023 • edited

Codecov Report

rwest commented Jun 23, 2023

rwest left a comment

Choose a reason for hiding this comment

JacksonBurns left a comment

Choose a reason for hiding this comment

JacksonBurns Jul 14, 2023

Choose a reason for hiding this comment

rwest Jul 14, 2023

Choose a reason for hiding this comment

JacksonBurns Jul 14, 2023

Choose a reason for hiding this comment

rwest Jul 14, 2023

Choose a reason for hiding this comment

rwest commented Jul 14, 2023

JacksonBurns left a comment

Choose a reason for hiding this comment

rwest commented Jun 23, 2023 •

edited

codecov bot commented Jun 23, 2023 •

edited