Merged
Some complementary groups were not actually complementary; this ensures a complementary group is not used unless it is truly complementary.
Fixed small bug in atom set handling in extension generation
Fixed bug in complementary group handling. Complementary groups generated from bond-creation extensions were still treated as complementary by node generation even when they were not complementary for the training set. The change checks whether a group is complementary for the training data at that node, and only adds it as a complementary node if it is complementary with respect to that node's associated training data.
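The check described above can be sketched roughly as follows. This is an illustrative sketch only, not the actual codebase API: the names (`complementary_for`, `add_complementary_nodes`, `node.training_data`, groups as plain sets) are assumptions for demonstration.

```python
def complementary_for(group, complement, data):
    """A pair of groups is complementary for a dataset if every
    datapoint matches exactly one of the two groups (XOR)."""
    return all((x in group) != (x in complement) for x in data)


def add_complementary_nodes(node, candidate_pairs):
    """Keep only pairs that are truly complementary for the training
    data associated with THIS node, rather than pairs that were
    complementary for the extensions they were generated from."""
    kept = []
    for group, complement in candidate_pairs:
        if complementary_for(group, complement, node.training_data):
            kept.append((group, complement))
    return kept
```

The key point of the fix is that complementarity is re-verified against the node's own training data at node-generation time, instead of being inherited from the bond-creation step.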
Removed a duplicate line of code
Small improvement to dictionary generation
Add weighting of multi-evaluation regressor node selection by occurrence (along with uncertainty) and make it the default. The application I originally developed the algorithm for involved training data distributed differently from the prediction cases, but in most applications one should assume the training and prediction distributions are the same. This change matters little for larger training sets and does not always improve model performance, but it makes a very significant difference in the consistency of model performance in the <1000-datapoint regime.
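The occurrence-plus-uncertainty weighting could look something like the sketch below. This is a hypothetical illustration, not the PR's actual implementation: the scoring formula and the `(occurrences, uncertainty)` node representation are assumptions.

```python
def select_node(nodes):
    """Among candidate regressor nodes for a prediction case, favor
    nodes that occurred often in training (high occurrence count)
    and have low predictive uncertainty. Each node is represented
    here as an (occurrences, uncertainty) pair.

    Selecting purely by lowest uncertainty can prefer rarely-seen
    nodes whose uncertainty estimate is itself unreliable; dividing
    by uncertainty while multiplying by occurrence assumes the
    training distribution matches the prediction distribution.
    """
    def score(node):
        occurrences, uncertainty = node
        return occurrences / (uncertainty + 1e-12)  # avoid div-by-zero

    return max(nodes, key=score)
```

Under this scoring, a node seen 100 times with moderate uncertainty beats a node seen twice with very low uncertainty, which is the behavior the change aims for on small training sets.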