CollinearityThreshold bug fix and speedup #41

harper357 · 2024-02-06T19:10:10Z

This probably should have some tests along with it, but I have wanted to get a PR started with my suggested changes.

…prehensions

…s an out of range error on the `index[0]` call

ThomasBury

more pythonic, tested locally, all good

ThomasBury

A remaining component of the previously used symmetrical version. Sorting post-operation is preferred for clarity, but pandas' index-based approach maintain consistency (no bug)

ThomasBury

While dividing isn't essential, skipping it could be confusing. We're aiming to find the feature with the highest collinearity. First, we combine the upper and lower halves of the association matrix (since it's not symmetrical). Then, dividing by 2 gives the average and ensures the values never exceed 1, avoiding misinterpretations.

harper357 added 5 commits February 6, 2024 10:41

rewrite cols_to_drop & rows_to_drop to use df.loc instead of list com…

b83e523

…prehensions

added a return if there are no more features to drop. It also prevent…

25b19fa

…s an out of range error on the `index[0]` call

correct most_collinear_series calculation

5d842a2

remove not needed division by a constant.

2131b6c

added comment about possibly removing if statement

f550a79

ThomasBury self-assigned this Feb 7, 2024

ThomasBury added the bug Something isn't working label Feb 9, 2024

ThomasBury approved these changes Feb 9, 2024

View reviewed changes

ThomasBury requested changes Feb 9, 2024

View reviewed changes

ThomasBury approved these changes Feb 9, 2024

View reviewed changes

ThomasBury merged commit 985186d into ThomasBury:main Feb 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CollinearityThreshold bug fix and speedup #41

CollinearityThreshold bug fix and speedup #41

harper357 commented Feb 6, 2024

ThomasBury left a comment

ThomasBury left a comment

ThomasBury left a comment

CollinearityThreshold bug fix and speedup #41

CollinearityThreshold bug fix and speedup #41

Conversation

harper357 commented Feb 6, 2024

ThomasBury left a comment

Choose a reason for hiding this comment

ThomasBury left a comment

Choose a reason for hiding this comment

ThomasBury left a comment

Choose a reason for hiding this comment