Reformatting and naming #94

rosecers · 2021-04-08T14:45:46Z

This is an on-going PR with a few goals (from https://scikit-learn.org/stable/developers/develop.html#coding-guidelines and elsewhere)

Some of these things I anticipate will take place in other PRs, such as the docstring reformatting in #82 and the everything reformatting in #93.

examples/PCovR.ipynb

Luthaf · 2021-04-08T14:59:19Z

Merging this PR will be a bit painful, since it might introduce a lot conflicts with other PR. Should we try to get this one through as quickly as possible (and rebase other PR), or wait for other PR to be merged first and rebase this one?

EDIT: missed you edit in the top comment 😃. So the plan is to do all of this piecewise? That sounds good!

rosecers · 2021-04-08T15:00:43Z

Merging this PR will be a bit painful, since it might introduce a lot conflicts with other PR. Should we try to get this one through as quickly as possible (and rebase other PR), or wait for other PR to be merged first and rebase this one?

This one is still a WIP. I'd like #93 and #82 in first.

rosecers · 2021-04-09T09:18:25Z

So I think my plan is this, because this checklist was easier to get through than I anticipated. Given that skcosmo/feature_selection, skcosmo/sample_selection and skcosmo/preprocessing are all going to change monumentally with #93 and #82, I'll hold off merging this until those are in. However, I'll make sure to enforce these formatting rules in those (and all future PR's) so that rebasing will be easy. @Luthaf , thoughts?

Luthaf · 2021-04-09T09:25:45Z

I'll make sure to enforce these formatting rules in those

Enforce them during the review you mean? Or automatically? I'm good if we enforce them manually during a review, using automatic tools would be harder.

rosecers · 2021-04-09T09:26:43Z

I'll make sure to enforce these formatting rules in those

Enforce them during the review you mean? Or automatically? I'm good if we enforce them manually during a review, using automatic tools would be harder.

100% agree. I'll add to our internal resources this list for comparison

rosecers · 2021-04-09T09:27:47Z

Note that not touching the files in #82 and #93 will cause this to fail CI until rebase.

added atol, added tests for atol and rtol, expanded and unified documentation, added description of methods change the default variables and docs Applying formatting consistent with #94

added atol, added tests for atol and rtol, expanded and unified documentation, added description of methods replaced the absolute tolerance with a relative tolerance added atol, added tests for atol and rtol, expanded and unified documentation, added description of methods change the default variables and docs Applying formatting consistent with #94 added copy parameter

* replaced the absolute tolerance with a relative tolerance added atol, added tests for atol and rtol, expanded and unified documentation, added description of methods replaced the absolute tolerance with a relative tolerance added atol, added tests for atol and rtol, expanded and unified documentation, added description of methods change the default variables and docs Applying formatting consistent with #94 added copy parameter * Changed rtol and atol defaults Co-authored-by: rosecers <rosecersonsky@gmail.com>

agoscinski

Looks good.

I think only preprocessing/flexible_scaler.py should be renamed to preprocessing/_data.py if we follow sklearn with their StandardScaler, what do you think?
https://github.com/scikit-learn/scikit-learn/blob/95119c13a/sklearn/preprocessing/_data.py#L564

EDIT: It can also exist in its own file, I just think it should also have an underscore following the naming convention of sklearn

rosecers · 2021-04-20T14:15:50Z

Looks good.

I think only preprocessing/flexible_scaler.py should be renamed to preprocessing/_data.py if we follow sklearn with their StandardScaler, what do you think?
https://github.com/scikit-learn/scikit-learn/blob/95119c13a/sklearn/preprocessing/_data.py#L564

EDIT: It can also exist in its own file, I just think it should also have an underscore following the naming convention of sklearn

Ooh good catch! I was planning on this, following #91, just hadn't remembered to do so.

agoscinski · 2021-04-21T09:28:36Z

Do we use sklearn utilities where warranted? https://scikit-learn.org/stable/developers/utilities.html#developers-utils

I think we are not doing this in the moment in the test

If your code relies on a random number generator, it should never use functions like numpy.random.random or numpy.random.normal. This approach can lead to repeatability issues in unit tests. Instead, a numpy.random.RandomState object should be used, which is built from a random_state argument passed to the class or function. The function check_random_state, below, can then be used to create a random number generator object.

namely in

test_linear_model.py (random.seed)
test_metrics.py (random.seed)
test_kernel_normalizer.py (random.uniform)
test_orthogonalizers.py (random.uniform)
test_sparse_kernel_centerer.py (random.uniform)
test_standard_flexible_scaler.py (random.uniform)
skcosmo/sample_selection/_voronoi_fps.py (random.randint)
skcosmo/_selection.py (random.randint)

Its a bit frustrating that they only mention that this could lead to issues, but don't say what kind of issues. But it make sense to have a consistent random state functionality. I already have changed test_metrics.py and test_linear_model.py because these one require a random orthogonal matrix, so I had to switch to use sklearn.utils.extmath.randomized_range_finder, since RandomState seed does not effect numpy seeds.

I think the other points don't match any part of the code.

EDIT: Sorry I misclicked when commenting

…latest/format.html#numpydoc-docstring-guide

…nd made requisite changes Renaming utils with leading underscore

…bute on the instance. In init, there should be no logic, not even input validation, and the parameters should not be changed. The corresponding logic should be put where the parameters are used, typically in fit. Adding fit before checking alpha

checked for absolute imports in tests checked for import * anywhere

rosecers · 2021-04-27T09:45:53Z

@Luthaf something fishy is going on with black here. On both my source-built scikit-cosmos, black passes, but doesn't in CI. Thoughts?

Changed all random instances to RandomStates

agoscinski

looks good to me

Luthaf reviewed Apr 8, 2021

View reviewed changes

examples/PCovR.ipynb Outdated Show resolved Hide resolved

rosecers force-pushed the reformatting_and_naming branch 6 times, most recently from 6c373b4 to c59f946 Compare April 9, 2021 08:33

rosecers force-pushed the reformatting_and_naming branch from 302a8e6 to 3427c03 Compare April 9, 2021 09:27

rosecers added a commit that referenced this pull request Apr 9, 2021

Applying formatting consistent with #94

ddfec11

rosecers force-pushed the reformatting_and_naming branch 2 times, most recently from 9d2415d to 3135583 Compare April 9, 2021 15:29

rosecers force-pushed the reformatting_and_naming branch from 3135583 to 0dbbdc0 Compare April 16, 2021 13:45

rosecers marked this pull request as ready for review April 16, 2021 14:27

agoscinski reviewed Apr 20, 2021

View reviewed changes

rosecers mentioned this pull request Apr 20, 2021

Re-organize tests into module sub-folders #106

Closed

3 tasks

rosecers force-pushed the reformatting_and_naming branch from 0dbbdc0 to 28ea13b Compare April 20, 2021 14:15

rosecers requested a review from agoscinski April 21, 2021 07:22

agoscinski closed this Apr 21, 2021

agoscinski reopened this Apr 21, 2021

rosecers added 10 commits April 27, 2021 11:32

Reformatting tests ala sklearn

b39173c

Reformatting docstrings to follow https://numpydoc.readthedocs.io/en/…

e4ffe3d

…latest/format.html#numpydoc-docstring-guide

Changed decomposition to underscore naming, KPCovR --> KernelPCovR, a…

dd5545d

…nd made requisite changes Renaming utils with leading underscore

Only relative imports inside skcosmo

8c27a6c

checked for absolute imports in tests checked for import * anywhere

Checking documentation types

7dd32bf

Checking default values

e467d07

Checking attribute names

9fb1f4b

Make necessary changes to documentation

ca044df

Moving _flexible_scaler to _data

35670ff

rosecers force-pushed the reformatting_and_naming branch from 4657bf4 to baa42f9 Compare April 27, 2021 09:34

change numpy.random.seed to RandomState seed

5afaf03

Changed all random instances to RandomStates

rosecers force-pushed the reformatting_and_naming branch from baa42f9 to 5afaf03 Compare April 27, 2021 10:13

agoscinski approved these changes Apr 27, 2021

View reviewed changes

rosecers merged commit 55d80f3 into main Apr 27, 2021

rosecers deleted the reformatting_and_naming branch April 27, 2021 10:37

rosecers mentioned this pull request Apr 27, 2021

Folder and file naming convention #49

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reformatting and naming #94

Reformatting and naming #94

rosecers commented Apr 8, 2021 •

edited

Luthaf commented Apr 8, 2021 •

edited

rosecers commented Apr 8, 2021

rosecers commented Apr 9, 2021

Luthaf commented Apr 9, 2021

rosecers commented Apr 9, 2021

rosecers commented Apr 9, 2021

agoscinski left a comment •

edited

rosecers commented Apr 20, 2021

agoscinski commented Apr 21, 2021 •

edited

rosecers commented Apr 27, 2021

agoscinski left a comment

Reformatting and naming #94

Reformatting and naming #94

Conversation

rosecers commented Apr 8, 2021 • edited

Luthaf commented Apr 8, 2021 • edited

rosecers commented Apr 8, 2021

rosecers commented Apr 9, 2021

Luthaf commented Apr 9, 2021

rosecers commented Apr 9, 2021

rosecers commented Apr 9, 2021

agoscinski left a comment • edited

Choose a reason for hiding this comment

rosecers commented Apr 20, 2021

agoscinski commented Apr 21, 2021 • edited

rosecers commented Apr 27, 2021

agoscinski left a comment

Choose a reason for hiding this comment

rosecers commented Apr 8, 2021 •

edited

Luthaf commented Apr 8, 2021 •

edited

agoscinski left a comment •

edited

agoscinski commented Apr 21, 2021 •

edited