Speed up columns slices #900

Mr-Geekman · 2022-09-01T14:32:11Z

Before submitting (must do checklist)

Did you read the contribution guide?
Did you update the docs? We use Numpy format for all the methods and classes.
Did you write any new necessary tests?
Did you update the CHANGELOG?

Proposed Changes

Look #775.

Closing issues

Closes #775.

github-actions · 2022-09-01T14:35:33Z

🚀 Deployed on https://deploy-preview-900--etna-docs.netlify.app

codecov-commenter · 2022-09-01T16:21:10Z

Codecov Report

Merging #900 (1f662fa) into master (18a064a) will increase coverage by 0.03%.
The diff coverage is 96.87%.

@@            Coverage Diff             @@
##           master     #900      +/-   ##
==========================================
+ Coverage   84.95%   84.99%   +0.03%     
==========================================
  Files         133      133              
  Lines        7593     7611      +18     
==========================================
+ Hits         6451     6469      +18     
  Misses       1142     1142

Impacted Files	Coverage Δ
etna/transforms/math/apply_lambda.py	`92.00% <71.42%> (ø)`
etna/datasets/__init__.py	`100.00% <100.00%> (ø)`
etna/datasets/tsdataset.py	`90.74% <100.00%> (-0.03%)`	⬇️
etna/datasets/utils.py	`95.91% <100.00%> (+1.47%)`	⬆️
etna/transforms/feature_selection/filter.py	`100.00% <100.00%> (ø)`
etna/transforms/math/add_constant.py	`97.95% <100.00%> (ø)`
etna/transforms/math/lags.py	`100.00% <100.00%> (ø)`
etna/transforms/math/log.py	`96.15% <100.00%> (ø)`
etna/transforms/math/sklearn.py	`95.49% <100.00%> (+0.16%)`	⬆️
etna/transforms/math/statistics.py	`99.28% <100.00%> (+<0.01%)`	⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

martins0n

Maybe we should benchmarks?
Sorts everywhere could lead to bottlenecks.

martins0n · 2022-09-04T20:31:33Z

.github/workflows/test.yml

+          virtualenvs-create: true
+          virtualenvs-in-project: true
+
+      - name: Load cached venv


We shouldn't use cache there -- it could be corrupted.

martins0n · 2022-09-04T20:32:23Z

.github/workflows/test.yml

+          path: .venv
+          key: venv-${{ runner.os }}-3.8-${{ hashFiles('**/poetry.lock') }}
+
+      - name: Install dependencies


As a result of caching we will run the same test case without pandas installing.

martins0n · 2022-09-04T20:34:04Z

.github/workflows/test.yml

+      - name: Install dependencies
+        if: steps.cached-poetry-dependencies.outputs.cache-hit != 'true'
+        run: |
+          poetry add "pandas${{ matrix.pandas-version }}"


I guess installing via poetry add will lead to long dependency resolving.

We should install it via pip after poetry install.

martins0n · 2022-09-04T20:37:18Z

make_bug.py

@@ -0,0 +1,57 @@
+import numpy as np


Hm, what is the purpose of this script?

martins0n · 2022-09-04T20:38:33Z

etna/transforms/math/statistics.py

@@ -73,7 +73,8 @@ def transform(self, df: pd.DataFrame) -> pd.DataFrame:
        history = self.seasonality * self.window if self.window != -1 else len(df)
        segments = sorted(df.columns.get_level_values("segment").unique())

-        x = df.loc[pd.IndexSlice[:], pd.IndexSlice[segments, self.in_column]].values[::-1]
+        df_slice = df.loc[pd.IndexSlice[:], pd.IndexSlice[:, self.in_column]].sort_index(axis=1)


I guess, we don't need pd.IndexSlice[:]

Mr-Geekman · 2022-09-06T07:27:45Z

I added report with benchmark into etna-experiments, PR: ISSUE - 775.

martins0n

👍

We need to update changelog

d.a.bunin added 6 commits August 25, 2022 15:47

Intermediate results

b7a5281

Changed test file

7d0d51d

Add set_columns_wide

0275539

Remove segments selection in math transforms

13c6e44

Merge remote-tracking branch 'origin/master' into issue-775

5eefb11

Fix few more segments selection

c0ecc88

Mr-Geekman self-assigned this Sep 1, 2022

Mr-Geekman marked this pull request as draft September 1, 2022 14:32

github-actions bot temporarily deployed to pull request September 1, 2022 14:35 Inactive

Try new ci

bf8189f

github-actions bot temporarily deployed to pull request September 1, 2022 15:42 Inactive

Fix ci

29fcf46

github-actions bot temporarily deployed to pull request September 1, 2022 16:04 Inactive

Mr-Geekman marked this pull request as ready for review September 1, 2022 16:38

martins0n self-requested a review September 4, 2022 20:27

martins0n suggested changes Sep 4, 2022

View reviewed changes

d.a.bunin added 3 commits September 5, 2022 15:48

Remove extra file with a bug

1452ea0

Remove caching in CI

38b7e1f

Remove redundant IndexSlice

838fa14

github-actions bot temporarily deployed to pull request September 5, 2022 12:57 Inactive

Mr-Geekman requested a review from martins0n September 6, 2022 08:26

Merge branch 'master' into issue-775

3584c86

github-actions bot temporarily deployed to pull request September 6, 2022 10:28 Inactive

martins0n previously approved these changes Sep 6, 2022

View reviewed changes

Update changelog

282cdb4

github-actions bot temporarily deployed to pull request September 7, 2022 07:22 Inactive

martins0n approved these changes Sep 7, 2022

View reviewed changes

Merge branch 'master' into issue-775

1f662fa

github-actions bot temporarily deployed to pull request September 7, 2022 07:34 Inactive

Mr-Geekman merged commit 533fce8 into master Sep 7, 2022

Mr-Geekman deleted the issue-775 branch September 7, 2022 08:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up columns slices #900

Speed up columns slices #900

Mr-Geekman commented Sep 1, 2022 •

edited

Loading

github-actions bot commented Sep 1, 2022 •

edited

Loading

codecov-commenter commented Sep 1, 2022 •

edited

Loading

martins0n left a comment

martins0n Sep 4, 2022

martins0n Sep 4, 2022

martins0n Sep 4, 2022

martins0n Sep 4, 2022

martins0n Sep 4, 2022

Mr-Geekman commented Sep 6, 2022

martins0n left a comment

Speed up columns slices #900

Speed up columns slices #900

Conversation

Mr-Geekman commented Sep 1, 2022 • edited Loading

Before submitting (must do checklist)

Proposed Changes

Closing issues

github-actions bot commented Sep 1, 2022 • edited Loading

codecov-commenter commented Sep 1, 2022 • edited Loading

Codecov Report

martins0n left a comment

Choose a reason for hiding this comment

martins0n Sep 4, 2022

Choose a reason for hiding this comment

martins0n Sep 4, 2022

Choose a reason for hiding this comment

martins0n Sep 4, 2022

Choose a reason for hiding this comment

martins0n Sep 4, 2022

Choose a reason for hiding this comment

martins0n Sep 4, 2022

Choose a reason for hiding this comment

Mr-Geekman commented Sep 6, 2022

martins0n left a comment

Choose a reason for hiding this comment

Mr-Geekman commented Sep 1, 2022 •

edited

Loading

github-actions bot commented Sep 1, 2022 •

edited

Loading

codecov-commenter commented Sep 1, 2022 •

edited

Loading