Time Series Split #163

Srinidhi-Patil · 2020-04-02T17:06:00Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Time Series Cross-Validation:
Implemented Time Series Split and Blocking Time Series Split with Tests

Any other comments?

deatinor · 2020-04-03T12:53:23Z

gtime/model_selection/cross_validation.py

+
+def time_series_split(time_series: pd.DataFrame, n_splits=4, split_on='index'):
+   """
+   Split the input DataFrame into n_splits. If the data is not a timeries then the split


gtime/model_selection/cross_validation.py

gtime/model_selection/tests/test_cross_validation.py

gtime/model_selection/cross_validation.py

deatinor · 2020-04-03T13:26:47Z

gtime/model_selection/cross_validation.py

+         next_date = start_date + pd.Timedelta(split_length)
+
+         for split in range(n_splits - 1):
+            time_fold = time_series[(time_series.index >= start_date) & (time_series.index < next_date)]


I would do these operations diectly on the index, so that you don't copy the whole time series

What do you mean by 'copy the whole time series' ?
Are you suggesting that I should not assign it and directly send the yield like below

for split in range(n_splits - 1): yield time_series[(time_series.index >= start_date) & (time_series.index < next_date)].index next_date += pd.Timedelta(split_length) yield time_series[0:].index

First select the index and then compute the folds on it.
This line: time_series[(time_series.index >= start_date) & (time_series.index < next_date)] copies the all dataframe, data included

deatinor · 2020-04-03T13:32:51Z

gtime/model_selection/cross_validation.py

+
+def time_series_split(time_series: pd.DataFrame, n_splits=4, split_on='index'):
+   """
+   Split the input DataFrame into n_splits. If the data is not a timeries then the split


In general, it is not super clear the description

Implemented Blocking Time Series Split

Srinidhi-Patil added 5 commits April 2, 2020 22:30

Initial TsCV

70ca44b

Implementation for Time Series Split

9c7c510

Implementation for Time Series Split

619533f

Tests for Time Series Split

c6e69ec

Corrected Docstrings

99c0a49

deatinor reviewed Apr 3, 2020

View reviewed changes

Srinidhi-Patil and others added 4 commits April 3, 2020 21:59

Review Changes 1

349f5d6

Additional Tests, Review changes

d33d1cd

Implemented Blocking Time Series Split

51d12ae

Merge pull request #2 from Srinidhi-Patil/blocking_tss

ce0f4cc

Implemented Blocking Time Series Split

deatinor merged commit 380d356 into giotto-ai:master Apr 9, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Time Series Split #163

Time Series Split #163

Srinidhi-Patil commented Apr 2, 2020 •

edited

deatinor Apr 3, 2020

Srinidhi-Patil Apr 7, 2020

deatinor Apr 3, 2020

Srinidhi-Patil Apr 3, 2020 •

edited

deatinor Apr 3, 2020

Srinidhi-Patil Apr 6, 2020

deatinor Apr 3, 2020

Srinidhi-Patil Apr 7, 2020

Time Series Split #163

Time Series Split #163

Conversation

Srinidhi-Patil commented Apr 2, 2020 • edited

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

deatinor Apr 3, 2020

Choose a reason for hiding this comment

Srinidhi-Patil Apr 7, 2020

Choose a reason for hiding this comment

deatinor Apr 3, 2020

Choose a reason for hiding this comment

Srinidhi-Patil Apr 3, 2020 • edited

Choose a reason for hiding this comment

deatinor Apr 3, 2020

Choose a reason for hiding this comment

Srinidhi-Patil Apr 6, 2020

Choose a reason for hiding this comment

deatinor Apr 3, 2020

Choose a reason for hiding this comment

Srinidhi-Patil Apr 7, 2020

Choose a reason for hiding this comment

Srinidhi-Patil commented Apr 2, 2020 •

edited

Srinidhi-Patil Apr 3, 2020 •

edited