
ENH Add option n_splits='walk_forward' in TimeSeriesSplit #23780

Open · wants to merge 23 commits into base: main
Conversation

@ShehanAT (Contributor) commented Jun 28, 2022

Reference Issues/PRs

Fixes #22523

What does this implement/fix? Explain your changes.

This pull request adds the feature requested in issue #22523: "walk_forward" for the TimeSeriesSplit class.

Based on what I understand about this feature, here are some important details:

  • Walking forward means the first element of the first train set starts at 0, and the subsequent train and test indices advance through the series while adhering to the max_train_size and test_size values. For example:

    # Walk Forward Example where max_train_size = 10, test_size = 2, x.shape[0] = 15 
    # n_splits is automatically computed to 3 in this case 
    TRAIN:  [0 1 2 3 4 5 6 7 8] TEST:  [ 9 10]
    TRAIN:  [ 1  2  3  4  5  6  7  8  9 10] TEST:  [11 12]
    TRAIN:  [ 3  4  5  6  7  8  9 10 11 12] TEST:  [13 14]
    
  • To make walking forward possible, the n_splits value is set to "walk_forward":
    cv = TimeSeriesSplit(n_splits="walk_forward", x_shape=x.shape[0], max_train_size=10, test_size=2)
    However, since this parameter was originally designed to accept only integer values, I had to modify the constructor of the _BaseKFold() super class. The x.shape[0] value from x = np.arange(15) also needs to be passed into the constructor so that the n_splits value can be computed automatically.

  • Once n_splits is set to "walk_forward", the find_walk_forward_n_splits_value() method is called to calculate an appropriate integer value for n_splits so that walking forward is possible. If there are multiple candidate n_splits values, the first element of the array containing them is returned:

    if len(n_splits_arrays_first_element_zero) > 0:
        return n_splits_arrays_first_element_zero[0]
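
For reference, the windowing in the example above can be reproduced with a small pure-Python sketch of TimeSeriesSplit's index arithmetic (time_series_splits is a hypothetical helper, not part of this PR, and n_splits=3 is passed explicitly here rather than derived automatically):

```python
def time_series_splits(n_samples, n_splits, max_train_size=None, test_size=None, gap=0):
    """Yield (train_indices, test_indices) pairs, mimicking TimeSeriesSplit.split."""
    n_folds = n_splits + 1
    test_size = test_size if test_size is not None else n_samples // n_folds
    # test folds tile the tail of the series, one fold per split
    for test_start in range(n_samples - n_splits * test_size, n_samples, test_size):
        train_end = test_start - gap
        if max_train_size is not None and max_train_size < train_end:
            train = list(range(train_end - max_train_size, train_end))
        else:
            train = list(range(train_end))
        yield train, list(range(test_start, test_start + test_size))

for train, test in time_series_splits(15, 3, max_train_size=10, test_size=2):
    print("TRAIN:", train, "TEST:", test)
```

With these inputs the sketch yields the three folds shown above: trains starting at 0, 1, and 3, with tests [9, 10], [11, 12], and [13, 14].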
    

Any other comments?

I have manually regression-tested the original functionality and it seems to work fine.
There are two tests in the sklearn/model_selection/_split.py file that are failing (I'm looking into why now).

Let me know if this potential solution works, needs modifications or if you have any feedback.

n_splits = self.n_splits
n_folds = n_splits + 1
gap = self.gap
test_size = (
    self.test_size if self.test_size is not None else n_samples // n_folds
)

Member:

no need for this change

max_train_size=None,
test_size=None,
):
if isinstance(n_splits, int):
Member:

Thinking about it, I think that we can keep the base class as is and make the base class accept a str without raising a warning.

Then, we can just specialize the get_n_splits for the TimeSeriesSplit that should not return directly self.n_splits but instead make the computation of n_splits if a string is provided.

Therefore, we only specialize the TimeSeriesSplit class.

@ShehanAT (Contributor Author), Jun 28, 2022:

OK, I've replaced the previous method with this get_n_splits() method now.

@@ -1066,13 +1103,13 @@ def split(self, X, y=None, groups=None):
"""
X, y, groups = indexable(X, y, groups)
n_samples = _num_samples(X)

Member:

no need for this change

@ShehanAT (Contributor Author), Jun 28, 2022:

Removed

@@ -1066,13 +1103,13 @@ def split(self, X, y=None, groups=None):
"""
X, y, groups = indexable(X, y, groups)
n_samples = _num_samples(X)

n_splits = self.n_splits
Member:

I think that this is here that we can do:

n_splits = self.get_n_splits(...)

Contributor Author:

Yep, added the line: n_splits = self.get_n_splits(X, y, groups) now

def __init__(self, n_splits=5, *, max_train_size=None, test_size=None, gap=0):
super().__init__(n_splits, shuffle=False, random_state=None)
def __init__(
self, n_splits=5, x_shape=None, *, max_train_size=None, test_size=None, gap=0
Member:

We should not need to have x_shape at the initialization.

Having X at the split call should be enough to get this information.

Contributor Author:

I've removed x_shape now

Contributor:

@ShehanAT Consider updating the pull request description as well.

@@ -1101,6 +1138,105 @@ def split(self, X, y=None, groups=None):
indices[test_start : test_start + test_size],
)

def find_walk_forward_n_splits_value(self, x_value, max_train_size, test_size):
Member:

If we replace this with get_n_splits, we will have the following signature:

def get_n_splits(self, X=None, y=None, groups=None)

Since we have X and self, we will have all the necessary information to compute the number of splits required to make the rolling windows.

Contributor Author:

Good idea

TRAIN: [10 11 12] TEST: [13]
TRAIN: [11 12 13] TEST: [14]
"""
x = np.arange(x_value)
Member:

I think that we only need to have:

def get_n_splits(self, X=None, y=None, groups=None):
    if isinstance(self.n_splits, str):
        return X.shape[0] - (self.max_train_size + self.test_size) + 1
    return self.n_splits
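
As a quick check of what this count corresponds to: n_samples - (max_train_size + test_size) + 1 is the number of positions a full train+test window can occupy when it advances one sample at a time (stride 1). A small sketch with hypothetical helper names, not part of the PR:

```python
def n_stride1_windows(n_samples, max_train_size, test_size):
    # count of full train+test windows sliding forward one sample at a time
    window = max_train_size + test_size
    return n_samples - window + 1

def stride1_windows(n_samples, max_train_size, test_size):
    # enumerate those windows explicitly to confirm the count
    window = max_train_size + test_size
    for start in range(n_samples - window + 1):
        train = list(range(start, start + max_train_size))
        test = list(range(start + max_train_size, start + window))
        yield train, test

count = n_stride1_windows(15, 10, 2)
print(count)  # number of stride-1 windows for 15 samples
```

Whether a stride of 1 or a stride of test_size is the intended walk-forward behaviour is debated further down in this thread.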

Contributor Author:

Yep, this is a lot simpler and more readable than the previous method.

@glemaitre (Member) commented Jun 28, 2022

I just wanted to confirm my intuition. Here is a quick diff that I could come up with without making extensive tests:

diff --git a/sklearn/model_selection/_split.py b/sklearn/model_selection/_split.py
index d2a0b5e1fc..609fda7c63 100644
--- a/sklearn/model_selection/_split.py
+++ b/sklearn/model_selection/_split.py
@@ -275,22 +275,21 @@ class _BaseKFold(BaseCrossValidator, metaclass=ABCMeta):
 
     @abstractmethod
     def __init__(self, n_splits, *, shuffle, random_state):
-        if not isinstance(n_splits, numbers.Integral):
-            raise ValueError(
-                "The number of folds must be of Integral type. "
-                "%s of type %s was passed." % (n_splits, type(n_splits))
-            )
-        n_splits = int(n_splits)
-
-        if n_splits <= 1:
+        if isinstance(n_splits, numbers.Integral):
+            if n_splits <= 1:
+                raise ValueError(
+                    "k-fold cross-validation requires at least one"
+                    " train/test split by setting n_splits=2 or more,"
+                    f" got n_splits={n_splits}."
+                )
+        elif n_splits != "walk_forward":
             raise ValueError(
-                "k-fold cross-validation requires at least one"
-                " train/test split by setting n_splits=2 or more,"
-                " got n_splits={0}.".format(n_splits)
+                "n_splits should be an integer number or 'walk_forward' for "
+                "the TimeSeriesSplit cross-validator."
             )
 
         if not isinstance(shuffle, bool):
-            raise TypeError("shuffle must be True or False; got {0}".format(shuffle))
+            raise TypeError(f"shuffle must be True or False; got {shuffle}")
 
         if not shuffle and random_state is not None:  # None is the default
             raise ValueError(
@@ -1037,6 +1036,13 @@ class TimeSeriesSplit(_BaseKFold):
 
     def __init__(self, n_splits=5, *, max_train_size=None, test_size=None, gap=0):
         super().__init__(n_splits, shuffle=False, random_state=None)
+        if self.n_splits == "walk_forward" and (
+            max_train_size is None or test_size is None
+        ):
+            raise ValueError(
+                "If n_splits is 'walk_forward', then max_train_size and test_size must"
+                " be specified."
+            )
         self.max_train_size = max_train_size
         self.test_size = test_size
         self.gap = gap
@@ -1066,7 +1072,7 @@ class TimeSeriesSplit(_BaseKFold):
         """
         X, y, groups = indexable(X, y, groups)
         n_samples = _num_samples(X)
-        n_splits = self.n_splits
+        n_splits = self.get_n_splits(X, y, groups)
         n_folds = n_splits + 1
         gap = self.gap
         test_size = (
@@ -1101,6 +1107,29 @@ class TimeSeriesSplit(_BaseKFold):
                     indices[test_start : test_start + test_size],
                 )
 
+    def get_n_splits(self, X=None, y=None, groups=None):
+        """Returns the number of splitting iterations in the cross-validator
+
+        Parameters
+        ----------
+        X : object
+            Always ignored, exists for compatibility.
+
+        y : object
+            Always ignored, exists for compatibility.
+
+        groups : object
+            Always ignored, exists for compatibility.
+
+        Returns
+        -------
+        n_splits : int
+            Returns the number of splitting iterations in the cross-validator.
+        """
+        if self.n_splits == "walk_forward":
+            return X.shape[0] - (self.max_train_size + self.test_size) + 1
+        return self.n_splits
+
 
 class LeaveOneGroupOut(BaseCrossValidator):
     """Leave One Group Out cross-validator

@ShehanAT (Contributor Author) commented Jun 28, 2022

Thanks for looking into this.
I've made the changes you've requested now.
Let me know if I need to make further modifications.

@glemaitre (Member):

So now, we need to add unit tests to check the different changes and the new behaviour.
We also need an entry in the changelog.
We also need to update the docstring documentation, and I think it would be great to update an example and the user guide.

@ShehanAT (Contributor Author):

I've added an entry to the doc/whats_new/v1.2.rst file and added two unit tests.

Here is my updated docstring for the TimeSeriesSplit class:

    >>> # Add in a 2 period gap
    >>> tscv = TimeSeriesSplit(n_splits=3, test_size=2, gap=2)
    >>> for train_index, test_index in tscv.split(X):
    ...    print("TRAIN:", train_index, "TEST:", test_index)
    ...    X_train, X_test = X[train_index], X[test_index]
    ...    y_train, y_test = y[train_index], y[test_index]
    TRAIN: [0 1 2 3] TEST: [6 7]
    TRAIN: [0 1 2 3 4 5] TEST: [8 9]
    TRAIN: [0 1 2 3 4 5 6 7] TEST: [10 11]
    >>> # Showing rolling window support via `n_splits='walk_forward'`
    >>> x = np.arange(15)
    >>> cv = TimeSeriesSplit(n_splits='walk_forward', max_train_size=10, test_size=3)
    >>> for train_index, test_index in cv.split(x):
    ...     print("TRAIN: ", train_index, "TEST: ", test_index)
    TRAIN:  [0 1 2 3 4 5] TEST:  [6 7 8]
    TRAIN:  [0 1 2 3 4 5 6 7 8] TEST:  [ 9 10 11]
    TRAIN:  [ 2  3  4  5  6  7  8  9 10 11] TEST:  [12 13 14]

    Notes
    -----
    -   The training set has size ``i * n_samples // (n_splits + 1)
        + n_samples % (n_splits + 1)`` in the ``i`` th split,
        with a test set of size ``n_samples//(n_splits + 1)`` by default,
        where ``n_samples`` is the number of samples.
    -   To use the rolling window support, where the train set does not grow
        and the `n_splits` value is computed automatically,
        set `n_splits='walk_forward'`.
    """

Let me know if any changes are required.

    Returns the number of splitting iterations in the cross-validator.
    """
    if self.n_splits == "walk_forward":
        return X.shape[0] - (self.max_train_size + self.test_size) + 1
Review comment:

Am I missing something here?

If n_splits="walk_forward", X.shape[0]=20, self.max_train_size=10, and self.test_size=5 we get a calculated n_splits of 20 - (10 + 5) + 1 = 6. Thus across all splits we have a total test_size of 5 * 6 = 30 which is greater than our number of samples, and as such we will hit a ValueError on line 1099 which is non-descript for the n_splits="walk_forward" use case.

Is it not the expectation that these inputs would yield indices similar to the following?

[0 1 2 3 4] [5 6 7 8 9]
[0 1 2 3 4 5  6  7  8  9 ] [10 11 12 13 14]
[5 6 7 8 9 10 11 12 13 14] [15 16 17 18 19]
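
The arithmetic behind this concern can be checked directly (plain Python, using the values from the comment above):

```python
n_samples, max_train_size, test_size = 20, 10, 5

# the formula under discussion
n_splits = n_samples - (max_train_size + test_size) + 1  # 20 - 15 + 1 = 6
total_test = n_splits * test_size                        # 6 * 5 = 30

# 30 test samples cannot fit in a series of 20 samples,
# so split() would raise a ValueError before yielding any folds
print(n_splits, total_test, total_test > n_samples)
```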

Member:

Yep we should return:

return (X.shape[0] - self.max_train_size) // self.test_size
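
Sanity-checking this revised count against the 40-sample example that appears later in this thread (a quick plain-Python check, values assumed from that example):

```python
n_samples, max_train_size, test_size = 40, 10, 3

# revised formula: one fold per test_size-sized step after the first train window
n_splits = (n_samples - max_train_size) // test_size
print(n_splits)  # ten folds, matching the rolling-window output shown below
```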

@MSchmidt99, Nov 5, 2022:

The new equation only works for the case when the minimum train size is the maximum train size. In the docstring for tscv = TimeSeriesSplit(n_splits='walk_forward', max_train_size=10, test_size=3) it is shown that this is not always the case.

For the case in the docstring we calculate (15 - 10) // 3 = 1, though the examples given show that we would expect 4. See transcription below (this is why docstring tests are failing currently):

    Fold 0:
      Train: index=[0 1 2]
      Test:  index=[3 4 5]
    Fold 1:
      Train: index=[0 1 2 3 4 5]
      Test:  index=[6 7 8]
    Fold 2:
      Train: index=[0 1 2 3 4 5 6 7 8]
      Test:  index=[ 9 10 11]
    Fold 3:
      Train: index=[ 2  3  4  5  6  7  8  9 10 11]
      Test:  index=[12 13 14]

The calculation should instead be (X.shape[0] - self.min_train_size - self.gap) // self.test_size, where min_train_size would perhaps default to self.test_size for docstring accuracy. We could also allow setting self.min_train_size at initialization (to allow for behavior similar to #24589), which would let users force each training batch to be of equal size (when min_train_size == max_train_size) for consistent CV scores across parameter hypotheses.

I created a PR with fixes to merge into this branch of ShehanAT's fork at #1
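
The two candidate counts can be compared on the docstring example above (min_train_size is the proposed, hypothetical parameter discussed in this comment, not an existing TimeSeriesSplit argument):

```python
def count_max_based(n_samples, max_train_size, test_size):
    # (X.shape[0] - self.max_train_size) // self.test_size
    return (n_samples - max_train_size) // test_size

def count_min_based(n_samples, min_train_size, test_size, gap=0):
    # (X.shape[0] - self.min_train_size - self.gap) // self.test_size
    return (n_samples - min_train_size - gap) // test_size

# docstring example: 15 samples, max_train_size=10, test_size=3
print(count_max_based(15, 10, 3))  # 1 fold, but the docstring shows 4
print(count_min_based(15, 3, 3))   # 4 folds, with min_train_size defaulting to test_size
```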

Member:

I think that the behaviour that we would expect with walk_forward is to get train sets of constant size. So here, max_train_size is just too large to have more than one fold, and I think this is completely fine.

@glemaitre changed the title from "ENH: Added "walk forward" feature to sklearn.model_selection.TimeSeriesSplit" to "ENH Add option n_splits='walk_forward' in TimeSeriesSplit" on Nov 3, 2022
@glemaitre added this to the 1.2 milestone on Nov 3, 2022
@glemaitre (Member) left a comment:

I pushed fixes and improved the documentation directly.
This seems good on my side.

@thomasjpfan do you mind having a look at this?

@glemaitre removed their assignment Nov 10, 2022
@jeremiedbb (Member):

We won't have time to finish the review on this one before the 1.2 release. Moving it to 1.3

@jeremiedbb modified the milestones: 1.2, 1.3 Nov 24, 2022
@msat59 commented May 25, 2023

I think we shouldn't define n_splits="walk_forward". It could be a new argument, window_method. I provided the code here. It only needs two ifs and a way to get the train_start index.

@thomasjpfan (Member):

@msat59 window_method="rolling" in combination with n_splits makes it harder to reason about. For example, with your implementation, TimeSeriesSplit skips data and is restricted by the n_splits=5 default:

import numpy as np
splitter = TimeSeriesSplit(max_train_size=10, test_size=3, window_method="rolling")
X = np.arange(40)
list(splitter.split(X))

# [(array([15, 16, 17, 18, 19, 20, 21, 22, 23, 24]), array([25, 26, 27])),
# (array([18, 19, 20, 21, 22, 23, 24, 25, 26, 27]), array([28, 29, 30])),
# (array([21, 22, 23, 24, 25, 26, 27, 28, 29, 30]), array([31, 32, 33])),
# (array([24, 25, 26, 27, 28, 29, 30, 31, 32, 33]), array([34, 35, 36])),
# (array([27, 28, 29, 30, 31, 32, 33, 34, 35, 36]), array([37, 38, 39]))]

To include all the data in the split, one needs to manually figure out what n_splits needs to be.

With this PR's walk_forward, the whole dataset is used by default:

import numpy as np
splitter = TimeSeriesSplit(max_train_size=10, test_size=3, n_splits="walk_forward")
X = np.arange(40)
list(splitter.split(X))

# [(array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9]), array([10, 11, 12])),
# (array([ 3,  4,  5,  6,  7,  8,  9, 10, 11, 12]), array([13, 14, 15])),
# (array([ 6,  7,  8,  9, 10, 11, 12, 13, 14, 15]), array([16, 17, 18])),
# (array([ 9, 10, 11, 12, 13, 14, 15, 16, 17, 18]), array([19, 20, 21])),
# (array([12, 13, 14, 15, 16, 17, 18, 19, 20, 21]), array([22, 23, 24])),
# (array([15, 16, 17, 18, 19, 20, 21, 22, 23, 24]), array([25, 26, 27])),
# (array([18, 19, 20, 21, 22, 23, 24, 25, 26, 27]), array([28, 29, 30])),
# (array([21, 22, 23, 24, 25, 26, 27, 28, 29, 30]), array([31, 32, 33])),
# (array([24, 25, 26, 27, 28, 29, 30, 31, 32, 33]), array([34, 35, 36])),
# (array([27, 28, 29, 30, 31, 32, 33, 34, 35, 36]), array([37, 38, 39]))]

@glemaitre I'll take a look at this PR this week.

@msat59 commented May 31, 2023

window_method="rolling" in combination with n_splits makes it harder to reason about.

You are right, but it doesn't make sense to set both n_splits and max_train_size and then expect all data to appear in your splits. It may be reasonable only if you want to use the most recent data in your cross-validation process; even then, it doesn't make sense to use both. Most of the time, you set n_splits (the number of folds) and ignore max_train_size, so you don't care about the number of samples in each split. The focus is on the number of folds (model trainings), mainly to keep cross-validation fast.

Does your implementation work with n_splits=5 only, without specifying max_train_size?

I tested a quick fix, and it seems we can calculate n_splits and override the given value when max_train_size is set. What is the output of your code if the array length is 38, for instance?

X = np.arange(38)

@glemaitre (Member):

@thomasjpfan Do you think that we can merge this one for 1.3?

@thomasjpfan modified the milestones: 1.3, 1.4 Jun 15, 2023
@thomasjpfan (Member):

@glemaitre I do not think so right now. I want to properly evaluate #22523 (comment).

I moved this PR to 1.4

@diegolovison commented Aug 22, 2023

I did the following test:

model = TimeSeriesSplit(max_train_size=11, test_size=3, n_splits="walk_forward")
X: [ 0  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35]
TRAIN: [ 1  2  3  4  5  6  7  8  9 10 11] TEST: [12 13 14]
TRAIN: [ 4  5  6  7  8  9 10 11 12 13 14] TEST: [15 16 17]
TRAIN: [ 7  8  9 10 11 12 13 14 15 16 17] TEST: [18 19 20]
TRAIN: [10 11 12 13 14 15 16 17 18 19 20] TEST: [21 22 23]
TRAIN: [13 14 15 16 17 18 19 20 21 22 23] TEST: [24 25 26]
TRAIN: [16 17 18 19 20 21 22 23 24 25 26] TEST: [27 28 29]
TRAIN: [19 20 21 22 23 24 25 26 27 28 29] TEST: [30 31 32]
TRAIN: [22 23 24 25 26 27 28 29 30 31 32] TEST: [33 34 35]

I am not expecting that.

I would like to have:

0..10 11,12,13
1..11 12,13,14
2..12 13,14,15
....

The above seems like a rolling feature.
What do you think?
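
The behaviour described here — the window advancing one sample per fold — corresponds to a stride-1 rolling split, whereas this PR advances the window by test_size per fold. A hypothetical sketch of that stride-1 variant (not part of the PR):

```python
def stride1_rolling_splits(n_samples, train_size, test_size):
    """Rolling windows that advance by exactly one sample per fold (sketch)."""
    for start in range(n_samples - train_size - test_size + 1):
        train = list(range(start, start + train_size))
        test = list(range(start + train_size, start + train_size + test_size))
        yield train, test

# first folds for 36 samples, train_size=11, test_size=3:
#   [0..10]  [11, 12, 13]
#   [1..11]  [12, 13, 14]
#   ...
for train, test in stride1_rolling_splits(36, 11, 3):
    print("TRAIN:", train, "TEST:", test)
```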

@msat59 commented Aug 24, 2023

The above seems like a rolling feature.

@diegolovison, you are right. If you specify max_train_size, it becomes the rolling window method. Setting max_train_size doesn't make sense when using the expanding window method.

In my opinion, using max_train_size is pointless in the expanding window method, since we want the window to expand. In addition, we want to limit the number of splits via n_splits due to computational power restrictions. As a result, I still believe using n_splits="walk_forward" is wrong. The function should be used like this:

TimeSeriesSplit(n_splits=5, test_size=3, window_method="expanding")
TimeSeriesSplit(n_splits=5, test_size=3, window_method="rolling")

@thomasjpfan, I modified my code after you reported its bug here. So if you use max_train_size, the function ignores the given/default n_splits and the output would be:

splitter = TimeSeriesSplit(max_train_size=10, test_size=3, window_method="rolling")
X = np.arange(40)
list(splitter.split(X))

[(array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9]), array([10, 11, 12])),
 (array([ 3,  4,  5,  6,  7,  8,  9, 10, 11, 12]), array([13, 14, 15])),
 (array([ 6,  7,  8,  9, 10, 11, 12, 13, 14, 15]), array([16, 17, 18])),
 (array([ 9, 10, 11, 12, 13, 14, 15, 16, 17, 18]), array([19, 20, 21])),
 (array([12, 13, 14, 15, 16, 17, 18, 19, 20, 21]), array([22, 23, 24])),
 (array([15, 16, 17, 18, 19, 20, 21, 22, 23, 24]), array([25, 26, 27])),
 (array([18, 19, 20, 21, 22, 23, 24, 25, 26, 27]), array([28, 29, 30])),
 (array([21, 22, 23, 24, 25, 26, 27, 28, 29, 30]), array([31, 32, 33])),
 (array([24, 25, 26, 27, 28, 29, 30, 31, 32, 33]), array([34, 35, 36])),
 (array([27, 28, 29, 30, 31, 32, 33, 34, 35, 36]), array([37, 38, 39]))]

@glemaitre (Member):

Moving the milestone to 1.5 since we did not get time to look at this one.
@thomasjpfan, if you have spare time to take a look at this one, I think we can target the next release.

@glemaitre modified the milestones: 1.4, 1.5 Dec 7, 2023
github-actions bot commented Dec 7, 2023

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 8efb3aa

@jeremiedbb modified the milestones: 1.5, 1.6 May 13, 2024

Successfully merging this pull request may close these issues.

Add rolling window to sklearn.model_selection.TimeSeriesSplit
8 participants