Doc Improve. Replace timeseries_split unit-tests with a simple test #4070

Sahil333 · 2018-01-07T17:59:39Z

No description provided.

karlnapf

This is a nice clean up patch, thanks a lot

I made all sorts of minor comments, let's address them and then this is ready to be merged if travis is happy

karlnapf · 2018-01-11T17:20:19Z

examples/meta/src/evaluation/cross_validation_timeseries_split.sg

@@ -6,16 +6,16 @@ RegressionLabels labels(f_labels)

 #![set parameters]
 int num_subsets = 5
-int min_subset_size = 4
+int future_steps = 4


I'd suggest future_offset but that is minor

karlnapf · 2018-01-11T17:21:15Z

examples/meta/src/evaluation/cross_validation_timeseries_split.sg

 #![set parameters]

 #![build subsets]
 splitting.build_subsets()
 #![build subsets]

-#![generate subsets and inverse (aka test labels and train labels)]
-IntVector test_labels_indices = splitting.generate_subset_indices(1)
+#![generate subsets and inverse (aka vaildation set and train set)]


This is a tag for the cookbooks, not free form comment. So pls make it super short (see other examples)

karlnapf · 2018-01-11T17:21:25Z

src/shogun/evaluation/TimeSeriesSplitting.cpp

 }

 void CTimeSeriesSplitting::build_subsets()
 {
+	REQUIRE(m_num_subsets > 0, "Number of subsets should be greater than 0.");


Missing newline

karlnapf · 2018-01-11T17:23:24Z

src/shogun/evaluation/TimeSeriesSplitting.cpp


-		/* filling current with indices on right end  */
+		/* filling current with indices on right side */
 		for (auto k = split_index; k < indices.vlen; ++k)


auto k : Range(indices.vlen) but it is minor

karlnapf · 2018-01-11T17:24:09Z

src/shogun/evaluation/TimeSeriesSplitting.cpp

-	REQUIRE(min_size > 0, "Minimum subset size should be atleast 1.")
+	/* future_steps should be less than the difference between number of labels
+	 * and split index of second last fold */
+	index_t future_steps_upperbound =


max_future_steps
or better max_future_offset

karlnapf · 2018-01-11T17:24:55Z

src/shogun/evaluation/TimeSeriesSplitting.h

-	 * indices greater than a split index. The split indices are \f$ c[N/K] \f$
+	/** @brief Implements a timeseries splitting strategy for cross-validation,
+	 * respecting time.
+	 * Each fold splits timeseries into train (subset_inverse) and validation


pls remove the (subset_inverse) here. This is documented in the base class
The newlines are awkwards, could you remove them?

karlnapf · 2018-01-11T17:25:30Z

src/shogun/evaluation/TimeSeriesSplitting.h

+	 * (subset) set.
+	 * Train set contains indices less than a split index
+	 * and validation set contains rest of the indices.
+	 * The split indices are \f$ C*floor(N/K) \f$


in latex math, you don't use * for multiplication but \cdot or just nothing

karlnapf · 2018-01-11T17:27:02Z

tests/unit/evaluation/SplittingStrategy_unittest.cc

-		splitting->build_subsets();
-
-		for (index_t i = 0; i < num_subsets; ++i)
+		labels = new CRegressionLabels(num_labels);


labels = some(num_labels); and then you dont have to SG_UNREF later

karlnapf · 2018-01-11T17:28:11Z

tests/unit/evaluation/SplittingStrategy_unittest.cc

-		SG_UNREF(splitting);
+	for (auto i : range(12))
+	{
+		EXPECT_EQ(train_indices[i], i);


yes this is a better test in fact!

vigsterkr · 2018-01-23T07:22:50Z

@Sahil333 any update on this?

stale · 2020-02-26T16:52:11Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale · 2020-03-04T17:29:19Z

This issue is now being closed due to a lack of activity. Feel free to reopen it.

Doc Improve. Replace timeseries_split unit-tests with a simple test

ebe9e63

karlnapf requested changes Jan 11, 2018

View reviewed changes

stale bot added the stale label Feb 26, 2020

stale bot closed this Mar 4, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Doc Improve. Replace timeseries_split unit-tests with a simple test #4070

Doc Improve. Replace timeseries_split unit-tests with a simple test #4070

Sahil333 commented Jan 7, 2018

karlnapf left a comment

karlnapf Jan 11, 2018

karlnapf Jan 11, 2018

karlnapf Jan 11, 2018

karlnapf Jan 11, 2018

karlnapf Jan 11, 2018

karlnapf Jan 11, 2018

karlnapf Jan 11, 2018

karlnapf Jan 11, 2018

karlnapf Jan 11, 2018

vigsterkr commented Jan 23, 2018

stale bot commented Feb 26, 2020

stale bot commented Mar 4, 2020

Doc Improve. Replace timeseries_split unit-tests with a simple test #4070

Doc Improve. Replace timeseries_split unit-tests with a simple test #4070

Conversation

Sahil333 commented Jan 7, 2018

karlnapf left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vigsterkr commented Jan 23, 2018

stale bot commented Feb 26, 2020

stale bot commented Mar 4, 2020