How to calculate the bootstrap error and its confidence interval of a time series data #726

geophysics91 · 2020-09-05T07:37:16Z

Dear experts, I need to calculate the bootstrap error of the 5 time series stored together in one file. Inside the time_series file, the five time series are separated by > > symbols. https://i.fluffy.cc/12NLsqHhTTcvR67btNjRzXZCkbpkfw9c.html Can anybody suggest a good way to do this? I tried http://rasbt.github.io/mlxtend/user_guide/evaluate/bootstrap/#example-1-bootstrapping-the-mean but that example only covers a single time series.

pkaf · 2020-09-15T08:19:20Z

Looking at the implementation of

bootstrap(x, func, num_rounds=1000, ci=0.95, ddof=1, seed=None)

it says x can be (n_samples, [n_columns]), perhaps you need to reshape your data to have this dimension?
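As a rough sketch of what that reshaping could look like: the snippet below splits a multi-segment text into one array per series and, if all series have the same length, stacks them as columns. The file layout is an assumption on my part (separator lines starting with `>`, one sample per line, value in the last whitespace-separated field), and `read_segments` and the inline `raw` text are hypothetical stand-ins, not anything from mlxtend or from the linked file.

```python
import io
import numpy as np

# Hypothetical stand-in for the real file: three short series,
# each introduced by a separator line that starts with '>'.
raw = """\
>
1.0
2.0
3.0
>
4.0
5.0
6.0
>
7.0
8.0
9.0
"""

def read_segments(handle):
    """Split a '>'-separated text stream into a list of 1D float arrays."""
    segments, current = [], []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith('>'):
            # A separator closes the series collected so far (if any).
            if current:
                segments.append(np.array(current, dtype=float))
                current = []
            continue
        # Assumption: the sample value is the last field on the line.
        current.append(float(line.split()[-1]))
    if current:
        segments.append(np.array(current, dtype=float))
    return segments

segments = read_segments(io.StringIO(raw))

# Stacking as columns only makes sense when all series have equal length.
if len({len(s) for s in segments}) == 1:
    x = np.column_stack(segments)  # shape: (n_samples, n_columns)
    print(x.shape)
```

For a real file, `open(path)` would replace the `io.StringIO(raw)` call. If the series have different lengths, keeping them as a list of 1D arrays and treating each one separately is probably simpler than forcing them into one 2D array.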

rasbt · 2020-09-15T18:18:20Z

> it says x can be (n_samples, [n_columns]), perhaps you need to reshape your data to have this dimension?

Yes, @pkaf is correct: x can be either a 1D or a 2D array. Reshaping may not be necessary, though. It depends on what you pass as the func argument. E.g., the numpy mean function can compute the mean for both 1D and 2D arrays, so both

import numpy as np
from mlxtend.evaluate import bootstrap


rng = np.random.RandomState(123)
x = rng.normal(loc=5., size=100)
original, std_err, ci_bounds = bootstrap(x, num_rounds=1000, func=np.mean, ci=0.95, seed=123)
print('Mean: %.2f, SE: +/- %.2f, CI95: [%.2f, %.2f]' % (original, 
                                                        std_err, 
                                                        ci_bounds[0],
                                                        ci_bounds[1]))

and

rng = np.random.RandomState(123)
x = rng.normal(loc=5., size=(100, 2))
original, std_err, ci_bounds = bootstrap(x, num_rounds=1000, func=np.mean, ci=0.95, seed=123)
print('Mean: %.2f, SE: +/- %.2f, CI95: [%.2f, %.2f]' % (original, 
                                                        std_err, 
                                                        ci_bounds[0],
                                                        ci_bounds[1]))

would work.
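One thing to keep in mind with the 2D case: np.mean without an axis argument averages over all entries, so the result is a single number for the whole array, not one per column. If the goal is a separate standard error and confidence interval for each of the five series, one option is to loop over the columns. The sketch below does this in plain NumPy with a small percentile bootstrap. `bootstrap_stat` is a hypothetical helper written for illustration, not mlxtend's implementation, and the data are synthetic.

```python
import numpy as np

def bootstrap_stat(x, func=np.mean, num_rounds=1000, ci=0.95, seed=None):
    """Percentile bootstrap of func(x) for a single 1D series (illustrative)."""
    rng = np.random.RandomState(seed)
    x = np.asarray(x)
    n = x.shape[0]
    replicates = np.empty(num_rounds)
    for i in range(num_rounds):
        idx = rng.randint(0, n, size=n)   # resample indices with replacement
        replicates[i] = func(x[idx])
    std_err = replicates.std(ddof=1)      # spread of the bootstrap replicates
    alpha = (1.0 - ci) / 2.0
    lower, upper = np.percentile(replicates, [100 * alpha, 100 * (1 - alpha)])
    return func(x), std_err, (lower, upper)

# Synthetic data: 5 series of 100 samples each, stored as columns.
rng = np.random.RandomState(123)
series = rng.normal(loc=5., size=(100, 5))

# One bootstrap per column, so each series gets its own SE and CI.
results = [bootstrap_stat(series[:, j], seed=123) for j in range(series.shape[1])]

for j, (original, std_err, ci_bounds) in enumerate(results):
    print('Series %d: mean %.2f, SE: +/- %.2f, CI95: [%.2f, %.2f]'
          % (j, original, std_err, ci_bounds[0], ci_bounds[1]))
```

The same loop should work with mlxtend by calling bootstrap(series[:, j], func=np.mean, ...) for each column. Note that this kind of resampling treats the samples as independent, which may or may not be a reasonable assumption for a given time series.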

You could also handle the reshaping yourself if it is necessary for your func. E.g., like in the example below:

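from mlxtend.data import autompg_data
from mlxtend.evaluate import bootstrap
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score

X, y = autompg_data()


lr = LinearRegression()

def r2_fit(X, model=lr):
    # Column 0 is used as the single feature and column 1 as the target.
    # The feature is reshaped to 2D because scikit-learn expects
    # an (n_samples, n_features) array.
    x, y = X[:, 0].reshape(-1, 1), X[:, 1]
    # Use the model passed in rather than the global lr.
    pred = model.fit(x, y).predict(x)
    return r2_score(y, pred)


original, std_err, ci_bounds = bootstrap(X, num_rounds=1000,
                                         func=r2_fit,
                                         ci=0.95,
                                         seed=123)
print('R2: %.2f, SE: +/- %.2f, CI95: [%.2f, %.2f]' % (original,
                                                      std_err,
                                                      ci_bounds[0],
                                                      ci_bounds[1]))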

rasbt added the Question label Sep 15, 2020

rasbt closed this as completed Feb 8, 2021

Repository owner locked and limited conversation to collaborators Feb 8, 2021

This issue was moved to a discussion.
