a research paper and supplementary materials - software, notebooks
by Martina Hančová, Andrej Gajdoš, Jozef Hanč, Gabriela Vozáriková
martina.hancova@upjs.sk
We revisit and update estimating variances, fundamental quantities in a time series forecasting approach called kriging, in time series models known as FDSLRMs, whose observations can be described by a linear mixed model (LMM).
As a result of applying the convex optimization, we resolved two open problems in FDSLRM research: (1) theoretical existence and equivalence between two standard estimation methods --- least squares estimators, non-negative (M)DOOLSE, and maximum likelihood estimators, (RE)MLE, (2) and a practical lack of free available computational implementation for FDSLRM. As for computing (RE)MLE in the case of n observed time series values, we also discovered a new algorithm of order O(n), which at the default precision is 107 times more accurate and n2 times faster than the best current Python(or R)-based computational packages, namely CVXPY, CVXR, nlme, sommer and mixed.
The LMM framework led us to the proposal of a two-stage estimation method of variance components based on the empirical (plug-in) best linear unbiased predictions of unobservable random components in FDSLRM. The method, providing non-negative estimators with a simple explicit analytic form and performance comparable with REMLE in the Gaussian case, can be used for any absolutely continuous probability distribution of time series data.
We illustrate our results via applications and simulations on three real data sets (electricity consumption, tourism and cyber security), which are easily available, reproducible, sharable and modifiable in the form of interactive Jupyter notebooks.
This is a pre-print of an article published in Statistical Papers. The final revised authenticated version is available online at: https://link.springer.com/article/10.1007%2Fs00362-020-01165-5
A preprint, the first version of the paper is available at https://arxiv.org/abs/1905.07771.
The notebooks folders (Modeling
, PYnotebooks
, Rnotebooks
) contain Python based and R based Jupyter notebooks which are detailed records of our computing
with explaining narratives ilustrating explored concepts and methods.
Notebooks can be studied and viewed statically in Jupyter nbviewer with full visualisation. If there is a need, they can be also viewed directly on Github index.ipynb
, also as a raw code.
For interactive executing notebooks as live documents without any need to install or compile the software, use the following links
Python based notebooks using CVXPY and Scipy
-Python based notebooks using SageMath
-R based notebooks
-
There is also another very comfort alternative, fully tested by us, CoCalc providing interactive computing with our Jupyter notebooks.
All source code is distributed under the MIT license.
The misc
folder contains our previously published papers related to the fdslrm project.
This work was supported by the Slovak Research and Development Agency under the contract No. APVV-17-0568, the Scientific Grant Agency of the Slovak Republic (VEGA), VEGA grant No.1/0311/18 and the Internal Research Grant System of Faculty of Science, P.J. Šafárik University in Košice (VVGS PF UPJŠ) — project VVGS-PF-2018-792.
The developed algorithms and open science digital tools were embedded and became a starting point for computational, simulation and theoretical reseach in our current projects supported by the Slovak Research and Development Agency under the Contract no. APVV-21-0216 and APVV-21-0369.