Using PySR to create an Autoregression Time Series Model #538
Unanswered
michaelbschulte21
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
My friend & I are trying to develop a technique to create descriptive autoregressive models for (multivariate) nonlinear time series data for our master's capstone. I am trying to use PySR/symbolic regression to achieve this. I've noticed PySR becomes unhappy when there are more than 10 features & when there are more than 1000 observations.
Our dataset is large. It has 9 features & contains 48000 observations per feature. Also, each useful lag acts as its own feature, so we have over 100 features when we finally get to training the model. We've thought about dimensionality reduction like PCA, but I feel like that goes against the point of the project which is more about the ability to interpret the model than the model's ability to predict. The hope is tobuild a model that performs somewhere between a linear method like SARIMA & a neural network.
How should PySR be tuned to handle this situation? I feel like you would want to throw a wide net first & retrieve the operators with the most relevance. Then, narrow down the possible operators. Our current approach is to determine intervals of 1000 observations that have a large variance & build a couple of models based on these abridged data sets. Here's an example of our "wide net" tuning settings.
Beta Was this translation helpful? Give feedback.
All reactions