Jupyter Notebooks are wonderful because they let you share code, explanations, and visualizations in the same place, adding narrative to computation. Cells compartmentalize steps and facilitate data analysis, making notebooks an invitation to experiment. If you want to perform analytics or data science tasks on your time series data, a notebook is a great place to start.
Analyze your time series data and experiment with forecasting and anomaly detection algorithms using Jupyter Notebook tutorials (.ipynb files) and corresponding sample data (.csv files). We include tutorials and sample data for the following topics:
- How to get started with InfluxDB Cloud powered by IOx and Pandas with Flight SQL
- Anomaly detection:
  - For multiple time series, including BIRCH, KMEANS, and Median Absolute Deviation (MAD); a minimal MAD sketch follows this list
  - For single time series, including Autoregression, LevelShiftAD, and SeasonalAD
- Forecasting, including FBProphet, LSTM with Keras, and statsmodels' Holt's Method
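As a taste of the anomaly detection material above, here is a minimal sketch of MAD-based outlier flagging on a pandas Series. The threshold and synthetic data are illustrative and not taken from the notebooks.

```python
# A minimal sketch of Median Absolute Deviation (MAD) anomaly detection on a
# pandas Series. The threshold and synthetic data are illustrative only.
import numpy as np
import pandas as pd

def mad_anomalies(series: pd.Series, threshold: float = 3.5) -> pd.Series:
    """Return a boolean mask marking points whose modified z-score exceeds the threshold."""
    median = series.median()
    mad = (series - median).abs().median()
    # 0.6745 scales the MAD so the score is comparable to a standard z-score.
    modified_z = 0.6745 * (series - median) / mad
    return modified_z.abs() > threshold

# Example usage with synthetic data:
values = pd.Series(np.random.normal(0, 1, 500))
values.iloc[100] = 12  # inject an obvious outlier
print(values[mad_anomalies(values)])
```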
These instructions are written for InfluxDB OSS 2.0 (release candidate 0 and later) or InfluxDB Cloud. If you're using InfluxDB Cloud, make sure to change your URL accordingly.
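For example, a minimal sketch using the influxdb-client Python library (not taken from the notebooks; the token, org, and bucket values are placeholders) shows where the URL changes between OSS and Cloud:

```python
# Minimal sketch: the only difference between OSS and Cloud here is the URL.
# Token, org, and bucket values are placeholders.
from influxdb_client import InfluxDBClient

# InfluxDB OSS default:
url = "http://localhost:8086"
# InfluxDB Cloud: use your region's endpoint instead, e.g.
# url = "https://us-west-2-1.aws.cloud2.influxdata.com"

client = InfluxDBClient(url=url, token="my-token", org="my-org")
query_api = client.query_api()

# query_data_frame returns a DataFrame (or a list of DataFrames for multiple tables).
df = query_api.query_data_frame('from(bucket:"my-bucket") |> range(start: -1h)')
print(df)
```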
Python installations can get a bit tricky: different versions of the language, and projects that require different versions of installed libraries, can quickly lead to conflicts. Using a virtual environment is recommended, and additional tooling like virtualenv or pyenv may be useful.
Run `pip install -r requirements.txt` inside your virtual environment to install all of the necessary dependencies.
After cloning this repo, run Jupyter Notebook locally with `jupyter notebook`. This should direct you to the web application, which runs on http://localhost:8888 by default.
After you analyze your time series data with notebooks and select the forecasting or anomaly detection approach that works for you, it's time to implement your solution in production. The following resources could be useful in that next step:
- Using the `http.post()` function in a task together with a serverless compute solution (such as AWS Lambda) to run your code; a minimal handler sketch follows this list.
- Using the Telegraf Execd processor plugin to run an external program. See this example of Machine Learning with the Telegraf Execd processor plugin for more details; a bare-bones execd sketch also follows this list.
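For the serverless route, here is a minimal sketch of an AWS Lambda handler that could sit behind an HTTP endpoint and receive the payload a task sends with `http.post()`. The payload shape, field names, and threshold check are assumptions for illustration, not the repo's approach.

```python
# Minimal sketch of an AWS Lambda handler receiving data posted by an InfluxDB
# task via http.post(). The JSON payload shape and the naive threshold check
# are illustrative assumptions.
import json

def handler(event, context):
    body = json.loads(event.get("body", "{}"))
    values = body.get("values", [])

    # Placeholder for your forecasting or anomaly detection logic.
    flagged = [v for v in values if abs(v) > 3]

    return {
        "statusCode": 200,
        "body": json.dumps({"anomalies": flagged}),
    }
```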
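And for the Telegraf route, a bare-bones sketch of an execd processor program: Telegraf writes each metric to the program's stdin in line protocol and reads (possibly modified) metrics back from stdout. The pass-through logic below is a placeholder, not the approach from the linked example.

```python
# Bare-bones sketch of an external program for the Telegraf execd processor
# plugin: read line protocol from stdin, write (possibly annotated) metrics to
# stdout. The pass-through logic is a placeholder for a real model.
import sys

def main():
    for line in sys.stdin:
        metric = line.rstrip("\n")
        if not metric:
            continue
        # Insert scoring/annotation logic here; this sketch passes metrics through.
        sys.stdout.write(metric + "\n")
        sys.stdout.flush()

if __name__ == "__main__":
    main()
```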
This repo is just a sample of the many algorithms, approaches, and tools for time series forecasting and anomaly detection. Here are additional ML solutions that might interest you:
- STUMPY: A powerful library that efficiently computes the matrix profile of a time series, which can be used for a variety of time series data mining tasks (see the short sketch after this list).
- scikit-multiflow: A machine learning package for streaming data in Python, especially apt for clustering.
- InfluxDB Interpreter for Apache Zeppelin: Apache Zeppelin is another web-based notebook, similar to Jupyter, with built-in Spark integration. Together, Zeppelin and the InfluxDB interpreter enable easy access to and parallelization of large volumes of time series data for quick analysis.
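As a quick taste of STUMPY, here is a short sketch of computing a matrix profile; the window size and synthetic data are illustrative only.

```python
# Short sketch of a STUMPY matrix profile on synthetic data; window size and
# data are illustrative only.
import numpy as np
import stumpy

ts = np.random.normal(0, 1, 1000)
m = 50  # subsequence (window) length
mp = stumpy.stump(ts, m)

# The first column holds the matrix profile values: large values suggest
# discords (potential anomalies), small values suggest repeated motifs.
profile = mp[:, 0].astype(float)
print("Most anomalous subsequence starts at index", int(np.argmax(profile)))
```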