
Merlion dashboard app #129

Merged: 61 commits from the dashboard branch into main on Nov 8, 2022

Conversation

yangwenzhuo08
Contributor

@yangwenzhuo08 yangwenzhuo08 commented Oct 25, 2022

This PR implements a web-based visualization dashboard for Merlion. Users can get it set up by installing Merlion with the optional dashboard dependency, i.e. pip install salesforce-merlion[dashboard]. Then, they can start it up with python -m merlion.dashboard, which will start up the dashboard on port 8050. The dashboard has 3 tabs: a file manager where users can upload CSV files & visualize time series; a forecasting tab where users can try different forecasting algorithms on different datasets; and an anomaly detection tab where users can try different anomaly detection algorithms on different datasets. This dashboard thus provides a no-code interface for users to rapidly experiment with different algorithms on their own data, and examine performance both qualitatively (through visualizations) and quantitatively (through evaluation metrics).

We also provide a Dockerfile which runs the dashboard as a microservice on port 80. The Docker image can be built with docker build . -t merlion-dash -f docker/dashboard/Dockerfile from the Merlion root directory. It can be deployed with docker run -dp 80:80 merlion-dash.

Contributor

@aadyotb aadyotb left a comment


Thanks for the contribution, Wenzhuo! Here are some of my initial comments:

  1. Can you update the PR description to outline what you've done, how the code is organized, and what the major files are for?
  2. Why is the dashboard a separate folder, rather than part of the Merlion package? Specifically, I'm wondering if it's possible to do something like python -m merlion.dashboard instead of python app.py to launch the app server. You could include a subprocess call in merlion/dashboard/__main__.py like this StackOverflow answer (see the sketch after this list). Note that you could list dashboard as an optional dependency in Merlion's setup.py and throw an ImportError in merlion/dashboard/__init__.py if the dashboard dependencies are not installed. If you do this, please also change all relative import paths to absolute paths.
  3. Can you install pre-commit and make sure the formatting & copyright headers are applied to all python files? See here.
  4. Can you provide an overview of this dashboard in the repo's main README.md? I'm thinking you can add this as a new section before "Getting Started". And you can reproduce the same information in docs/source/index.rst.
  5. What is the purpose of test_anomaly.py and test_forecast.py? Seems like they are redundant with existing test coverage. test_models.py makes sense though, since it's testing your new model classes.
  6. Can you move the new tests to the main tests folder instead of dashboard/tests? This also follows from point (2).
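
For reference, here is a minimal sketch of what point (2) could look like. The file layout and the merlion.dashboard.server module / app attribute are assumptions for illustration, not necessarily how this PR ends up organizing the code:

```python
# merlion/dashboard/__init__.py (sketch): fail fast with a helpful message if the
# optional dashboard dependencies are not installed.
try:
    import dash  # noqa: F401
except ImportError as e:
    raise ImportError(
        "Dashboard dependencies are not installed. "
        "Install them with `pip install salesforce-merlion[dashboard]`."
    ) from e

# merlion/dashboard/__main__.py (sketch): lets `python -m merlion.dashboard` start the
# app server directly, instead of requiring `python app.py` from a separate folder.
# (Assumes the Dash app object lives in merlion/dashboard/server.py as `app`.)
from merlion.dashboard.server import app

if __name__ == "__main__":
    app.run_server(host="0.0.0.0", port=8050)
```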

@aadyotb
Contributor

aadyotb commented Oct 27, 2022

@yangwenzhuo08 thanks for your changes! This looks great. I've finished what you started in terms of restructuring the module. Now, merlion.dashboard is fully integrated into Merlion itself. The dashboard's dependencies have been added as optional requirements in setup.py, so the user can install the dashboard with pip install salesforce-merlion[dashboard]. The user may manually start up the dashboard with python -m merlion.dashboard, or serve it via Gunicorn with gunicorn -b 0.0.0.0:80 merlion.dashboard.server:server. Additionally, the dashboard is now able to handle exogenous regressors.

In terms of my original comments, can you add the documentation I requested previously? Besides this, I have a couple of new requests.

  1. Would it be possible for you to unify the train/test interface for anomaly detection and forecasting? I think both tasks should allow the user to either (a) upload separate train/test files, or (b) upload a single file and choose a train/test split.
  2. Can you allow max_forecast_steps = None to be a valid specification? It's actually the default setting for most models and is necessary for long-horizon forecasting.
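
For reference, a small hedged example of the configuration point (2) asks the dashboard to accept, based on Merlion's Arima forecaster (the order argument is arbitrary and only for illustration):

```python
from merlion.models.forecast.arima import Arima, ArimaConfig

# max_forecast_steps=None removes the cap on the forecast horizon; it is the default
# for most Merlion forecasters and is what long-horizon forecasting needs.
model = Arima(ArimaConfig(max_forecast_steps=None, order=(4, 1, 2)))
```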

@yangwenzhuo08
Contributor Author

@aadyotb Thanks for the revision. For the forecasting tab, we can split the train file and test file the same way the anomaly tab does. However, I'm not sure what the best layout is for combining these two UIs (uploading two separate files vs. uploading a single file with a split fraction). Do you have suggestions on the UI design for this part? For forecasting it may be straightforward, e.g., two dropdown lists, one for the train file and the other for the test file, plus a slider to set the split fraction used to split the training data into "train" and "validation". But for anomaly detection, such a split is problematic when the number of labels is small, i.e., the resulting validation dataset may contain no anomalies.

@aadyotb
Contributor

aadyotb commented Oct 28, 2022

@yangwenzhuo08 I envision something like the following: you can have a radio box which can select "use same file for train/test" or "use separate test file". If you select "use same file for train/test", you get the slider where you specify the train/test fraction. If you select "use separate test file", you get a prompt to choose the test file. If you specify "use separate test file", the module should throw an error if the test data is not given. What do you think?

And in terms of anomaly detection, it's kind of a well-known issue that the labels are sparse. The evaluation metrics are implemented in such a way that they have reliable fallback options if there are no true positives present in the data. Maybe you can use the plot_anoms helper function in merlion.plot to plot the ground truth anomalies (if they are specified), and then also report the evaluation metrics on both train and test?
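
A rough sketch of that suggestion, following the plot_anoms usage pattern from Merlion's README; the synthetic data and the choice of DefaultDetector stand in for whatever the dashboard actually loads and trains:

```python
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

from merlion.models.defaults import DefaultDetector, DefaultDetectorConfig
from merlion.plot import plot_anoms
from merlion.utils import TimeSeries

# Synthetic stand-ins for the CSVs the dashboard's file manager would load.
idx = pd.date_range("2022-01-01", periods=500, freq="H")
values = pd.DataFrame({"value": np.random.randn(500).cumsum()}, index=idx)
labels = pd.DataFrame({"anomaly": np.zeros(500, dtype=int)}, index=idx)
labels.iloc[400:410] = 1  # a hand-placed "ground truth" anomaly window

train_data = TimeSeries.from_pd(values.iloc[:300])
test_data = TimeSeries.from_pd(values.iloc[300:])
test_labels = TimeSeries.from_pd(labels.iloc[300:])

# Train a detector, plot its anomaly scores, then shade the ground-truth anomalies
# on the same axes with plot_anoms.
model = DefaultDetector(DefaultDetectorConfig())
model.train(train_data)
fig, ax = model.plot_anomaly(time_series=test_data)
plot_anoms(ax=ax, anomaly_labels=test_labels)
plt.show()
```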

@yangwenzhuo08
Contributor Author

So the layout is like this:

  1. A radio button to select "single file" or "separate"
  2. A dropdown list to select the train file
  3. If "single" is selected, a slider is shown to set the split ratio. If "separate" is selected, a dropdown list is shown for choosing the test file.

Is this OK?

@aadyotb
Contributor

aadyotb commented Oct 28, 2022

Yes, this sounds good.
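
For what it's worth, here is a minimal Dash sketch of the agreed layout; component IDs, labels, and the callback are made up for illustration and are not the dashboard's actual implementation:

```python
from dash import Dash, dcc, html, Input, Output

app = Dash(__name__)

app.layout = html.Div([
    # (1) Radio button: use a single file with a split, or separate train/test files.
    dcc.RadioItems(
        id="file-mode",
        options=[
            {"label": "Single file (train/test split)", "value": "single"},
            {"label": "Separate train & test files", "value": "separate"},
        ],
        value="single",
    ),
    # (2) Dropdown for the train file (options would be filled from uploaded files).
    dcc.Dropdown(id="train-file", options=[], placeholder="Select train file"),
    # (3a) Slider for the train/test split fraction, shown only in single-file mode.
    html.Div(
        dcc.Slider(id="split-fraction", min=0.5, max=0.95, step=0.05, value=0.8),
        id="split-div",
    ),
    # (3b) Dropdown for the test file, shown only in separate-files mode.
    html.Div(
        dcc.Dropdown(id="test-file", options=[], placeholder="Select test file"),
        id="test-div",
    ),
])


@app.callback(
    Output("split-div", "style"),
    Output("test-div", "style"),
    Input("file-mode", "value"),
)
def toggle_inputs(mode):
    # Show the slider for "single", the test-file dropdown for "separate".
    if mode == "single":
        return {}, {"display": "none"}
    return {"display": "none"}, {}


if __name__ == "__main__":
    app.run_server(debug=True, port=8050)
```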

yangwenzhuo08 and others added 22 commits November 4, 2022 15:24
For endogenous variables X and exogenous variables Y, the old implementation of sklearn_base predicted X_t = f(X_{t-1}, Y_{t-1}). Now, we predict X_t = f(X_{t-1}, Y_t), i.e. we actually use the future value of the exogenous regressors.

Now, the user can manually select which features they want to use for multivariate forecasting (instead of just using all non-exogenous features by default).
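
A small illustrative sketch (not Merlion's actual sklearn_base code) of the difference described in the first commit message above, assuming the endogenous and exogenous data arrive as pandas DataFrames indexed by timestamp:

```python
import pandas as pd

# Hypothetical illustration only; names and structure are not Merlion's actual code.
def build_design_matrix(endog: pd.DataFrame, exog: pd.DataFrame, use_future_exog: bool) -> pd.DataFrame:
    """Build features for predicting X_t from lagged endogenous (and exogenous) values."""
    lagged_endog = endog.shift(1).add_suffix("_lag1")  # X_{t-1}
    if use_future_exog:
        # New behavior: X_t = f(X_{t-1}, Y_t), i.e. use the exogenous values at time t.
        features = pd.concat([lagged_endog, exog], axis=1)
    else:
        # Old behavior: X_t = f(X_{t-1}, Y_{t-1}), i.e. lag the exogenous values too.
        features = pd.concat([lagged_endog, exog.shift(1).add_suffix("_lag1")], axis=1)
    return features.dropna()
```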
@aadyotb aadyotb merged commit c0c852e into main Nov 8, 2022
@aadyotb aadyotb deleted the dashboard branch November 8, 2022 16:17