Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE REQUEST] docker and colab example #21

Open
mosheliv opened this issue Oct 4, 2021 · 10 comments
Open

[FEATURE REQUEST] docker and colab example #21

mosheliv opened this issue Oct 4, 2021 · 10 comments

Comments

@mosheliv
Copy link

mosheliv commented Oct 4, 2021

Is your feature request related to a problem? Please describe.
Its difficult to try merlion because of requirements collision. A docker or maintained colab notebook would solve this nicely. Currently merlion installation in colab fails because of pandas/statsmodels collision.

Describe the solution you'd like
A docker or colab notebook that just work.

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.

Additional context
Add any other context or screenshots about the feature request here.

@paulkass
Copy link
Contributor

paulkass commented Oct 5, 2021

Could you elaborate on the colab pandas/statsmodels collisions that you're seeing? If I open a random colab notebook right now and try to play around with Merlion, am I bound to see these issues?

@mosheliv
Copy link
Author

mosheliv commented Oct 5, 2021 via email

@mosheliv
Copy link
Author

mosheliv commented Oct 5, 2021 via email

@mosheliv
Copy link
Author

mosheliv commented Oct 5, 2021 via email

@mosheliv
Copy link
Author

mosheliv commented Oct 5, 2021

I have managed to bypass this by cd to ts_datasets, which is not great.
However, the seatle trail dataset does not have a download function and so the nbeats example fails.

@aadyotb
Copy link
Contributor

aadyotb commented Oct 11, 2021

@mosheliv thank you for surfacing this behavior. A solution to this issue is to use the following code to install:

! if [ ! -d Merlion ]; then git clone https://github.com/salesforce/Merlion; fi
!pip install -e Merlion/
!pip install -e Merlion/ts_datasets/

whereas you have been doing

! if [ ! -d Merlion ]; then git clone https://github.com/salesforce/Merlion; fi
! cd Merlion
! pip install -e .
! pip install -e ts_datasets/

It seems that Python gets confused and tries to import from the package from the ts_datasets directory, rather than ts_datasets/ts_datasets. You can resolve this issue by removing ts_datasets from the immediate path. In other words, do cd .. rather than cd ts_datasets.

Would you mind elaborating on the issue with the SeattleTrail dataset? I understand that there is no download() function, but we have the data in our repo, so you should be able to use the dataset without specifying a rootdir, if you install as suggested above.

@mosheliv
Copy link
Author

mosheliv commented Oct 11, 2021 via email

@mosheliv
Copy link
Author

mosheliv commented Oct 11, 2021 via email

@aadyotb
Copy link
Contributor

aadyotb commented Oct 11, 2021

Thank you @mosheliv! We tested the installation on Mac and Linux environments directly, but hadn't thought to test them in Colab. I can update the installation instructions to be what I described above, as this is more robust overall. A Colab notebook example and/or Dockerfile from you would be very welcome as well.

@aadyotb
Copy link
Contributor

aadyotb commented Oct 18, 2021

PR #38 updates the installation instructions as discussed.

aadyotb added a commit that referenced this issue Oct 18, 2021
….py (#38)

* Use correct delay in benchmark_anomaly (Issue #37)

* Update install instructions (Issue #21 discussion)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants