Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sklearn missing from dvcorg/cml-py3:latest #18

Closed
ivyleavedtoadflax opened this issue Jul 15, 2020 · 8 comments
Closed

Sklearn missing from dvcorg/cml-py3:latest #18

ivyleavedtoadflax opened this issue Jul 15, 2020 · 8 comments

Comments

@ivyleavedtoadflax
Copy link

I'm following the tutorial here, and ran into the following issue:

Run # Your ML workflow goes here
Traceback (most recent call last):
  File "train.py", line 1, in <module>
    from sklearn.ensemble import RandomForestClassifier
ModuleNotFoundError: No module named 'sklearn'
##[error]Process completed with exit code 1.

You can see my attempts to get this to work in this PR: reproducible-analytics#1

Following the tutorial without making any changes brought me to this error: https://github.com/muanalytics/example_cml/runs/875446393

I tweaked the cml.yaml and you'll see that I got it working in the end. I was expecting this to work out of the box though?

@ivyleavedtoadflax
Copy link
Author

ivyleavedtoadflax commented Jul 15, 2020

Ah sorry classically copied cml.yaml from the wrong place:

it should have had the pip install -r ...

@ivyleavedtoadflax
Copy link
Author

Probably is worth correcting the results.txt error in the first cml.yaml code block here: https://github.com/iterative/cml#getting-started

@ivyleavedtoadflax
Copy link
Author

This is the first one:

image

This is the second:

image

Also, love your work! DVC ❤️

ivyleavedtoadflax added a commit to ivyleavedtoadflax/cml that referenced this issue Jul 16, 2020
@elleobrien
Copy link
Contributor

elleobrien commented Jul 17, 2020

Hi @ivyleavedtoadflax, thanks for trying it out! If I follow correctly, the first code block is from the Usage section of the CML project README, and the second is from Getting Started.

The Usage section is intended to demonstrate the simplest possible use case, with some explanation of the big picture. It's not part of the tutorial. Do you think there's anything we could do in the README to make this more clear?

@ivyleavedtoadflax
Copy link
Author

ivyleavedtoadflax commented Jul 17, 2020

Hi @andronovhopf thanks for making great stuff🙏! I think I'd be inclined to at least replicate the pip install... in all places otherwise other users could make the same mistake as me and assume that the docker container already contains all the dependencies required for the tutorial. That said, I should probably have read the whole thing first 🙊

@elleobrien
Copy link
Contributor

You're right, leaving it out could confuse people about what's in the Docker container. I'm updating the code examples in the README so pip install... is always present.

Glad you pointed it out! :)

@elleobrien
Copy link
Contributor

elleobrien commented Jul 17, 2020

Addressed and closing the issue (see commit here). Please reopen or make a new issue if you have any other trouble!

@ivyleavedtoadflax
Copy link
Author

Thanks @andronovhopf

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants