-
Notifications
You must be signed in to change notification settings - Fork 85
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question on the list of dependencies #93
Comments
Some metrics to support this issue: macOS 10.15.7 $ pip install model_card_toolkit --no-cache-dir --use-deprecated=legacy-resolve
70.43s user 31.26s system 18% cpu 9:13.00 total 9 minutes is a very long time to install a package. The size of |
Hi, We currently are experimenting with model cards and are using them as summary reports for sklearn models trained on vertex ai on GCP. We wanted to understand by when we could expect a refined package dependency list - currently the training jobs are unable to move past environment setup (due to model card toolkit dependencies) with a lot of time being taken by pip to determine the right versions to install. Here is the package list from our setup.py file to create the vertex training package (custom code option): REQUIRED_PACKAGES = ['pandas-gbq>=0.10.0', This runs on top of the europe-docker.pkg.dev/vertex-ai/training/tf-cpu.2-6:latest container for tensorflow 2.6 ML framework version. Scanning the training job logs, we see many of the following messages - INFO: This is taking longer than usual. You might need to provide the dependency resolver with stricter constraints to reduce runtime. If you want to abort this run, you can press Ctrl + C to do so. To improve how pip performs, tell us what happened here: https://pip.pypa.io/surveys/backtracking Installing the model card toolkit library on a jupyter notebook is a 1 time step and works fine, however. |
@vishwanath-prudhivi are you passing the |
@amadeuspzs thanks for the suggestion. So when we build a training package , we call the command :- pip3 install --user <training_package_name>.tar.gz Any suggestions on how to add additional flags (as mentioned in the previous suggestion) here would be helpful Regards |
Hi all, we're in the process of removing the @adrinjalali, we opened up the discussion topic #228 to brainstorm how to approach loosening up the dependencies and what the new dependency list will look like. I'd love to learn more about your use case and what an ideal workflow would look like for you. 😄 |
After removing
I did some testing, and I think the minimal requirements to support core functionality with little refactoring are:
Here, core functionality means creating Depending on how TensorFlow docs are built, it might be possible to move Model Card Toolkit is now a community-led open source project under the TFX Addons special interest group. (Learn more in this announcement.) The project now depends on community contributions, bug fixes, and documentation. This means the timeline for a creating a basic |
Right now the dependencies on
install_requires
included packages which are not necessarily a hard dependency, depending on what the user would like to do.For instance, if the user would like to have a minimal environment doing machine learning where they use frameworks other than tensorflow, this package would still pull tensorflow and many other packages in their environment.
I was wondering if you'd be open to the idea of making dependencies soft dependencies as much as possible, and only tell users in the documentation which dependencies are optional for which parts of the library, and also tell them they need extra libraries if they call specific functions of the library.
This is the list of packages pulled by
pip
on a fresh environment, which admittedly is quite a long list:The text was updated successfully, but these errors were encountered: