Darts lite/deployment version #1891

lowickert · 2023-07-11T12:44:15Z

Is your feature request related to a current problem? Please describe.
I'm currently deploying an application build with darts in a docker container. During the building process of the image I notice that darts depends on a lot of libraries, that might be important in the development process but bloat the deployment container. For example I dont need jupyter in an application running models on a server.

Describe proposed solution
It would be great to have a basic light / deployment version of darts covering only the core functionalities (trained models, ad, timeseries, etc..). Ideally, other functionality could be installed separately, I think the dask-library is doing something similar. If I overlooked something like this and that already exists / was already discussed, sorry.

Describe potential alternatives
A "quick" fix could be a prepacked official docker darts image so that all packages for darts do not need to be installed manually in a dockerfile but could be loaded as one from the docker-hub.

dennisbader · 2023-07-11T15:46:49Z

We currently have a PR that introduces a lighter Dockerfile (altough only adressing the jupyter dependency). See #1878

alexcolpitts96 · 2023-07-15T17:46:29Z

@lowickert I have been using darts in a research and production environment for roughly the last 18 months (I think?). It is tough for the image to be large, but it is also hard to strip out dependencies without breaking darts.

The biggest culprits are pytorch, catboost, and xgboost since they are all relatively large packages. It might be interesting to investigate splitting the requirements files into more sub-files. Torch is already split out of the core requirements (requirements/torch.txt), but having further segmentation might help with flexibility and build size.

If you are trying to keep your image size smaller (or at least what you have to build), try using a base image like NVIDIA base images here. Most of the big requirements are already covered.

What are your deployment configuration? Are you spinning up a REST API and handling inference that way or are you trying to spin up a new container for each prediction? If you are trying to do the latter then you will run into a lot of performance issues with bigger images while the former will be performant even if the image is large.

lowickert · 2023-07-15T20:31:45Z

@lowickert I have been using darts in a research and production environment for roughly the last 18 months (I think?). It is tough for the image to be large, but it is also hard to strip out dependencies without breaking darts.

The biggest culprits are pytorch, catboost, and xgboost since they are all relatively large packages. It might be interesting to investigate splitting the requirements files into more sub-files. Torch is already split out of the core requirements (requirements/torch.txt), but having further segmentation might help with flexibility and build size.

If you are trying to keep your image size smaller (or at least what you have to build), try using a base image like NVIDIA base images here. Most of the big requirements are already covered.

What are your deployment configuration? Are you spinning up a REST API and handling inference that way or are you trying to spin up a new container for each prediction? If you are trying to do the latter then you will run into a lot of performance issues with bigger images while the former will be performant even if the image is large.

Yes, in the end I did it as you described. We have the model deployed using a fast api restapi with docker containers building up on a pytorch image. That worked quite well.

madtoinou · 2023-08-07T14:44:29Z

thanks @alexcolpitts96 for the insights about your environment setup.

@lowickert, darts 0.25.0 (released today) requirements were updated and Catboost, LightGBM and Prophet became optional dependencies. I hope that it will help a little bit with the size of the image.

Keeping this open, hopefully the docker image without the jupyter dependency won't break the workflow of too many users.

lowickert added the triage Issue waiting for triaging label Jul 11, 2023

madtoinou added devops CI/CD, packaging, code maintenance, ... and removed triage Issue waiting for triaging labels Jul 12, 2023

madtoinou linked a pull request Aug 7, 2023 that will close this issue

Feat/Dockerfile test environment #1878

Merged

dennisbader closed this as completed in #1878 Aug 31, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Darts lite/deployment version #1891

Darts lite/deployment version #1891

lowickert commented Jul 11, 2023

dennisbader commented Jul 11, 2023 •

edited

alexcolpitts96 commented Jul 15, 2023

lowickert commented Jul 15, 2023

madtoinou commented Aug 7, 2023

Darts lite/deployment version #1891

Darts lite/deployment version #1891

Comments

lowickert commented Jul 11, 2023

dennisbader commented Jul 11, 2023 • edited

alexcolpitts96 commented Jul 15, 2023

lowickert commented Jul 15, 2023

madtoinou commented Aug 7, 2023

dennisbader commented Jul 11, 2023 •

edited