Merlin inference container size too big #90

sohn21c · 2022-01-19T05:24:24Z

Merlin inference container size, when built per https://github.com/NVIDIA-Merlin/Merlin/blob/main/docker/dockerfile.tri, is ~20GB which is big where some cloud services require container size to be limited under 10 or 15GB. A potential resolution could be to create inference containers for different framework using tf2, pyt triton base image instead of triton-py3 that contains all of backends i.e. Tensorflow, PyTorch, TensorRT, ONNX and OpenVINO.

kylemcmearty · 2022-01-24T15:58:26Z

@sohn21c Which cloud provider did you try this on?
Looks like Azure is 15 GB like you mentioned.

sohn21c · 2022-02-08T19:30:41Z

@kmcmearty GCP

rnyak · 2022-02-09T20:10:33Z

@albert17 @jperez999 for viz.

albert17 · 2022-02-10T00:22:45Z

@kmcmearty @sohn21c Working on fixing this. Created a PR

albert17 linked a pull request Feb 10, 2022 that will close this issue

Reduce containers size #100

Merged

albert17 self-assigned this Feb 14, 2022

benfred mentioned this issue Feb 22, 2022

[RMP] Container size reduction #112

Closed

4 tasks

jperez999 closed this as completed in #100 Feb 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merlin inference container size too big #90

Merlin inference container size too big #90

sohn21c commented Jan 19, 2022

kylemcmearty commented Jan 24, 2022

sohn21c commented Feb 8, 2022

rnyak commented Feb 9, 2022

albert17 commented Feb 10, 2022

Merlin inference container size too big #90

Merlin inference container size too big #90

Comments

sohn21c commented Jan 19, 2022

kylemcmearty commented Jan 24, 2022

sohn21c commented Feb 8, 2022

rnyak commented Feb 9, 2022

albert17 commented Feb 10, 2022