New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Best practices for using Tensorflow Serving in production #40
Comments
Good questions. I'll try to answer each: Regarding packaging, the anticipated use-case of TensorFlow Serving is that a production user would have a separate training pipeline and serving system. The handoff from training to serving only requires an export (see session_bundle/exporter.py). So in that use case, you shouldn't need to bundle any python files, or (not including the export) any proto files. Regarding a thinner footprint, much of these are standard TensorFlow dependencies. There are some efforts, e.g. mobile / android support, to shrink the deployed package sizes. At the moment, TensorFlow Serving is focusing on server-side usage where resources are less scarce, but open to feedback on use-cases. For statusz/healthz style interfaces I think this is a hybrid of a gRPC and TensorFlow Serving. I'll answer for TensorFlow Serving, where we are thinking about adding something like a /servablez that would show the status of each model. This is in early thinking / a not yet prioritized state so both feedback and contributions are welcome. You can see some related groundwork for this in recent additions to servable_state_monitor. |
(feel free to re-open) |
Thanks for the answers, @nfiedel - some comments. (PS, I can't reopen this issue for some reason).
I was wondering if there was a way to link only This is shown here - tensorflow/tensorflow#695 - perhaps it can be leveraged?
|
Hi viksit@ |
I have some queries around some real world, production deployments of the system.
Lastly, the example folders contain a bunch of pb2.py files, for which the tutorials don't seem to talk about - for instance, installing grpc, and protobuf3 c++/python, and how to use protoc to compile them into a service definition. Would be good to have for those who aren't familiar with grpc.
The text was updated successfully, but these errors were encountered: