v0.8.0

@yuzisun yuzisun released this 15 Feb 07:36
· 569 commits to master since this release
4814664

🌈 What's New?

This release introduces two new CRDs, ServingRuntime and ClusterServingRuntime. The only difference between the two is that the former is namespace-scoped and the latter is cluster-scoped. A ServingRuntime defines the templates for Pods that can serve one or more particular model formats. Each ServingRuntime defines key information such as the container image of the runtime and a list of the model formats that the runtime supports.

In previous versions of KServe, supported predictor formats and container images were defined in a ConfigMap in the control plane namespace. The ServingRuntime CRD allows greater flexibility and extensibility: you can define or customize runtimes as you see fit without modifying any controller code or any resources in the controller namespace.

Several out-of-the-box ClusterServingRuntimes are provided with KServe so that users can continue to use KServe as they did before without having to define the runtimes themselves.
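
To make the shape of a ServingRuntime concrete, here is a minimal Python sketch of a manifest expressed as a dict, plus a small helper that finds the runtimes declaring support for a given model format. The field names (`supportedModelFormats`, `containers`, `autoSelect`) reflect our understanding of the `serving.kserve.io/v1alpha1` schema and should be checked against the installed CRD. The runtime name and image are placeholders, and `runtimes_for_format` is a hypothetical helper for illustration, not the controller's actual selection logic.

```python
# Illustrative manifest only: the name, image, and format entries are placeholders.
example_runtime = {
    "apiVersion": "serving.kserve.io/v1alpha1",
    "kind": "ServingRuntime",  # "ClusterServingRuntime" for the cluster-scoped variant
    "metadata": {"name": "example-sklearn-runtime"},
    "spec": {
        # Model formats this runtime declares it can serve.
        "supportedModelFormats": [
            {"name": "sklearn", "version": "1", "autoSelect": True},
        ],
        # Container template used for the serving Pods.
        "containers": [
            {"name": "kserve-container", "image": "example.com/sklearnserver:latest"},
        ],
    },
}


def runtimes_for_format(runtimes, model_format):
    """Return the names of runtimes whose supportedModelFormats include model_format."""
    matches = []
    for runtime in runtimes:
        formats = runtime["spec"].get("supportedModelFormats", [])
        if any(fmt["name"] == model_format for fmt in formats):
            matches.append(runtime["metadata"]["name"])
    return matches
```

Dumping such a dict to YAML and applying it in a namespace would, under these assumptions, make the runtime available there without touching the controller namespace.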

⚠️ What's Changed

  • Rename KF-prefixed Python SDK classes by @markwinter in #1951
    KFModel -> Model
    KFServer -> ModelServer
    KFModelRepository -> ModelRepository
  • KServe's pytorchserver has been deprecated; for PyTorch models, KServe now defaults to the TorchServe serving runtime.
  • The ONNX runtime server has been deprecated; for ONNX models, KServe now defaults to Triton Inference Server.
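
For code that still uses the old SDK class names, the renames listed above can be applied mechanically. The following is a rough sketch, assuming the old names appear as whole identifiers in the source text; `migrate_source` is a hypothetical helper written for this note, not part of the KServe SDK.

```python
import re

# Old -> new class names, taken from the rename list in this release.
RENAMES = {
    "KFModelRepository": "ModelRepository",
    "KFModel": "Model",
    "KFServer": "ModelServer",
}

# Word boundaries keep "KFModel" from matching inside "KFModelRepository".
_PATTERN = re.compile(r"\b(" + "|".join(map(re.escape, RENAMES)) + r")\b")


def migrate_source(source):
    """Replace each old KF-prefixed class name in source with its new name."""
    return _PATTERN.sub(lambda match: RENAMES[match.group(1)], source)
```

For example, `migrate_source("class MyModel(kserve.KFModel):")` yields `"class MyModel(kserve.Model):"`. A plain text substitution like this also rewrites matches in comments and strings, so review the resulting diff before committing.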

⬆️ Version upgrades

🐞 Fixes

Full Changelog: v0.7.0...v0.8.0