KServe 2023 Roadmap

Objective: "Graduate core inference capability to stable/GA"

Promote InferenceService and ClusterServingRuntime/ServingRuntime CRD from v1beta1 to v1
- Improve InferenceService CRD for REST/gRPC protocol interface
- Unify model storage spec and implementation between KServe and ModelMesh
- Add Status to ServingRuntime for both ModelMesh and KServe, surface ServingRuntime validation errors and deployment status
- Deprecate TrainedModel CRD and use InferenceService annotation to allow dynamic model updates as alternative option to storage initializer
- Collocate transformer and predictor in the pod to reduce sidecar resources and networking latency
- Stablize RawDeployment mode with comprehensive testing for supported features
All model formats to support v2 inference protocol including custom serving runtime
- TorchServe to support v2 gRPC inference protocol
- Support batching for v2 inference protocol
- Transformer and Explainer v2 inference protocol interoperability
- Improve codec for v2 inference protocol

Add ModelMesh docs and explain the use cases for classic KServe and ModelMesh
Unify the data plane v1 and v2 page formats
Improve v2 data plane docs to tell the story why and what changed
Clean up the examples in kserve repo and unify them with the website's by creating one source of truth for example documentation
Update any out-of-date documentation and make sure the website as a whole is consistent and cohesive