Skip to content

InferenceModel status conditions update #380

Open
@ahg-g

Description

@ahg-g

InferenceModel defines a Accepted condition type with three possible reasons:

  • Pending, which is the default when the object is created
  • ModelNameInUse, which is set if the ModelName is used by another InferenceModel
  • Accepted, which is set when the model conforms to the state of the InferencePool it references.

The question here is if we want the epp to populate this condition, or some other component like the gateway controller.

As of right now, there is no reason for the gateway controller to be aware of the InferenceModel API, it only cares about the InferencePool to establish the connection between the proxy and the epp.

One problem with having the epp update the condition is that it may run in HA active-active mode, which complicates synchronizing status updates. It also adds the requirement on other epp implementations to do so.

This issue is to open and track the discussion on this topic, and bring awareness that the conditions are currently not being updated by any component.

Metadata

Metadata

Assignees

No one assigned

    Labels

    lifecycle/staleDenotes an issue or PR has remained open with no activity and has become stale.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions