generated from kubernetes/kubernetes-template-project
-
Notifications
You must be signed in to change notification settings - Fork 206
Closed
Labels
triage/acceptedIndicates an issue or PR is ready to be actively worked on.Indicates an issue or PR is ready to be actively worked on.
Milestone
Description
#1324 simplified the InferencePool EPP config by removing the type inlining. Since ExtensionRef is required, it should not be a pointer or include the omitempty tag:
// Extension configures an endpoint picker as an extension service.
//
// +kubebuilder:validation:Required
ExtensionRef *Extension `json:"extensionRef,omitempty"`When the API was first created, we discussed the idea of supporting the EPP extension as a sidecar container, e.g. UDS socket. If we still plan to support this use case, we should determine the configuration details. Here are a few initial thoughts:
- We can keep
ExtensionRefoptional, which would indicate using a UDS socket for the Gateway<>EPP connection. However, the socket path, e.g.,/tmp/uds/my_socket.sock, between the two would not be configurable, andFailureModewould need to reside elsewhere. - We make
ExtensionRefrequired and makeExtensionRef.Nameoptional, which a""value would indicate no Service reference and instead use a UDS socket for the Gateway<>EPP connection. - Add a union type to
ExtensionRefto indicate the connection/reference type and defaults to a GKN reference.
If we do not intend to support the sidecar use case, then update ExtensionRef as follows:
// ExtensionRef is a reference to the Endpoint Picker (EPP) Extension.
//
// +kubebuilder:validation:Required
ExtensionRef Extension `json:"extensionRef"`Note that ExtensionRef implies the only extension supported by an InferencePool is a single EPP extension.
Metadata
Metadata
Assignees
Labels
triage/acceptedIndicates an issue or PR is ready to be actively worked on.Indicates an issue or PR is ready to be actively worked on.